Old Age
Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models
October 23, 2023 · Entered Twilight · Conference on Empirical Methods in Natural Language Processing
Repo contents: .gitignore, README.md, asqa, assets, bing_search.py, dsp, environment.yml, get_wiki.py, run_toc.py, toc, utils.py
Authors
Gangwoo Kim, Sungdong Kim, Byeongguk Jeon, Joonsuk Park, Jaewoo Kang
arXiv ID
2310.14696
Category
cs.CL: Computation & Language
Citations
52
Venue
Conference on Empirical Methods in Natural Language Processing
Repository
https://github.com/gankim/tree-of-clarifications
⭐ 54
Last Checked
1 month ago
Abstract
Questions in open-domain question answering are often ambiguous, allowing multiple interpretations. One approach to handling them is to identify all possible interpretations of the ambiguous question (AQ) and to generate a long-form answer addressing them all, as suggested by Stelmakh et al. (2022). While this provides a comprehensive response without bothering the user for clarification, considering multiple dimensions of ambiguity and gathering the corresponding knowledge remains a challenge. To cope with this challenge, we propose a novel framework, Tree of Clarifications (ToC): it recursively constructs a tree of disambiguations for the AQ -- via few-shot prompting leveraging external knowledge -- and uses it to generate a long-form answer. ToC outperforms existing baselines on ASQA in a few-shot setup across the metrics, while surpassing fully-supervised baselines trained on the whole training set in terms of Disambig-F1 and Disambig-ROUGE. Code is available at https://github.com/gankim/tree-of-clarifications.
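The abstract describes ToC only at a high level: recursively expand an ambiguous question into disambiguated sub-questions using retrieval-grounded few-shot prompts, then aggregate the tree into one long-form answer. Below is a minimal, hypothetical Python sketch of that expand-and-aggregate loop. The helper names (`llm`, `retrieve_passages`) and the inline prompts are illustrative assumptions, not the repository's actual API; the real implementation lives in run_toc.py and the dsp/toc modules of the repo linked above.

```python
# Minimal sketch of the Tree of Clarifications (ToC) idea from the abstract.
# `llm` and `retrieve_passages` are hypothetical stand-ins (assumptions),
# NOT the repo's API; prompts are placeholders for the paper's few-shot prompts.

from dataclasses import dataclass, field


@dataclass
class Node:
    question: str                      # a (possibly still ambiguous) question
    answer: str = ""                   # short answer for this disambiguation
    children: list["Node"] = field(default_factory=list)


def llm(prompt: str) -> str:
    """Placeholder for a few-shot-prompted LLM call (assumption)."""
    raise NotImplementedError


def retrieve_passages(query: str, k: int = 5) -> list[str]:
    """Placeholder for external-knowledge retrieval, e.g. web search (assumption)."""
    raise NotImplementedError


def expand(node: Node, depth: int, max_depth: int = 2) -> None:
    """Recursively ask the LLM for disambiguations of the node's question,
    grounding each step in retrieved passages."""
    if depth >= max_depth:
        return
    passages = "\n".join(retrieve_passages(node.question))
    # Ask for one disambiguated question per line (few-shot examples elided).
    reply = llm(f"Passages:\n{passages}\n\nList disambiguations of: {node.question}")
    for line in filter(None, map(str.strip, reply.splitlines())):
        child = Node(question=line)
        child.answer = llm(f"Passages:\n{passages}\n\nAnswer concisely: {line}")
        node.children.append(child)
        expand(child, depth + 1, max_depth)


def answer_ambiguous_question(aq: str) -> str:
    """Build the disambiguation tree, then fuse it into a long-form answer."""
    root = Node(question=aq)
    expand(root, depth=0)
    # Flatten the tree into question-answer pairs for the final generation step.
    pairs, stack = [], root.children[:]
    while stack:
        n = stack.pop()
        pairs.append(f"Q: {n.question} A: {n.answer}")
        stack.extend(n.children)
    return llm("Write a long-form answer covering:\n" + "\n".join(pairs))
```

The depth cap and one-pass aggregation are simplifications; the paper additionally prunes unhelpful branches, a step omitted here for brevity.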
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
Similar Papers
In the same crypt · Computation & Language
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
R.I.P. · 👻 Ghosted
Language Models are Few-Shot Learners
R.I.P. · 👻 Ghosted
RoBERTa: A Robustly Optimized BERT Pretraining Approach
R.I.P. · 👻 Ghosted
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
R.I.P. · 👻 Ghosted