Knowledge Boundary of Large Language Models: A Survey

December 17, 2024 · The Cartographer · 🏛 Annual Meeting of the Association for Computational Linguistics

"No code URL or promise found in abstract"
"Title-pattern auto-detect: Knowledge Boundary of Large Language Models: A Survey"

Evidence collected by the PWNC Scanner

Authors Moxin Li, Yong Zhao, Wenxuan Zhang, Shuaiyi Li, Wenya Xie, See-Kiong Ng, Tat-Seng Chua, Yang Deng arXiv ID 2412.12472 Category cs.CL: Computation & Language Citations 31 Venue Annual Meeting of the Association for Computational Linguistics Last Checked 7 days ago

Abstract

Although large language models (LLMs) store vast amount of knowledge in their parameters, they still have limitations in the memorization and utilization of certain knowledge, leading to undesired behaviors such as generating untruthful and inaccurate responses. This highlights the critical need to understand the knowledge boundary of LLMs, a concept that remains inadequately defined in existing research. In this survey, we propose a comprehensive definition of the LLM knowledge boundary and introduce a formalized taxonomy categorizing knowledge into four distinct types. Using this foundation, we systematically review the field through three key lenses: the motivation for studying LLM knowledge boundaries, methods for identifying these boundaries, and strategies for mitigating the challenges they present. Finally, we discuss open challenges and potential research directions in this area. We aim for this survey to offer the community a comprehensive overview, facilitate access to key issues, and inspire further advancements in LLM knowledge research.