Related papers: Knowledge-Augmented Language Model Verification

200 papers

Despite their success at many natural language processing (NLP) tasks, large language models still struggle to effectively leverage knowledge for knowledge-intensive tasks, manifesting limitations such as generating incomplete, non-factual,…

Computation and Language · Computer Science 2024-10-03 Yougang Lyu, Lingyong Yan, Shuaiqiang Wang, Haibo Shi, Dawei Yin, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren

Large Language Models (LLMs) store an extensive amount of factual knowledge obtained from vast collections of text. To effectively utilize these models for downstream tasks, it is crucial to have reliable methods for measuring their…

Computation and Language · Computer Science 2023-06-13 Pouya Pezeshkpour

Large Language Models (LLMs) have garnered significant attention due to their remarkable ability to process information across various languages. Despite their capabilities, they exhibit inconsistencies in handling identical queries in…

Computation and Language · Computer Science 2024-06-24 Yue Huang, Chenrui Fan, Yuan Li, Siyuan Wu, Tianyi Zhou, Xiangliang Zhang, Lichao Sun

Large Language Models (LLMs) have shown great potential in Natural Language Processing (NLP) tasks. However, recent literature reveals that LLMs generate nonfactual responses intermittently, which impedes the LLMs' reliability for further…

Computation and Language · Computer Science 2024-03-22 Yukun Zhao, Lingyong Yan, Weiwei Sun, Guoliang Xing, Chong Meng, Shuaiqiang Wang, Zhicong Cheng, Zhaochun Ren, Dawei Yin

Autoregressive large language models (LLMs) pre-trained by next token prediction are inherently proficient in generative tasks. However, their performance on knowledge-driven tasks such as factual knowledge querying remains unsatisfactory.…

Computation and Language · Computer Science 2026-01-14 Peng Yu, Cheng Deng, Beiya Dai, Xinbing Wang, Ying Wen

Recent large language models (LLMs) achieve strong performance in generating promising reasoning paths for complex tasks. However, despite powerful generation ability, LLMs remain weak at verifying their own answers, revealing a persistent…

Computation and Language · Computer Science 2026-02-10 Yuxin Chen, Yu Wang, Yi Zhang, Ziang Ye, Zhengzhou Cai, Yaorui Shi, Qi Gu, Hui Su, Xunliang Cai, Xiang Wang, An Zhang, Tat-Seng Chua

Problem-solving has been a fundamental driver of human progress in numerous domains. With advancements in artificial intelligence, Large Language Models (LLMs) have emerged as powerful tools capable of tackling complex problems across…

Machine Learning · Computer Science 2025-05-07 Da Zheng, Lun Du, Junwei Su, Yuchen Tian, Yuqi Zhu, Jintian Zhang, Lanning Wei, Ningyu Zhang, Huajun Chen

Self-correction has emerged as a promising solution to boost the reasoning performance of large language models (LLMs), where LLMs refine their solutions using self-generated critiques that pinpoint the errors. This work explores whether…

Computation and Language · Computer Science 2024-06-07 Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, Lu Wang

Knowledge editing methods (KEs) can update language models' obsolete or inaccurate knowledge learned from pre-training. However, KEs can be used for malicious applications, e.g., inserting misinformation and toxic content. Knowing whether a…

Computation and Language · Computer Science 2025-02-11 Paul Youssef, Zhixue Zhao, Christin Seifert, Jörg Schlötterer

Large language models (LLMs) have demonstrated strong capabilities in text understanding and generation. However, they often lack factuality, producing a mixture of true and false information, especially in long-form generation. In this…

Computation and Language · Computer Science 2025-09-26 Lifu Tu, Rui Meng, Shafiq Joty, Yingbo Zhou, Semih Yavuz

Large Language Models (LLMs) exhibit emerging in-context learning abilities through prompt engineering. The recent progress in large-scale generative models has further expanded their use in real-world language applications. However, the…

Computation and Language · Computer Science 2024-04-12 Linyi Yang, Shuibai Zhang, Zhuohao Yu, Guangsheng Bao, Yidong Wang, Jindong Wang, Ruochen Xu, Wei Ye, Xing Xie, Weizhu Chen, Yue Zhang

Knowledge augmentation has significantly enhanced the performance of Large Language Models (LLMs) in knowledge-intensive tasks. However, existing methods typically operate on the simplistic premise that model performance equates with…

Computation and Language · Computer Science 2026-02-16 Hao Chen, Ye He, Yuchun Fan, Yukun Yan, Zhenghao Liu, Qingfu Zhu, Maosong Sun, Wanxiang Che

Large Language Models (LLMs) augmented with retrieval mechanisms have demonstrated significant potential in fact-checking tasks by integrating external knowledge. However, their reliability decreases when confronted with conflicting…

Computation and Language · Computer Science 2025-05-26 Ziyu Ge, Yuhao Wu, Daniel Wai Kit Chin, Roy Ka-Wei Lee, Rui Cao

Given varying prompts regarding a factoid question, can a large language model (LLM) reliably generate factually correct answers? Existing LLMs may generate distinct responses for different prompts. In this paper, we study the problem of…

Computation and Language · Computer Science 2023-10-31 Qingxiu Dong, Jingjing Xu, Lingpeng Kong, Zhifang Sui, Lei Li

Large language models (LLMs) have shown impressive prowess in solving a wide range of tasks with world knowledge. However, it remains unclear how well LLMs are able to perceive their factual knowledge boundaries, particularly under…

Computation and Language · Computer Science 2024-11-20 Ruiyang Ren, Yuhao Wang, Yingqi Qu, Wayne Xin Zhao, Jing Liu, Hao Tian, Hua Wu, Ji-Rong Wen, Haifeng Wang

Although achieving great success, Large Language Models (LLMs) usually suffer from unreliable hallucinations. Although language attribution can be a potential solution, there are no suitable benchmarks and evaluation metrics to attribute…

Computation and Language · Computer Science 2024-05-24 Xinze Li, Yixin Cao, Liangming Pan, Yubo Ma, Aixin Sun

Verifiable generation aims to let the large language model (LLM) generate text with supporting documents, which enables the user to flexibly verify the answer and makes the LLM's output more reliable. Retrieval plays a crucial role in…

Computation and Language · Computer Science 2024-03-28 Xiaonan Li, Changtai Zhu, Linyang Li, Zhangyue Yin, Tianxiang Sun, Xipeng Qiu

Recent advancements in test time compute, particularly through the use of verifier models, have significantly enhanced the reasoning capabilities of Large Language Models (LLMs). This generator-verifier approach closely resembles the…

Machine Learning · Computer Science 2024-10-11 Jianing Qi, Hao Tang, Zhigang Zhu

Retrieval-Augmented Generation (RAG) systems have shown promise in enhancing the performance of Large Language Models (LLMs). However, these systems face challenges in effectively integrating external knowledge with the LLM's internal…

Large language models appear to learn facts from the large text corpora they are trained on. Such facts are encoded implicitly within their many parameters, making it difficult to verify or manipulate what knowledge has been learned.…

Computation and Language · Computer Science 2022-10-27 Yifan Hou, Wenxiang Jiao, Meizhen Liu, Carl Allen, Zhaopeng Tu, Mrinmaya Sachan