Related papers: Knowledge-Augmented Language Model Verification

KnowTuning: Knowledge-aware Fine-tuning for Large Language Models

Despite their success at many natural language processing (NLP) tasks, large language models still struggle to effectively leverage knowledge for knowledge-intensive tasks, manifesting limitations such as generating incomplete, non-factual,…

Computation and Language · Computer Science 2024-10-03 Yougang Lyu, Lingyong Yan, Shuaiqiang Wang, Haibo Shi, Dawei Yin, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren

Measuring and Modifying Factual Knowledge in Large Language Models

Large Language Models (LLMs) store an extensive amount of factual knowledge obtained from vast collections of text. To effectively utilize these models for downstream tasks, it is crucial to have reliable methods for measuring their…

Computation and Language · Computer Science 2023-06-13 Pouya Pezeshkpour

1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators?

Large Language Models (LLMs) have garnered significant attention due to their remarkable ability to process information across various languages. Despite their capabilities, they exhibit inconsistencies in handling identical queries in…

Computation and Language · Computer Science 2024-06-24 Yue Huang, Chenrui Fan, Yuan Li, Siyuan Wu, Tianyi Zhou, Xiangliang Zhang, Lichao Sun

Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method

Large Language Models (LLMs) have shown great potential in Natural Language Processing (NLP) tasks. However, recent literature reveals that LLMs generate nonfactual responses intermittently, which impedes the LLMs' reliability for further…

Computation and Language · Computer Science 2024-03-22 Yukun Zhao, Lingyong Yan, Weiwei Sun, Guoliang Xing, Chong Meng, Shuaiqiang Wang, Zhicong Cheng, Zhaochun Ren, Dawei Yin

KaLM: Knowledge-aligned Autoregressive Language Modeling via Dual-view Knowledge Graph Contrastive Learning

Autoregressive large language models (LLMs) pre-trained by next token prediction are inherently proficient in generative tasks. However, their performance on knowledge-driven tasks such as factual knowledge querying remains unsatisfactory.…

Computation and Language · Computer Science 2026-01-14 Peng Yu, Cheng Deng, Beiya Dai, Xinbing Wang, Ying Wen

Learning to Self-Verify Makes Language Models Better Reasoners

Recent large language models (LLMs) achieve strong performance in generating promising reasoning paths for complex tasks. However, despite powerful generation ability, LLMs remain weak at verifying their own answers, revealing a persistent…

Computation and Language · Computer Science 2026-02-10 Yuxin Chen, Yu Wang, Yi Zhang, Ziang Ye, Zhengzhou Cai, Yaorui Shi, Qi Gu, Hui Su, Xunliang Cai, Xiang Wang, An Zhang, Tat-Seng Chua

Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey

Problem-solving has been a fundamental driver of human progress in numerous domains. With advancements in artificial intelligence, Large Language Models (LLMs) have emerged as powerful tools capable of tackling complex problems across…

Machine Learning · Computer Science 2025-05-07 Da Zheng, Lun Du, Junwei Su, Yuchen Tian, Yuqi Zhu, Jintian Zhang, Lanning Wei, Ningyu Zhang, Huajun Chen

Small Language Models Need Strong Verifiers to Self-Correct Reasoning

Self-correction has emerged as a promising solution to boost the reasoning performance of large language models (LLMs), where LLMs refine their solutions using self-generated critiques that pinpoint the errors. This work explores whether…

Computation and Language · Computer Science 2024-06-07 Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, Lu Wang

Has this Fact been Edited? Detecting Knowledge Edits in Language Models

Knowledge editing methods (KEs) can update language models' obsolete or inaccurate knowledge learned from pre-training. However, KEs can be used for malicious applications, e.g., inserting misinformation and toxic content. Knowing whether a…

Computation and Language · Computer Science 2025-02-11 Paul Youssef, Zhixue Zhao, Christin Seifert, Jörg Schlötterer

Investigating Factuality in Long-Form Text Generation: The Roles of Self-Known and Self-Unknown

Large language models (LLMs) have demonstrated strong capabilities in text understanding and generation. However, they often lack factuality, producing a mixture of true and false information, especially in long-form generation. In this…

Computation and Language · Computer Science 2025-09-26 Lifu Tu, Rui Meng, Shafiq Joty, Yingbo Zhou, Semih Yavuz

Supervised Knowledge Makes Large Language Models Better In-context Learners

Large Language Models (LLMs) exhibit emerging in-context learning abilities through prompt engineering. The recent progress in large-scale generative models has further expanded their use in real-world language applications. However, the…

Computation and Language · Computer Science 2024-04-12 Linyi Yang, Shuibai Zhang, Zhuohao Yu, Guangsheng Bao, Yidong Wang, Jindong Wang, Ruochen Xu, Wei Ye, Xing Xie, Weizhu Chen, Yue Zhang

Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models

Knowledge augmentation has significantly enhanced the performance of Large Language Models (LLMs) in knowledge-intensive tasks. However, existing methods typically operate on the simplistic premise that model performance equates with…

Computation and Language · Computer Science 2026-02-16 Hao Chen, Ye He, Yuchun Fan, Yukun Yan, Zhenghao Liu, Qingfu Zhu, Maosong Sun, Wanxiang Che

Resolving Conflicting Evidence in Automated Fact-Checking: A Study on Retrieval-Augmented LLMs

Large Language Models (LLMs) augmented with retrieval mechanisms have demonstrated significant potential in fact-checking tasks by integrating external knowledge. However, their reliability decreases when confronted with conflicting…

Computation and Language · Computer Science 2025-05-26 Ziyu Ge, Yuhao Wu, Daniel Wai Kit Chin, Roy Ka-Wei Lee, Rui Cao

Statistical Knowledge Assessment for Large Language Models

Given varying prompts regarding a factoid question, can a large language model (LLM) reliably generate factually correct answers? Existing LLMs may generate distinct responses for different prompts. In this paper, we study the problem of…

Computation and Language · Computer Science 2023-10-31 Qingxiu Dong, Jingjing Xu, Lingpeng Kong, Zhifang Sui, Lei Li

Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation

Large language models (LLMs) have shown impressive prowess in solving a wide range of tasks with world knowledge. However, it remains unclear how well LLMs are able to perceive their factual knowledge boundaries, particularly under…

Computation and Language · Computer Science 2024-11-20 Ruiyang Ren, Yuhao Wang, Yingqi Qu, Wayne Xin Zhao, Jing Liu, Hao Tian, Hua Wu, Ji-Rong Wen, Haifeng Wang

Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model Attribution

Although achieving great success, Large Language Models (LLMs) usually suffer from unreliable hallucinations. Although language attribution can be a potential solution, there are no suitable benchmarks and evaluation metrics to attribute…

Computation and Language · Computer Science 2024-05-24 Xinze Li, Yixin Cao, Liangming Pan, Yubo Ma, Aixin Sun

LLatrieval: LLM-Verified Retrieval for Verifiable Generation

Verifiable generation aims to let the large language model (LLM) generate text with supporting documents, which enables the user to flexibly verify the answer and makes the LLM's output more reliable. Retrieval plays a crucial role in…

Computation and Language · Computer Science 2024-03-28 Xiaonan Li, Changtai Zhu, Linyang Li, Zhangyue Yin, Tianxiang Sun, Xipeng Qiu

VerifierQ: Enhancing LLM Test Time Compute with Q-Learning-based Verifiers

Recent advancements in test time compute, particularly through the use of verifier models, have significantly enhanced the reasoning capabilities of Large Language Models (LLMs). This generator-verifier approach closely resembles the…

Machine Learning · Computer Science 2024-10-11 Jianing Qi, Hao Tang, Zhigang Zhu

Towards Knowledge Checking in Retrieval-augmented Generation: A Representation Perspective

Retrieval-Augmented Generation (RAG) systems have shown promise in enhancing the performance of Large Language Models (LLMs). However, these systems face challenges in effectively integrating external knowledge with the LLM's internal…

Machine Learning · Computer Science 2024-11-25 Shenglai Zeng, Jiankun Zhang, Bingheng Li, Yuping Lin, Tianqi Zheng, Dante Everaert, Hanqing Lu, Hui Liu, Hui Liu, Yue Xing, Monica Xiao Cheng, Jiliang Tang

Adapters for Enhanced Modeling of Multilingual Knowledge and Text

Large language models appear to learn facts from the large text corpora they are trained on. Such facts are encoded implicitly within their many parameters, making it difficult to verify or manipulate what knowledge has been learned.…

Computation and Language · Computer Science 2022-10-27 Yifan Hou, Wenxiang Jiao, Meizhen Liu, Carl Allen, Zhaopeng Tu, Mrinmaya Sachan