Related papers: Recitation-Augmented Language Models

Retrieval-Generation Synergy Augmented Large Language Models

Large language models augmented with task-relevant documents have demonstrated impressive performance on knowledge-intensive tasks. However, regarding how to obtain effective documents, the existing methods are mainly divided into two…

Computation and Language · Computer Science 2023-10-10 Zhangyin Feng , Xiaocheng Feng , Dezhi Zhao , Maojin Yang , Bing Qin

Post-training an LLM for RAG? Train on Self-Generated Demonstrations

Large language models (LLMs) often struggle with knowledge intensive NLP tasks, such as answering "Who won the latest World Cup?" because the knowledge they learn during training may be insufficient or outdated. Conditioning generation on…

Computation and Language · Computer Science 2025-03-04 Matthew Finlayson , Ilia Kulikov , Daniel M. Bikel , Barlas Oguz , Xilun Chen , Aasish Pappu

VeriCite: Towards Reliable Citations in Retrieval-Augmented Generation via Rigorous Verification

Retrieval-Augmented Generation (RAG) has emerged as a crucial approach for enhancing the responses of large language models (LLMs) with external knowledge sources. Despite the impressive performance in complex question-answering tasks, RAG…

Information Retrieval · Computer Science 2025-10-14 Haosheng Qian , Yixing Fan , Jiafeng Guo , Ruqing Zhang , Qi Chen , Dawei Yin , Xueqi Cheng

Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation

Despite the significant progress of large language models (LLMs) in various tasks, they often produce factual errors due to their limited internal knowledge. Retrieval-Augmented Generation (RAG), which enhances LLMs with external knowledge…

Computation and Language · Computer Science 2024-10-10 Yuanjie Lyu , Zihan Niu , Zheyong Xie , Chao Zhang , Tong Xu , Yang Wang , Enhong Chen

REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-training

Pre-trained Language Models (PLMs) have achieved great success on Machine Reading Comprehension (MRC) over the past few years. Although the general language representation learned from large-scale corpora does benefit MRC, the poor support…

Computation and Language · Computer Science 2021-05-19 Fangkai Jiao , Yangyang Guo , Yilin Niu , Feng Ji , Feng-Lin Li , Liqiang Nie

Transparentize the Internal and External Knowledge Utilization in LLMs with Trustworthy Citation

While hallucinations of large language models could been alleviated through retrieval-augmented generation and citation generation, how the model utilizes internal knowledge is still opaque, and the trustworthiness of its generated answers…

Computation and Language · Computer Science 2025-04-22 Jiajun Shen , Tong Zhou , Yubo Chen , Delai Qiu , Shengping Liu , Kang Liu , Jun Zhao

REALM: Retrieval-Augmented Language Model Pre-Training

Language model pre-training has been shown to capture a surprising amount of world knowledge, crucial for NLP tasks such as question answering. However, this knowledge is stored implicitly in the parameters of a neural network, requiring…

Computation and Language · Computer Science 2020-02-21 Kelvin Guu , Kenton Lee , Zora Tung , Panupong Pasupat , Ming-Wei Chang

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Despite their remarkable capabilities, large language models (LLMs) often produce responses containing factual inaccuracies due to their sole reliance on the parametric knowledge they encapsulate. Retrieval-Augmented Generation (RAG), an ad…

Computation and Language · Computer Science 2023-10-19 Akari Asai , Zeqiu Wu , Yizhong Wang , Avirup Sil , Hannaneh Hajishirzi

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

We introduce SelfCite, a novel self-supervised approach that aligns LLMs to generate high-quality, fine-grained, sentence-level citations for the statements in their generated responses. Instead of only relying on costly and labor-intensive…

Computation and Language · Computer Science 2025-06-17 Yung-Sung Chuang , Benjamin Cohen-Wang , Shannon Zejiang Shen , Zhaofeng Wu , Hu Xu , Xi Victoria Lin , James Glass , Shang-Wen Li , Wen-tau Yih

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate…

Computation and Language · Computer Science 2021-04-13 Patrick Lewis , Ethan Perez , Aleksandra Piktus , Fabio Petroni , Vladimir Karpukhin , Naman Goyal , Heinrich Küttler , Mike Lewis , Wen-tau Yih , Tim Rocktäschel , Sebastian Riedel , Douwe Kiela

Look, Recite, Then Answer: Enhancing VLM Performance via Self-Generated Knowledge Hints

Vision-Language Models (VLMs) exhibit significant performance plateaus in specialized domains like precision agriculture, primarily due to "Reasoning-Driven Hallucination" where linguistic priors override visual perception. A key bottleneck…

Computer Vision and Pattern Recognition · Computer Science 2025-12-04 Xisheng Feng

Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models

Trustworthy language models should provide both correct and verifiable answers. However, citations generated directly by standalone LLMs are often unreliable. As a result, current systems insert citations by querying an external retriever…

Artificial Intelligence · Computer Science 2026-04-07 Yukun Huang , Sanxing Chen , Jian Pei , Manzil Zaheer , Bhuwan Dhingra

Rethinking with Retrieval: Faithful Large Language Model Inference

Despite the success of large language models (LLMs) in various natural language processing (NLP) tasks, the stored knowledge in these models may inevitably be incomplete, out-of-date, or incorrect. This motivates the need to utilize…

Computation and Language · Computer Science 2023-01-03 Hangfeng He , Hongming Zhang , Dan Roth

On the Capacity of Citation Generation by Large Language Models

Retrieval-augmented generation (RAG) appears as a promising method to alleviate the "hallucination" problem in large language models (LLMs), since it can incorporate external traceable resources for response generation. The essence of RAG…

Computation and Language · Computer Science 2024-10-16 Haosheng Qian , Yixing Fan , Ruqing Zhang , Jiafeng Guo

Beyond Memorization: The Challenge of Random Memory Access in Language Models

Recent developments in Language Models (LMs) have shown their effectiveness in NLP tasks, particularly in knowledge-intensive tasks. However, the mechanisms underlying knowledge storage and memory access within their parameters remain…

Computation and Language · Computer Science 2024-07-23 Tongyao Zhu , Qian Liu , Liang Pang , Zhengbao Jiang , Min-Yen Kan , Min Lin

Retrieval Feedback Memory Enhancement Large Model Retrieval Generation Method

Large Language Models (LLMs) have shown remarkable capabilities across diverse tasks, yet they face inherent limitations such as constrained parametric knowledge and high retraining costs. Retrieval-Augmented Generation (RAG) augments the…

Information Retrieval · Computer Science 2025-08-26 Leqian Li , Dianxi Shi , Jialu Zhou , Xinyu Wei , Mingyue Yang , Songchang Jin , Shaowu Yang

Finding Answers in Thought Matters: Revisiting Evaluation on Large Language Models with Reasoning

Evaluating generative models, such as large language models (LLMs), commonly involves question-answering tasks where the final answer is selected based on probability of answer choices. On the other hand, for models requiring reasoning, the…

Computation and Language · Computer Science 2025-10-17 Hwiyeol Jo , Joosung Lee , Jaehone Lee , Sang-Woo Lee , Joonsuk Park , Kang Min Yoo

Enabling Large Language Models to Generate Text with Citations

Large language models (LLMs) have emerged as a widely-used tool for information seeking, but their generated outputs are prone to hallucination. In this work, our aim is to allow LLMs to generate text with citations, improving their factual…

Computation and Language · Computer Science 2023-11-01 Tianyu Gao , Howard Yen , Jiatong Yu , Danqi Chen

LLM Guided Inductive Inference for Solving Compositional Problems

While large language models (LLMs) have demonstrated impressive performance in question-answering tasks, their performance is limited when the questions require knowledge that is not included in the model's training data and can only be…

Computation and Language · Computer Science 2023-09-22 Abhigya Sodani , Lauren Moos , Matthew Mirman

RECALL: REpresentation-aligned Catastrophic-forgetting ALLeviation via Hierarchical Model Merging

We unveil that internal representations in large language models (LLMs) serve as reliable proxies of learned knowledge, and propose RECALL, a novel representation-aware model merging framework for continual learning without access to…

Computation and Language · Computer Science 2025-10-24 Bowen Wang , Haiyuan Wan , Liwen Shi , Chen Yang , Peng He , Yue Ma , Haochen Han , Wenhao Li , Tiao Tan , Yongjian Li , Fangming Liu , Yifan Gong , Sheng Zhang