Related papers: Recitation-Augmented Language Models

200 papers

Large language models augmented with task-relevant documents have demonstrated impressive performance on knowledge-intensive tasks. However, regarding how to obtain effective documents, the existing methods are mainly divided into two…

Computation and Language · Computer Science 2023-10-10 Zhangyin Feng, Xiaocheng Feng, Dezhi Zhao, Maojin Yang, Bing Qin

Large language models (LLMs) often struggle with knowledge intensive NLP tasks, such as answering "Who won the latest World Cup?" because the knowledge they learn during training may be insufficient or outdated. Conditioning generation on…

Computation and Language · Computer Science 2025-03-04 Matthew Finlayson, Ilia Kulikov, Daniel M. Bikel, Barlas Oguz, Xilun Chen, Aasish Pappu

Retrieval-Augmented Generation (RAG) has emerged as a crucial approach for enhancing the responses of large language models (LLMs) with external knowledge sources. Despite the impressive performance in complex question-answering tasks, RAG…

Information Retrieval · Computer Science 2025-10-14 Haosheng Qian, Yixing Fan, Jiafeng Guo, Ruqing Zhang, Qi Chen, Dawei Yin, Xueqi Cheng

Despite the significant progress of large language models (LLMs) in various tasks, they often produce factual errors due to their limited internal knowledge. Retrieval-Augmented Generation (RAG), which enhances LLMs with external knowledge…

Computation and Language · Computer Science 2024-10-10 Yuanjie Lyu, Zihan Niu, Zheyong Xie, Chao Zhang, Tong Xu, Yang Wang, Enhong Chen

Pre-trained Language Models (PLMs) have achieved great success on Machine Reading Comprehension (MRC) over the past few years. Although the general language representation learned from large-scale corpora does benefit MRC, the poor support…

Computation and Language · Computer Science 2021-05-19 Fangkai Jiao, Yangyang Guo, Yilin Niu, Feng Ji, Feng-Lin Li, Liqiang Nie

While hallucinations of large language models could be alleviated through retrieval-augmented generation and citation generation, how the model utilizes internal knowledge is still opaque, and the trustworthiness of its generated answers…

Computation and Language · Computer Science 2025-04-22 Jiajun Shen, Tong Zhou, Yubo Chen, Delai Qiu, Shengping Liu, Kang Liu, Jun Zhao

Language model pre-training has been shown to capture a surprising amount of world knowledge, crucial for NLP tasks such as question answering. However, this knowledge is stored implicitly in the parameters of a neural network, requiring…

Computation and Language · Computer Science 2020-02-21 Kelvin Guu, Kenton Lee, Zora Tung, Panupong Pasupat, Ming-Wei Chang

Despite their remarkable capabilities, large language models (LLMs) often produce responses containing factual inaccuracies due to their sole reliance on the parametric knowledge they encapsulate. Retrieval-Augmented Generation (RAG), an ad…

Computation and Language · Computer Science 2023-10-19 Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, Hannaneh Hajishirzi

We introduce SelfCite, a novel self-supervised approach that aligns LLMs to generate high-quality, fine-grained, sentence-level citations for the statements in their generated responses. Instead of only relying on costly and labor-intensive…

Computation and Language · Computer Science 2025-06-17 Yung-Sung Chuang, Benjamin Cohen-Wang, Shannon Zejiang Shen, Zhaofeng Wu, Hu Xu, Xi Victoria Lin, James Glass, Shang-Wen Li, Wen-tau Yih

Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate…

Vision-Language Models (VLMs) exhibit significant performance plateaus in specialized domains like precision agriculture, primarily due to "Reasoning-Driven Hallucination" where linguistic priors override visual perception. A key bottleneck…

Computer Vision and Pattern Recognition · Computer Science 2025-12-04 Xisheng Feng

Trustworthy language models should provide both correct and verifiable answers. However, citations generated directly by standalone LLMs are often unreliable. As a result, current systems insert citations by querying an external retriever…

Artificial Intelligence · Computer Science 2026-04-07 Yukun Huang, Sanxing Chen, Jian Pei, Manzil Zaheer, Bhuwan Dhingra

Despite the success of large language models (LLMs) in various natural language processing (NLP) tasks, the stored knowledge in these models may inevitably be incomplete, out-of-date, or incorrect. This motivates the need to utilize…

Computation and Language · Computer Science 2023-01-03 Hangfeng He, Hongming Zhang, Dan Roth

Retrieval-augmented generation (RAG) appears as a promising method to alleviate the "hallucination" problem in large language models (LLMs), since it can incorporate external traceable resources for response generation. The essence of RAG…

Computation and Language · Computer Science 2024-10-16 Haosheng Qian, Yixing Fan, Ruqing Zhang, Jiafeng Guo

Recent developments in Language Models (LMs) have shown their effectiveness in NLP tasks, particularly in knowledge-intensive tasks. However, the mechanisms underlying knowledge storage and memory access within their parameters remain…

Computation and Language · Computer Science 2024-07-23 Tongyao Zhu, Qian Liu, Liang Pang, Zhengbao Jiang, Min-Yen Kan, Min Lin

Large Language Models (LLMs) have shown remarkable capabilities across diverse tasks, yet they face inherent limitations such as constrained parametric knowledge and high retraining costs. Retrieval-Augmented Generation (RAG) augments the…

Information Retrieval · Computer Science 2025-08-26 Leqian Li, Dianxi Shi, Jialu Zhou, Xinyu Wei, Mingyue Yang, Songchang Jin, Shaowu Yang

Evaluating generative models, such as large language models (LLMs), commonly involves question-answering tasks where the final answer is selected based on probability of answer choices. On the other hand, for models requiring reasoning, the…

Computation and Language · Computer Science 2025-10-17 Hwiyeol Jo, Joosung Lee, Jaehone Lee, Sang-Woo Lee, Joonsuk Park, Kang Min Yoo

Large language models (LLMs) have emerged as a widely-used tool for information seeking, but their generated outputs are prone to hallucination. In this work, our aim is to allow LLMs to generate text with citations, improving their factual…

Computation and Language · Computer Science 2023-11-01 Tianyu Gao, Howard Yen, Jiatong Yu, Danqi Chen

While large language models (LLMs) have demonstrated impressive performance in question-answering tasks, their performance is limited when the questions require knowledge that is not included in the model's training data and can only be…

Computation and Language · Computer Science 2023-09-22 Abhigya Sodani, Lauren Moos, Matthew Mirman

We unveil that internal representations in large language models (LLMs) serve as reliable proxies of learned knowledge, and propose RECALL, a novel representation-aware model merging framework for continual learning without access to…

Computation and Language · Computer Science 2025-10-24 Bowen Wang, Haiyuan Wan, Liwen Shi, Chen Yang, Peng He, Yue Ma, Haochen Han, Wenhao Li, Tiao Tan, Yongjian Li, Fangming Liu, Yifan Gong, Sheng Zhang