Related papers: Efficient Test-Time Retrieval Augmented Generation

Corrective Retrieval Augmented Generation

Large language models (LLMs) inevitably exhibit hallucinations since the accuracy of generated texts cannot be secured solely by the parametric knowledge they encapsulate. Although retrieval-augmented generation (RAG) is a practicable…

Computation and Language · Computer Science 2024-10-08 Shi-Qi Yan , Jia-Chen Gu , Yun Zhu , Zhen-Hua Ling

MAIN-RAG: Multi-Agent Filtering Retrieval-Augmented Generation

Large Language Models (LLMs) are becoming essential tools for various natural language processing tasks but often suffer from generating outdated or incorrect information. Retrieval-Augmented Generation (RAG) addresses this issue by…

Computation and Language · Computer Science 2025-01-03 Chia-Yuan Chang , Zhimeng Jiang , Vineeth Rakesh , Menghai Pan , Chin-Chia Michael Yeh , Guanchu Wang , Mingzhi Hu , Zhichao Xu , Yan Zheng , Mahashweta Das , Na Zou

Reliable Decision Making via Calibration Oriented Retrieval Augmented Generation

Recently, Large Language Models (LLMs) have been increasingly used to support various decision-making tasks, assisting humans in making informed decisions. However, when LLMs confidently provide incorrect information, it can lead humans to…

Information Retrieval · Computer Science 2025-10-16 Chaeyun Jang , Deukhwan Cho , Seanie Lee , Hyungi Lee , Juho Lee

A Survey on Retrieval-Augmented Text Generation for Large Language Models

Retrieval-Augmented Generation (RAG) merges retrieval methods with deep learning advancements to address the static limitations of large language models (LLMs) by enabling the dynamic integration of up-to-date external information. This…

Information Retrieval · Computer Science 2026-05-19 Yizheng Huang , Jimmy Huang

Retrieval-Augmented Generation with Estimation of Source Reliability

Retrieval-Augmented Generation (RAG) is an effective approach to enhance the factual accuracy of large language models (LLMs) by retrieving information from external databases, which are typically composed of diverse sources, to supplement…

Machine Learning · Computer Science 2025-10-15 Jeongyeon Hwang , Junyoung Park , Hyejin Park , Dongwoo Kim , Sangdon Park , Jungseul Ok

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Despite their remarkable capabilities, large language models (LLMs) often produce responses containing factual inaccuracies due to their sole reliance on the parametric knowledge they encapsulate. Retrieval-Augmented Generation (RAG), an ad…

Computation and Language · Computer Science 2023-10-19 Akari Asai , Zeqiu Wu , Yizhong Wang , Avirup Sil , Hannaneh Hajishirzi

Aligning Extraction and Generation for Robust Retrieval-Augmented Generation

Retrieval-augmented generation (RAG) enhances LLMs with external knowledge, yet generation remains vulnerable to retrieval-induced noise and uncertain placement of relevant chunks, often causing hallucinations. We present Ext2Gen, an…

Computation and Language · Computer Science 2025-11-18 Hwanjun Song , Jeonghwan Choi , Minseok Kim

DR-RAG: Applying Dynamic Document Relevance to Retrieval-Augmented Generation for Question-Answering

Retrieval-Augmented Generation (RAG) has recently demonstrated the performance of Large Language Models (LLMs) in the knowledge-intensive tasks such as Question-Answering (QA). RAG expands the query context by incorporating external…

Machine Learning · Computer Science 2024-06-18 Zijian Hei , Weiling Liu , Wenjie Ou , Juyi Qiao , Junming Jiao , Guowen Song , Ting Tian , Yi Lin

A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation

A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation Download PDF Neal Gregory Lawton, Alfy Samuel, Anoop Kumar, Daben Liu Published: 20 Aug 2025, Retrieval augmented generation (RAG) is a popular…

Computation and Language · Computer Science 2025-10-21 Neal Gregory Lawton , Alfy Samuel , Anoop Kumar , Daben Liu

Meta Knowledge for Retrieval Augmented Large Language Models

Retrieval Augmented Generation (RAG) is a technique used to augment Large Language Models (LLMs) with contextually relevant, time-critical, or domain-specific information without altering the underlying model parameters. However,…

Information Retrieval · Computer Science 2024-08-20 Laurent Mombaerts , Terry Ding , Adi Banerjee , Florian Felice , Jonathan Taws , Tarik Borogovac

Optimizing Retrieval-Augmented Generation with Elasticsearch for Enhanced Question-Answering Systems

This study aims to improve the accuracy and quality of large-scale language models (LLMs) in answering questions by integrating Elasticsearch into the Retrieval Augmented Generation (RAG) framework. The experiment uses the Stanford Question…

Information Retrieval · Computer Science 2024-10-21 Jiajing Chen , Runyuan Bao , Hongye Zheng , Zhen Qi , Jianjun Wei , Jiacheng Hu

RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation

Large Language Models (LLMs) exhibit remarkable capabilities but are prone to generating inaccurate or hallucinatory responses. This limitation stems from their reliance on vast pretraining datasets, making them susceptible to errors in…

Computation and Language · Computer Science 2024-04-02 Chi-Min Chan , Chunpu Xu , Ruibin Yuan , Hongyin Luo , Wei Xue , Yike Guo , Jie Fu

Optimizing and Evaluating Enterprise Retrieval-Augmented Generation (RAG): A Content Design Perspective

Retrieval-augmented generation (RAG) is a popular technique for using large language models (LLMs) to build customer-support, question-answering solutions. In this paper, we share our team's practical experience building and maintaining…

Information Retrieval · Computer Science 2024-10-18 Sarah Packowski , Inge Halilovic , Jenifer Schlotfeldt , Trish Smith

Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models

Retrieval-Augmented Generation (RAG) has been shown to enhance the factual accuracy of Large Language Models (LLMs), but existing methods often suffer from limited reasoning capabilities in effectively using the retrieved evidence,…

Computation and Language · Computer Science 2024-10-03 Shayekh Bin Islam , Md Asib Rahman , K S M Tozammel Hossain , Enamul Hoque , Shafiq Joty , Md Rizwan Parvez

Parametric Retrieval Augmented Generation

Retrieval-augmented generation (RAG) techniques have emerged as a promising solution to enhance the reliability of large language models (LLMs) by addressing issues like hallucinations, outdated knowledge, and domain adaptation. In…

Computation and Language · Computer Science 2025-01-28 Weihang Su , Yichen Tang , Qingyao Ai , Junxi Yan , Changyue Wang , Hongning Wang , Ziyi Ye , Yujia Zhou , Yiqun Liu

Progressive Searching for Retrieval in RAG

Retrieval Augmented Generation (RAG) is a promising technique for mitigating two key limitations of large language models (LLMs): outdated information and hallucinations. RAG system stores documents as embedding vectors in a database. Given…

Information Retrieval · Computer Science 2026-02-10 Taehee Jeong , Xingzhe Zhao , Peizu Li , Markus Valvur , Weihua Zhao

Improving Retrieval for RAG based Question Answering Models on Financial Documents

The effectiveness of Large Language Models (LLMs) in generating accurate responses relies heavily on the quality of input provided, particularly when employing Retrieval Augmented Generation (RAG) techniques. RAG enhances LLMs by sourcing…

Information Retrieval · Computer Science 2024-08-02 Spurthi Setty , Harsh Thakkar , Alyssa Lee , Eden Chung , Natan Vidra

Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models

Iterative retrieval refers to the process in which the model continuously queries the retriever during generation to enhance the relevance of the retrieved knowledge, thereby improving the performance of Retrieval-Augmented Generation…

Computation and Language · Computer Science 2024-12-02 Tian Yu , Shaolei Zhang , Yang Feng

When Retrieval Succeeds and Fails: Rethinking Retrieval-Augmented Generation for LLMs

Large Language Models (LLMs) have enabled a wide range of applications through their powerful capabilities in language understanding and generation. However, as LLMs are trained on static corpora, they face difficulties in addressing…

Computation and Language · Computer Science 2025-10-13 Yongjie Wang , Yue Yu , Kaisong Song , Jun Lin , Zhiqi Shen

SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains

Retrieval-augmented generation (RAG) enhances the question-answering (QA) abilities of large language models (LLMs) by integrating external knowledge. However, adapting general-purpose RAG systems to specialized fields such as science and…

Computation and Language · Computer Science 2025-01-28 Ran Xu , Hui Liu , Sreyashi Nag , Zhenwei Dai , Yaochen Xie , Xianfeng Tang , Chen Luo , Yang Li , Joyce C. Ho , Carl Yang , Qi He