Related papers: RAG-Stack: Co-Optimizing RAG Quality and Performan…

Toward Optimal Search and Retrieval for RAG

Retrieval-augmented generation (RAG) is a promising method for addressing some of the memory-related challenges associated with Large Language Models (LLMs). Two separate systems form the RAG pipeline, the retriever and the reader, and the…

Computation and Language · Computer Science 2024-11-13 Alexandria Leto, Cecilia Aguerrebere, Ishwar Bhati, Ted Willke, Mariano Tepper, Vy Ai Vo

RAGe: A Retrieval-Augmented Generation Evaluation Framework

Deploying Large Language Model (LLM) applications, particularly those relying on Retrieval-Augmented Generation (RAG), remains challenging due to high computational demands, outdated knowledge bases, and the need to manually select optimal…

Information Retrieval · Computer Science 2026-05-28 Larissa Guder, João Pedro de Moura, Arthur Accorsi, Gustavo Losch do Amaral, Maurício Cecílio Magnaguagno, Felipe Meneguzzi, Marcio Sorraglia Pinho, Dalvan Griebler

SRAG: RAG with Structured Data Improves Vector Retrieval

Retrieval Augmented Generation (RAG) provides the necessary informational grounding to LLMs in the form of chunks retrieved from a vector database or through web search. RAG could also use knowledge graph triples as a means of providing…

Information Retrieval · Computer Science 2026-03-31 Shalin Shah, Srikanth Ryali, Ramasubbu Venkatesh

A Survey on Knowledge-Oriented Retrieval-Augmented Generation

Retrieval-Augmented Generation (RAG) has gained significant attention in recent years for its potential to enhance natural language understanding and generation by combining large-scale retrieval systems with generative models. RAG…

Computation and Language · Computer Science 2025-03-18 Mingyue Cheng, Yucong Luo, Jie Ouyang, Qi Liu, Huijie Liu, Li Li, Shuo Yu, Bohou Zhang, Jiawei Cao, Jie Ma, Daoyu Wang, Enhong Chen

RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines

Retrieval-augmented generation (RAG) pipelines have become the de-facto approach for building AI assistants with access to external, domain-specific knowledge. Given a user query, RAG pipelines typically first retrieve (R) relevant…

Human-Computer Interaction · Computer Science 2025-04-21 Quentin Romero Lauro, Shreya Shankar, Sepanta Zeighami, Aditya Parameswaran

RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving

Retrieval-augmented generation (RAG), which combines large language models (LLMs) with retrievals from external knowledge databases, is emerging as a popular approach for reliable LLM serving. However, efficient RAG serving remains an open…

Information Retrieval · Computer Science 2025-03-24 Wenqi Jiang, Suvinay Subramanian, Cat Graves, Gustavo Alonso, Amir Yazdanbakhsh, Vidushi Dadu

HyperRAG: Enhancing Quality-Efficiency Tradeoffs in Retrieval-Augmented Generation with Reranker KV-Cache Reuse

Retrieval-Augmented Generation (RAG) has emerged as a powerful paradigm for enhancing the performance of large language models (LLMs) by integrating external knowledge into the generation process. A key component of RAG pipelines is the…

Computation and Language · Computer Science 2025-04-07 Yuwei An, Yihua Cheng, Seo Jin Park, Junchen Jiang

Investigating the Robustness of Retrieval-Augmented Generation at the Query Level

Large language models (LLMs) are very costly and inefficient to update with new information. To address this limitation, retrieval-augmented generation (RAG) has been proposed as a solution that dynamically incorporates external knowledge…

Computation and Language · Computer Science 2025-07-10 Sezen Perçin, Xin Su, Qutub Sha Syed, Phillip Howard, Aleksei Kuvshinov, Leo Schwinn, Kay-Ulrich Scholl

On The Reproducibility Limitations of RAG Systems

Retrieval-Augmented Generation (RAG) is increasingly employed in generative AI-driven scientific workflows to integrate rapidly evolving scientific knowledge bases, yet its reliability is frequently compromised by non-determinism in their…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-09-24 Baiqiang Wang, Dongfang Zhao, Nathan R Tallent, Luanzheng Guo

Metadata-Driven Retrieval-Augmented Generation for Financial Question Answering

Retrieval-Augmented Generation (RAG) struggles on long, structured financial filings where relevant evidence is sparse and cross-referenced. This paper presents a systematic investigation of advanced metadata-driven Retrieval-Augmented…

Information Retrieval · Computer Science 2025-10-29 Michail Dadopoulos, Anestis Ladas, Stratos Moschidis, Ioannis Negkakis

RAGTrace: Understanding and Refining Retrieval-Generation Dynamics in Retrieval-Augmented Generation

Retrieval-Augmented Generation (RAG) systems have emerged as a promising solution to enhance large language models (LLMs) by integrating external knowledge retrieval with generative capabilities. While significant advancements have been…

Human-Computer Interaction · Computer Science 2025-08-11 Sizhe Cheng, Jiaping Li, Huanchen Wang, Yuxin Ma

Retrieval-Augmented Generation: A Comprehensive Survey of Architectures, Enhancements, and Robustness Frontiers

Retrieval-Augmented Generation (RAG) has emerged as a powerful paradigm to enhance large language models (LLMs) by conditioning generation on external evidence retrieved at inference time. While RAG addresses critical limitations of…

Information Retrieval · Computer Science 2025-06-03 Chaitanya Sharma

Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems

Retrieval-Augmented Generation (RAG) has emerged as a promising framework to mitigate hallucinations in Large Language Models (LLMs), yet its overall performance is dependent on the underlying retrieval system. In the finance domain,…

Information Retrieval · Computer Science 2025-03-20 Sejong Kim, Hyunseo Song, Hyunwoo Seo, Hyunjun Kim

Searching for Best Practices in Retrieval-Augmented Generation

Retrieval-augmented generation (RAG) techniques have proven to be effective in integrating up-to-date information, mitigating hallucinations, and enhancing response quality, particularly in specialized domains. While many RAG approaches…

Computation and Language · Computer Science 2024-07-02 Xiaohua Wang, Zhenghua Wang, Xuan Gao, Feiran Zhang, Yixin Wu, Zhibo Xu, Tianyuan Shi, Zhengyuan Wang, Shizheng Li, Qi Qian, Ruicheng Yin, Changze Lv, Xiaoqing Zheng, Xuanjing Huang

RAISE: RAG Design as an Architecture Search Problem

Retrieval-augmented generation (RAG) systems expose numerous design choices spanning query rewriting, chunking, retrieval depth, reranking, and context compression. In practice, these choices are often configured through heuristics,…

Artificial Intelligence · Computer Science 2026-05-29 Zhen Chen, Yibing Liu, Weihao Xie, Yu Liang, Peilin Chen, Shiqi Wang

Engineering the RAG Stack: A Comprehensive Review of the Architecture and Trust Frameworks for Retrieval-Augmented Generation Systems

This article provides a comprehensive systematic literature review of academic studies, industrial applications, and real-world deployments from 2018 to 2025, providing a practical guide and detailed overview of modern Retrieval-Augmented…

Information Retrieval · Computer Science 2026-01-12 Dean Wampler, Dave Nielson, Alireza Seddighi

Retrieval-Augmented Generation with Estimation of Source Reliability

Retrieval-Augmented Generation (RAG) is an effective approach to enhance the factual accuracy of large language models (LLMs) by retrieving information from external databases, which are typically composed of diverse sources, to supplement…

Machine Learning · Computer Science 2025-10-15 Jeongyeon Hwang, Junyoung Park, Hyejin Park, Dongwoo Kim, Sangdon Park, Jungseul Ok

mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge Graphs

Retrieval-Augmented Generation (RAG) has emerged as a powerful paradigm for enhancing the capabilities of large language models. However, existing RAG evaluation predominantly focuses on text retrieval and relies on opaque, end-to-end…

Information Retrieval · Computer Science 2025-05-19 Chuan Xu, Qiaosheng Chen, Yutong Feng, Gong Cheng

Retrieval-Augmented Generation for AI-Generated Content: A Survey

Advancements in model algorithms, the growth of foundational models, and access to high-quality datasets have propelled the evolution of Artificial Intelligence Generated Content (AIGC). Despite its notable successes, AIGC still faces…

Computer Vision and Pattern Recognition · Computer Science 2024-06-24 Penghao Zhao, Hailin Zhang, Qinhan Yu, Zhengren Wang, Yunteng Geng, Fangcheng Fu, Ling Yang, Wentao Zhang, Jie Jiang, Bin Cui

XRAG: eXamining the Core -- Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation

Retrieval-augmented generation (RAG) synergizes the retrieval of pertinent data with the generative capabilities of Large Language Models (LLMs), ensuring that the generated output is not only contextually relevant but also accurate and…

Computation and Language · Computer Science 2025-05-19 Qianren Mao, Yangyifei Luo, Qili Zhang, Yashuo Luo, Zhilong Cao, Jinlong Zhang, HanWen Hao, Zhijun Chen, Weifeng Jiang, Junnan Liu, Xiaolong Wang, Zhenting Huang, Zhixing Tan, Sun Jie, Bo Li, Xudong Liu, Richong Zhang, Jianxin Li