Related papers: Introducing a new hyper-parameter for RAG: Context…

Enhancing Retrieval-Augmented Generation: A Study of Best Practices

Retrieval-Augmented Generation (RAG) systems have recently shown remarkable advancements by integrating retrieval mechanisms into language models, enhancing their ability to produce more accurate and contextually relevant responses.…

Computation and Language · Computer Science 2025-01-14 Siran Li , Linus Stenzel , Carsten Eickhoff , Seyed Ali Bahrainian

Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency

Large language models achieve high task performance yet often hallucinate or rely on outdated knowledge. Retrieval-augmented generation (RAG) addresses these gaps by coupling generation with external search. We analyse how hyperparameters…

Machine Learning · Computer Science 2025-05-14 Adel Ammar , Anis Koubaa , Omer Nacar , Wadii Boulila

Dynamic Context Selection for Retrieval-Augmented Generation: Mitigating Distractors and Positional Bias

Retrieval Augmented Generation (RAG) enhances language model performance by incorporating external knowledge retrieved from large corpora, which makes it highly suitable for tasks such as open domain question answering. Standard RAG systems…

Information Retrieval · Computer Science 2025-12-17 Malika Iratni , Mohand Boughanem , Taoufiq Dkaki

On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation Systems

Retrieval-augmented generation (RAG) has emerged as an approach to augment large language models (LLMs) by reducing their reliance on static knowledge and improving answer factuality. RAG retrieves relevant context snippets and generates an…

Computation and Language · Computer Science 2025-02-21 Juraj Vladika , Florian Matthes

Reconstructing Context: Evaluating Advanced Chunking Strategies for Retrieval-Augmented Generation

Retrieval-augmented generation (RAG) has become a transformative approach for enhancing large language models (LLMs) by grounding their outputs in external knowledge sources. Yet, a critical question persists: how can vast volumes of…

Information Retrieval · Computer Science 2025-04-29 Carlo Merola , Jaspinder Singh

Better RAG using Relevant Information Gain

A common way to extend the memory of large language models (LLMs) is by retrieval augmented generation (RAG), which inserts text retrieved from a larger memory into an LLM's context window. However, the context window is typically limited…

Computation and Language · Computer Science 2025-02-14 Marc Pickett , Jeremy Hartman , Ayan Kumar Bhowmick , Raquib-ul Alam , Aditya Vempaty

Long Context RAG Performance of Large Language Models

Retrieval Augmented Generation (RAG) has emerged as a crucial technique for enhancing the accuracy of Large Language Models (LLMs) by incorporating external information. With the advent of LLMs that support increasingly longer context…

Machine Learning · Computer Science 2024-11-07 Quinn Leng , Jacob Portes , Sam Havens , Matei Zaharia , Michael Carbin

M-RAG: Making RAG Faster, Stronger, and More Efficient

Retrieval-Augmented Generation (RAG) has become a widely adopted paradigm for enhancing the reliability of large language models (LLMs). However, RAG systems are sensitive to retrieval strategies that rely on text chunking to construct…

Information Retrieval · Computer Science 2026-03-31 Sun Xu , Tongkai Xu , Baiheng Xie , Li Huang , Qiang Gao , Kunpeng Zhang

Context Awareness Gate For Retrieval Augmented Generation

Retrieval Augmented Generation (RAG) has emerged as a widely adopted approach to mitigate the limitations of large language models (LLMs) in answering domain-specific questions. Previous research has predominantly focused on improving the…

Machine Learning · Computer Science 2025-01-07 Mohammad Hassan Heydari , Arshia Hemmat , Erfan Naman , Afsaneh Fatemi

Adaptive Chunking: Optimizing Chunking-Method Selection for RAG

The effectiveness of Retrieval-Augmented Generation (RAG) is highly dependent on how documents are chunked, that is, segmented into smaller units for indexing and retrieval. Yet, commonly used "one-size-fits-all" approaches often fail to…

Computation and Language · Computer Science 2026-03-27 Paulo Roberto de Moura Júnior , Jean Lelong , Annabelle Blangero

Enhancing Retrieval Augmented Generation with Hierarchical Text Segmentation Chunking

Retrieval-Augmented Generation (RAG) systems commonly use chunking strategies for retrieval, which enhance large language models (LLMs) by enabling them to access external knowledge, ensuring that the retrieved information is up-to-date and…

Computation and Language · Computer Science 2025-07-15 Hai Toan Nguyen , Tien Dat Nguyen , Viet Ha Nguyen

Enhancing RAG Efficiency with Adaptive Context Compression

Retrieval-augmented generation (RAG) enhances large language models (LLMs) with external knowledge but incurs significant inference costs due to lengthy retrieved contexts. While context compression mitigates this issue, existing methods…

Computation and Language · Computer Science 2025-09-25 Shuyu Guo , Shuo Zhang , Zhaochun Ren

Contextual Compression in Retrieval-Augmented Generation for Large Language Models: A Survey

Large Language Models (LLMs) showcase remarkable abilities, yet they struggle with limitations such as hallucinations, outdated knowledge, opacity, and inexplicable reasoning. To address these challenges, Retrieval-Augmented Generation…

Computation and Language · Computer Science 2024-10-03 Sourav Verma

HASH-RAG: Bridging Deep Hashing with Retriever for Efficient, Fine Retrieval and Augmented Generation

Retrieval-Augmented Generation (RAG) encounters efficiency challenges when scaling to massive knowledge bases while preserving contextual relevance. We propose Hash-RAG, a framework that integrates deep hashing techniques with systematic…

Information Retrieval · Computer Science 2025-06-04 Jinyu Guo , Xunlei Chen , Qiyang Xia , Zhaokun Wang , Jie Ou , Libo Qin , Shunyu Yao , Wenhong Tian

HeteRAG: A Heterogeneous Retrieval-augmented Generation Framework with Decoupled Knowledge Representations

Retrieval-augmented generation (RAG) methods can enhance the performance of LLMs by incorporating retrieved knowledge chunks into the generation process. In general, the retrieval and generation steps usually have different requirements for…

Information Retrieval · Computer Science 2025-04-16 Peiru Yang , Xintian Li , Zhiyang Hu , Jiapeng Wang , Jinhua Yin , Huili Wang , Lizhi He , Shuai Yang , Shangguang Wang , Yongfeng Huang , Tao Qi

Controlled Retrieval-augmented Context Evaluation for Long-form RAG

Retrieval-augmented generation (RAG) enhances large language models by incorporating context retrieved from external knowledge sources. While the effectiveness of the retrieval module is typically evaluated with relevance-based ranking…

Information Retrieval · Computer Science 2026-01-13 Jia-Huei Ju , Suzan Verberne , Maarten de Rijke , Andrew Yates

Influence Guided Context Selection for Effective Retrieval-Augmented Generation

Retrieval-Augmented Generation (RAG) addresses large language model (LLM) hallucinations by grounding responses in external knowledge, but its effectiveness is compromised by poor-quality retrieved contexts containing irrelevant or noisy…

Computation and Language · Computer Science 2025-10-27 Jiale Deng , Yanyan Shen , Ziyuan Pei , Youmin Chen , Linpeng Huang

Query Decomposition for RAG: Balancing Exploration-Exploitation

Retrieval-augmented generation (RAG) systems address complex user requests by decomposing them into subqueries, retrieving potentially relevant documents for each, and then aggregating them to generate an answer. Efficiently selecting…

Artificial Intelligence · Computer Science 2025-10-22 Roxana Petcu , Kenton Murray , Daniel Khashabi , Evangelos Kanoulas , Maarten de Rijke , Dawn Lawrie , Kevin Duh

Understanding the Design Decisions of Retrieval-Augmented Generation Systems

Retrieval-Augmented Generation (RAG) has emerged as a critical technique for enhancing large language model (LLM) capabilities. However, practitioners face significant challenges when making RAG deployment decisions. While existing research…

Software Engineering · Computer Science 2025-07-22 Shengming Zhao , Yuchen Shao , Yuheng Huang , Jiayang Song , Zhijie Wang , Chengcheng Wan , Lei Ma

SFR-RAG: Towards Contextually Faithful LLMs

Retrieval Augmented Generation (RAG), a paradigm that integrates external contextual information with large language models (LLMs) to enhance factual accuracy and relevance, has emerged as a pivotal area in generative AI. The LLMs used in…

Computation and Language · Computer Science 2024-09-17 Xuan-Phi Nguyen , Shrey Pandit , Senthil Purushwalkam , Austin Xu , Hailin Chen , Yifei Ming , Zixuan Ke , Silvio Savarese , Caiming Xong , Shafiq Joty