English
Related papers

Related papers: RAG-Stack: Co-Optimizing RAG Quality and Performan…

200 papers

Retrieval-augmented generation (RAG) is a promising method for addressing some of the memory-related challenges associated with Large Language Models (LLMs). Two separate systems form the RAG pipeline, the retriever and the reader, and the…

Computation and Language · Computer Science 2024-11-13 Alexandria Leto , Cecilia Aguerrebere , Ishwar Bhati , Ted Willke , Mariano Tepper , Vy Ai Vo

Deploying Large Language Model (LLM) applications, particularly those relying on Retrieval-Augmented Generation (RAG), remains challenging due to high computational demands, outdated knowledge bases, and the need to manually select optimal…

Retrieval Augmented Generation (RAG) provides the necessary informational grounding to LLMs in the form of chunks retrieved from a vector database or through web search. RAG could also use knowledge graph triples as a means of providing…

Information Retrieval · Computer Science 2026-03-31 Shalin Shah , Srikanth Ryali , Ramasubbu Venkatesh

Retrieval-Augmented Generation (RAG) has gained significant attention in recent years for its potential to enhance natural language understanding and generation by combining large-scale retrieval systems with generative models. RAG…

Computation and Language · Computer Science 2025-03-18 Mingyue Cheng , Yucong Luo , Jie Ouyang , Qi Liu , Huijie Liu , Li Li , Shuo Yu , Bohou Zhang , Jiawei Cao , Jie Ma , Daoyu Wang , Enhong Chen

Retrieval-augmented generation (RAG) pipelines have become the de-facto approach for building AI assistants with access to external, domain-specific knowledge. Given a user query, RAG pipelines typically first retrieve (R) relevant…

Human-Computer Interaction · Computer Science 2025-04-21 Quentin Romero Lauro , Shreya Shankar , Sepanta Zeighami , Aditya Parameswaran

Retrieval-augmented generation (RAG), which combines large language models (LLMs) with retrievals from external knowledge databases, is emerging as a popular approach for reliable LLM serving. However, efficient RAG serving remains an open…

Information Retrieval · Computer Science 2025-03-24 Wenqi Jiang , Suvinay Subramanian , Cat Graves , Gustavo Alonso , Amir Yazdanbakhsh , Vidushi Dadu

Retrieval-Augmented Generation (RAG) has emerged as a powerful paradigm for enhancing the performance of large language models (LLMs) by integrating external knowledge into the generation process. A key component of RAG pipelines is the…

Computation and Language · Computer Science 2025-04-07 Yuwei An , Yihua Cheng , Seo Jin Park , Junchen Jiang

Large language models (LLMs) are very costly and inefficient to update with new information. To address this limitation, retrieval-augmented generation (RAG) has been proposed as a solution that dynamically incorporates external knowledge…

Computation and Language · Computer Science 2025-07-10 Sezen Perçin , Xin Su , Qutub Sha Syed , Phillip Howard , Aleksei Kuvshinov , Leo Schwinn , Kay-Ulrich Scholl

Retrieval-Augmented Generation (RAG) is increasingly employed in generative AI-driven scientific workflows to integrate rapidly evolving scientific knowledge bases, yet its reliability is frequently compromised by non-determinism in their…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-09-24 Baiqiang Wang , Dongfang Zhao , Nathan R Tallent , Luanzheng Guo

Retrieval-Augmented Generation (RAG) struggles on long, structured financial filings where relevant evidence is sparse and cross-referenced. This paper presents a systematic investigation of advanced metadata-driven Retrieval-Augmented…

Information Retrieval · Computer Science 2025-10-29 Michail Dadopoulos , Anestis Ladas , Stratos Moschidis , Ioannis Negkakis

Retrieval-Augmented Generation (RAG) systems have emerged as a promising solution to enhance large language models (LLMs) by integrating external knowledge retrieval with generative capabilities. While significant advancements have been…

Human-Computer Interaction · Computer Science 2025-08-11 Sizhe Cheng , Jiaping Li , Huanchen Wang , Yuxin Ma

Retrieval-Augmented Generation (RAG) has emerged as a powerful paradigm to enhance large language models (LLMs) by conditioning generation on external evidence retrieved at inference time. While RAG addresses critical limitations of…

Information Retrieval · Computer Science 2025-06-03 Chaitanya Sharma

Retrieval-Augmented Generation (RAG) has emerged as a promising framework to mitigate hallucinations in Large Language Models (LLMs), yet its overall performance is dependent on the underlying retrieval system. In the finance domain,…

Information Retrieval · Computer Science 2025-03-20 Sejong Kim , Hyunseo Song , Hyunwoo Seo , Hyunjun Kim

Retrieval-augmented generation (RAG) techniques have proven to be effective in integrating up-to-date information, mitigating hallucinations, and enhancing response quality, particularly in specialized domains. While many RAG approaches…

Retrieval-augmented generation (RAG) systems expose numerous design choices spanning query rewriting, chunking, retrieval depth, reranking, and context compression. In practice, these choices are often configured through heuristics,…

Artificial Intelligence · Computer Science 2026-05-29 Zhen Chen , Yibing Liu , Weihao Xie , Yu Liang , Peilin Chen , Shiqi Wang

This article provides a comprehensive systematic literature review of academic studies, industrial applications, and real-world deployments from 2018 to 2025, providing a practical guide and detailed overview of modern Retrieval-Augmented…

Information Retrieval · Computer Science 2026-01-12 Dean Wampler , Dave Nielson , Alireza Seddighi

Retrieval-Augmented Generation (RAG) is an effective approach to enhance the factual accuracy of large language models (LLMs) by retrieving information from external databases, which are typically composed of diverse sources, to supplement…

Machine Learning · Computer Science 2025-10-15 Jeongyeon Hwang , Junyoung Park , Hyejin Park , Dongwoo Kim , Sangdon Park , Jungseul Ok

Retrieval-Augmented Generation (RAG) has emerged as a powerful paradigm for enhancing the capabilities of large language models. However, existing RAG evaluation predominantly focuses on text retrieval and relies on opaque, end-to-end…

Information Retrieval · Computer Science 2025-05-19 Chuan Xu , Qiaosheng Chen , Yutong Feng , Gong Cheng

Advancements in model algorithms, the growth of foundational models, and access to high-quality datasets have propelled the evolution of Artificial Intelligence Generated Content (AIGC). Despite its notable successes, AIGC still faces…

Computer Vision and Pattern Recognition · Computer Science 2024-06-24 Penghao Zhao , Hailin Zhang , Qinhan Yu , Zhengren Wang , Yunteng Geng , Fangcheng Fu , Ling Yang , Wentao Zhang , Jie Jiang , Bin Cui

Retrieval-augmented generation (RAG) synergizes the retrieval of pertinent data with the generative capabilities of Large Language Models (LLMs), ensuring that the generated output is not only contextually relevant but also accurate and…

‹ Prev 1 2 3 10 Next ›