English
Related papers

Related papers: Embedding-Based Context-Aware Reranker

200 papers

Retrieval-augmented generation (RAG) improves large language model reliability by grounding generated responses in external evidence. However, RAG performance depends on the relevance of retrieved passages, the quality of evidence ranking,…

Information Retrieval · Computer Science 2026-05-05 Fariba Afrin Irany , Sampson Akwafuo

Retrieval-Augmented Generation (RAG) struggles on long, structured financial filings where relevant evidence is sparse and cross-referenced. This paper presents a systematic investigation of advanced metadata-driven Retrieval-Augmented…

Information Retrieval · Computer Science 2025-10-29 Michail Dadopoulos , Anestis Ladas , Stratos Moschidis , Ioannis Negkakis

Retrieval-Augmented Generation (RAG) systems enhance text generation by incorporating external knowledge but often struggle when retrieving context across different text modalities due to semantic gaps. We introduce a generalized…

Machine Learning · Computer Science 2024-11-01 Arihan Yadav , Alan McMillan

Retrieval-Augmented Generation (RAG) methods enhance LLM performance by efficiently filtering relevant context for LLMs, reducing hallucinations and inference cost. However, most existing RAG methods focus on single-step retrieval, which is…

Evaluating retrieval-augmented generation (RAG) presents challenges, particularly for retrieval models within these systems. Traditional end-to-end evaluation methods are computationally expensive. Furthermore, evaluation of the retrieval…

Computation and Language · Computer Science 2024-04-23 Alireza Salemi , Hamed Zamani

Retrieval-Augmented Generation (RAG) depends on document ranking to provide useful evidence for generation, but conventional reranking methods mainly optimize query-document relevance rather than generation usefulness. A relevant document…

Computation and Language · Computer Science 2026-05-07 Zhipeng Song , Yizhi Zhou , Xiangyu Kong , Jiulong Jiao , Xuezhou Ye , Chunqi Gao , Xueqing Shi , Yuhang Zhou , Heng Qi

Retrieval-Augmented Generation (RAG) is a framework for grounding Large Language Models (LLMs) in external, up-to-date information. However, recent advancements in context window size allow LLMs to process inputs of up to 128K tokens or…

Machine Learning · Computer Science 2026-02-26 Seongwoong Shim , Myunsoo Kim , Jae Hyeon Cho , Byung-Jun Lee

In this paper we present APEX-Embedding-7B (Advanced Processing for Epistemic eXtraction), a 7-billion parameter decoder-only text Feature Extraction Model, specifically designed for Document Retrieval-Augmented Generation (RAG) tasks. Our…

Information Retrieval · Computer Science 2024-10-25 Thea Aviss

Retrieval-augmented generation (RAG) systems trained using reinforcement learning (RL) with reasoning are hampered by inefficient context management, where long, noisy retrieved documents increase costs and degrade performance. We introduce…

Computation and Language · Computer Science 2025-10-14 Zhichao Xu , Minheng Wang , Yawei Wang , Wenqian Ye , Yuntao Du , Yunpu Ma , Yijun Tian

Retrieval-Augmented Generation (RAG) has demonstrated strong effectiveness in knowledge-intensive tasks by grounding language generation in external evidence. Despite its success, many existing RAG systems are built based on a…

Computation and Language · Computer Science 2026-04-27 Lichang Song , Ting Long , Yi Chang

Retrieval-Augmented Generation (RAG) encounters efficiency challenges when scaling to massive knowledge bases while preserving contextual relevance. We propose Hash-RAG, a framework that integrates deep hashing techniques with systematic…

Information Retrieval · Computer Science 2025-06-04 Jinyu Guo , Xunlei Chen , Qiyang Xia , Zhaokun Wang , Jie Ou , Libo Qin , Shunyu Yao , Wenhong Tian

Large Language Models (LLMs) have achieved impressive progress in natural language processing, but their limited ability to retain long-term context constrains performance on document-level or multi-turn tasks. Retrieval-Augmented…

Computation and Language · Computer Science 2025-05-20 Zhangyu Wang , Siyuan Gao , Rong Zhou , Hao Wang , Li Ning

The existing Retrieval-Augmented Generation (RAG) systems face significant challenges in terms of cost and effectiveness. On one hand, they need to encode the lengthy retrieved contexts before responding to the input tasks, which imposes…

Computation and Language · Computer Science 2024-09-25 Zheng Liu , Chenyuan Wu , Ninglu Shao , Shitao Xiao , Chaozhuo Li , Defu Lian

Retrieval-Augmented Generation (RAG) systems remain brittle under realistic retrieval noise, even when the required evidence appears in the top-K results. A key reason is that retrievers and rerankers optimize solely for relevance, often…

Retrieval-augmented generation (RAG) enhances large language models (LLMs) by incorporating external knowledge to generate a response within a context with improved accuracy and reduced hallucinations. However, multi-modal RAG systems face…

Machine Learning · Computer Science 2025-01-09 Matin Mortaheb , Mohammad A. Amir Khojastepour , Srimat T. Chakradhar , Sennur Ulukus

Retrieval-augmented Generation (RAG) extends large language models (LLMs) with external knowledge but faces key challenges: restricted effective context length and redundancy in retrieved documents. Pure compression-based approaches reduce…

Computation and Language · Computer Science 2025-07-09 Yiqiao Jin , Kartik Sharma , Vineeth Rakesh , Yingtong Dou , Menghai Pan , Mahashweta Das , Srijan Kumar

Retrieval-Augmented Generation (RAG) systems enhance large language models (LLMs) by integrating external knowledge sources, enabling more accurate and contextually relevant responses tailored to user needs. However, existing RAG systems…

Information Retrieval · Computer Science 2025-04-29 Zirui Guo , Lianghao Xia , Yanhua Yu , Tu Ao , Chao Huang

Document retrieval systems have experienced a revitalized interest with the advent of retrieval-augmented generation (RAG). RAG architecture offers a lower hallucination rate than LLM-only applications. However, the accuracy of the…

Information Retrieval · Computer Science 2024-08-21 Kavsar Huseynova , Jafar Isbarov

A common approach to question answering with retrieval-augmented generation (RAG) is to concatenate documents into a single context and pass it to a language model to generate an answer. While simple, this strategy can obscure the…

Computation and Language · Computer Science 2026-04-27 Jinghong Chen , Jingbiao Mei , Guangyu Yang , Bill Byrne

Dense retrieval systems increasingly need to handle complex queries. In many realistic settings, users express intent through long instructions or task-specific descriptions, while target documents remain relatively simple and static. This…

Information Retrieval · Computer Science 2026-04-07 Seiji Maekawa , Moin Aminnaseri , Pouya Pezeshkpour , Estevam Hruschka
‹ Prev 1 2 3 10 Next ›