Related papers: Embedding-Based Context-Aware Reranker

A Hybrid Retrieval and Reranking Framework for Evidence-Grounded Retrieval-Augmented Generation

Retrieval-augmented generation (RAG) improves large language model reliability by grounding generated responses in external evidence. However, RAG performance depends on the relevance of retrieved passages, the quality of evidence ranking,…

Information Retrieval · Computer Science 2026-05-05 Fariba Afrin Irany , Sampson Akwafuo

Metadata-Driven Retrieval-Augmented Generation for Financial Question Answering

Retrieval-Augmented Generation (RAG) struggles on long, structured financial filings where relevant evidence is sparse and cross-referenced. This paper presents a systematic investigation of advanced metadata-driven Retrieval-Augmented…

Information Retrieval · Computer Science 2025-10-29 Michail Dadopoulos , Anestis Ladas , Stratos Moschidis , Ioannis Negkakis

Mind the Gap: A Generalized Approach for Cross-Modal Embedding Alignment

Retrieval-Augmented Generation (RAG) systems enhance text generation by incorporating external knowledge but often struggle when retrieving context across different text modalities due to semantic gaps. We introduce a generalized…

Machine Learning · Computer Science 2024-11-01 Arihan Yadav , Alan McMillan

Q-RAG: Long Context Multi-step Retrieval via Value-based Embedder Training

Retrieval-Augmented Generation (RAG) methods enhance LLM performance by efficiently filtering relevant context for LLMs, reducing hallucinations and inference cost. However, most existing RAG methods focus on single-step retrieval, which is…

Machine Learning · Computer Science 2026-05-05 Artyom Sorokin , Nazar Buzun , Alexander Anokhin , Oleg Inozemcev , Egor Vedernikov , Petr Anokhin , Mikhail Burtsev , Trushkov Alexey , Yin Wenshuai , Evgeny Burnaev

Evaluating Retrieval Quality in Retrieval-Augmented Generation

Evaluating retrieval-augmented generation (RAG) presents challenges, particularly for retrieval models within these systems. Traditional end-to-end evaluation methods are computationally expensive. Furthermore, evaluation of the retrieval…

Computation and Language · Computer Science 2024-04-23 Alireza Salemi , Hamed Zamani

CAR: Query-Guided Confidence-Aware Reranking for Retrieval-Augmented Generation

Retrieval-Augmented Generation (RAG) depends on document ranking to provide useful evidence for generation, but conventional reranking methods mainly optimize query-document relevance rather than generation usefulness. A relevant document…

Computation and Language · Computer Science 2026-05-07 Zhipeng Song , Yizhi Zhou , Xiangyu Kong , Jiulong Jiao , Xuezhou Ye , Chunqi Gao , Xueqing Shi , Yuhang Zhou , Heng Qi

Beyond RAG vs. Long-Context: Learning Distraction-Aware Retrieval for Efficient Knowledge Grounding

Retrieval-Augmented Generation (RAG) is a framework for grounding Large Language Models (LLMs) in external, up-to-date information. However, recent advancements in context window size allow LLMs to process inputs of up to 128K tokens or…

Machine Learning · Computer Science 2026-02-26 Seongwoong Shim , Myunsoo Kim , Jae Hyeon Cho , Byung-Jun Lee

Improving Embedding Accuracy for Document Retrieval Using Entity Relationship Maps and Model-Aware Contrastive Sampling

In this paper we present APEX-Embedding-7B (Advanced Processing for Epistemic eXtraction), a 7-billion parameter decoder-only text Feature Extraction Model, specifically designed for Document Retrieval-Augmented Generation (RAG) tasks. Our…

Information Retrieval · Computer Science 2024-10-25 Thea Aviss

RECON: Reasoning with Condensation for Efficient Retrieval-Augmented Generation

Retrieval-augmented generation (RAG) systems trained using reinforcement learning (RL) with reasoning are hampered by inefficient context management, where long, noisy retrieved documents increase costs and degrade performance. We introduce…

Computation and Language · Computer Science 2025-10-14 Zhichao Xu , Minheng Wang , Yawei Wang , Wenqian Ye , Yuntao Du , Yunpu Ma , Yijun Tian

Rethinking Retrieval-Augmented Generation as a Cooperative Decision-Making Problem

Retrieval-Augmented Generation (RAG) has demonstrated strong effectiveness in knowledge-intensive tasks by grounding language generation in external evidence. Despite its success, many existing RAG systems are built based on a…

Computation and Language · Computer Science 2026-04-27 Lichang Song , Ting Long , Yi Chang

HASH-RAG: Bridging Deep Hashing with Retriever for Efficient, Fine Retrieval and Augmented Generation

Retrieval-Augmented Generation (RAG) encounters efficiency challenges when scaling to massive knowledge bases while preserving contextual relevance. We propose Hash-RAG, a framework that integrates deep hashing techniques with systematic…

Information Retrieval · Computer Science 2025-06-04 Jinyu Guo , Xunlei Chen , Qiyang Xia , Zhaokun Wang , Jie Ou , Libo Qin , Shunyu Yao , Wenhong Tian

ELITE: Embedding-Less retrieval with Iterative Text Exploration

Large Language Models (LLMs) have achieved impressive progress in natural language processing, but their limited ability to retain long-term context constrains performance on document-level or multi-turn tasks. Retrieval-Augmented…

Computation and Language · Computer Science 2025-05-20 Zhangyu Wang , Siyuan Gao , Rong Zhou , Hao Wang , Li Ning

Lighter And Better: Towards Flexible Context Adaptation For Retrieval Augmented Generation

The existing Retrieval-Augmented Generation (RAG) systems face significant challenges in terms of cost and effectiveness. On one hand, they need to encode the lengthy retrieved contexts before responding to the input tasks, which imposes…

Computation and Language · Computer Science 2024-09-25 Zheng Liu , Chenyuan Wu , Ninglu Shao , Shitao Xiao , Chaozhuo Li , Defu Lian

Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation

Retrieval-Augmented Generation (RAG) systems remain brittle under realistic retrieval noise, even when the required evidence appears in the top-K results. A key reason is that retrievers and rerankers optimize solely for relevance, often…

Computation and Language · Computer Science 2026-02-04 Jiashuo Sun , Pengcheng Jiang , Saizhuo Wang , Jiajun Fan , Heng Wang , Siru Ouyang , Ming Zhong , Yizhu Jiao , Chengsong Huang , Xueqiang Xu , Pengrui Han , Peiran Li , Jiaxin Huang , Ge Liu , Heng Ji , Jiawei Han

Re-ranking the Context for Multimodal Retrieval Augmented Generation

Retrieval-augmented generation (RAG) enhances large language models (LLMs) by incorporating external knowledge to generate a response within a context with improved accuracy and reduced hallucinations. However, multi-modal RAG systems face…

Machine Learning · Computer Science 2025-01-09 Matin Mortaheb , Mohammad A. Amir Khojastepour , Srimat T. Chakradhar , Sennur Ulukus

SARA: Selective and Adaptive Retrieval-augmented Generation with Context Compression

Retrieval-augmented Generation (RAG) extends large language models (LLMs) with external knowledge but faces key challenges: restricted effective context length and redundancy in retrieved documents. Pure compression-based approaches reduce…

Computation and Language · Computer Science 2025-07-09 Yiqiao Jin , Kartik Sharma , Vineeth Rakesh , Yingtong Dou , Menghai Pan , Mahashweta Das , Srijan Kumar

LightRAG: Simple and Fast Retrieval-Augmented Generation

Retrieval-Augmented Generation (RAG) systems enhance large language models (LLMs) by integrating external knowledge sources, enabling more accurate and contextually relevant responses tailored to user needs. However, existing RAG systems…

Information Retrieval · Computer Science 2025-04-29 Zirui Guo , Lianghao Xia , Yanhua Yu , Tu Ao , Chao Huang

Enhanced document retrieval with topic embeddings

Document retrieval systems have experienced a revitalized interest with the advent of retrieval-augmented generation (RAG). RAG architecture offers a lower hallucination rate than LLM-only applications. However, the accuracy of the…

Information Retrieval · Computer Science 2024-08-21 Kavsar Huseynova , Jafar Isbarov

BERAG: Bayesian Ensemble Retrieval-Augmented Generation for Knowledge-based Visual Question Answering

A common approach to question answering with retrieval-augmented generation (RAG) is to concatenate documents into a single context and pass it to a language model to generate an answer. While simple, this strategy can obscure the…

Computation and Language · Computer Science 2026-04-27 Jinghong Chen , Jingbiao Mei , Guangyu Yang , Bill Byrne

Align then Train: Efficient Retrieval Adapter Learning

Dense retrieval systems increasingly need to handle complex queries. In many realistic settings, users express intent through long instructions or task-specific descriptions, while target documents remain relatively simple and static. This…

Information Retrieval · Computer Science 2026-04-07 Seiji Maekawa , Moin Aminnaseri , Pouya Pezeshkpour , Estevam Hruschka