Related papers: VersionRAG: Version-Aware Retrieval-Augmented Gene…

Enhancing Document VQA Models via Retrieval-Augmented Generation

Document Visual Question Answering (Document VQA) must cope with documents that span dozens of pages, yet leading systems still concatenate every page or rely on very large vision-language models, both of which are memory-hungry.…

Computer Vision and Pattern Recognition · Computer Science 2025-08-29 Eric López , Artemis Llabrés , Ernest Valveny

RegionRAG: Region-level Retrieval-Augmented Generation for Visual Document Understanding

Multi-modal Retrieval-Augmented Generation (RAG) has become a critical method for empowering LLMs by leveraging candidate visual documents. However, current methods consider the entire document as the basic retrieval unit, introducing…

Computer Vision and Pattern Recognition · Computer Science 2025-12-23 Yinglu Li , Zhiying Lu , Zhihang Liu , Yiwei Sun , Chuanbin Liu , Hongtao Xie

Retrieval-Augmented Generation with Estimation of Source Reliability

Retrieval-Augmented Generation (RAG) is an effective approach to enhance the factual accuracy of large language models (LLMs) by retrieving information from external databases, which are typically composed of diverse sources, to supplement…

Machine Learning · Computer Science 2025-10-15 Jeongyeon Hwang , Junyoung Park , Hyejin Park , Dongwoo Kim , Sangdon Park , Jungseul Ok

mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge Graphs

Retrieval-Augmented Generation (RAG) has emerged as a powerful paradigm for enhancing the capabilities of large language models. However, existing RAG evaluation predominantly focuses on text retrieval and relies on opaque, end-to-end…

Information Retrieval · Computer Science 2025-05-19 Chuan Xu , Qiaosheng Chen , Yutong Feng , Gong Cheng

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

Retrieval-augmented generation (RAG) is an effective technique that enables large language models (LLMs) to utilize external knowledge sources for generation. However, current RAG systems are solely based on text, rendering it impossible to…

Information Retrieval · Computer Science 2025-03-04 Shi Yu , Chaoyue Tang , Bokai Xu , Junbo Cui , Junhao Ran , Yukun Yan , Zhenghao Liu , Shuo Wang , Xu Han , Zhiyuan Liu , Maosong Sun

VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents

We aim to develop a retrieval-augmented generation (RAG) framework that answers questions over a corpus of visually-rich documents presented in mixed modalities (e.g., charts, tables) and diverse formats (e.g., PDF, PPTX). In this paper, we…

Computation and Language · Computer Science 2025-04-15 Ryota Tanaka , Taichi Iki , Taku Hasegawa , Kyosuke Nishida , Kuniko Saito , Jun Suzuki

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Retrieval-Augmented Generation (RAG) is a powerful strategy for improving the factual accuracy of models by retrieving external knowledge relevant to queries and incorporating it into the generation process. However, existing approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-05-30 Soyeong Jeong , Kangsan Kim , Jinheon Baek , Sung Ju Hwang

NaviRAG: Towards Active Knowledge Navigation for Retrieval-Augmented Generation

Retrieval-augmented generation (RAG) typically relies on a flat retrieval paradigm that maps queries directly to static, isolated text segments. This approach struggles with more complex tasks that require the conditional retrieval and…

Computation and Language · Computer Science 2026-05-19 Jihao Dai , Dingjun Wu , Yuxuan Chen , Zheni Zeng , Yukun Yan , Zhenghao Liu , Maosong Sun

DR-RAG: Applying Dynamic Document Relevance to Retrieval-Augmented Generation for Question-Answering

Retrieval-Augmented Generation (RAG) has recently demonstrated the performance of Large Language Models (LLMs) in the knowledge-intensive tasks such as Question-Answering (QA). RAG expands the query context by incorporating external…

Machine Learning · Computer Science 2024-06-18 Zijian Hei , Weiling Liu , Wenjie Ou , Juyi Qiao , Junming Jiao , Guowen Song , Ting Tian , Yi Lin

CMRAG: Co-modality-based visual document retrieval and question answering

Retrieval-Augmented Generation (RAG) has become a core paradigm in document question answering tasks. However, existing methods have limitations when dealing with multimodal documents: one category of methods relies on layout analysis and…

Computation and Language · Computer Science 2026-03-09 Wang Chen , Wenhan Yu , Guanqiang Qi , Weikang Li , Yang Li , Lei Sha , Deguo Xia , Jizhou Huang

LightRAG: Simple and Fast Retrieval-Augmented Generation

Retrieval-Augmented Generation (RAG) systems enhance large language models (LLMs) by integrating external knowledge sources, enabling more accurate and contextually relevant responses tailored to user needs. However, existing RAG systems…

Information Retrieval · Computer Science 2025-04-29 Zirui Guo , Lianghao Xia , Yanhua Yu , Tu Ao , Chao Huang

Corrective Retrieval Augmented Generation

Large language models (LLMs) inevitably exhibit hallucinations since the accuracy of generated texts cannot be secured solely by the parametric knowledge they encapsulate. Although retrieval-augmented generation (RAG) is a practicable…

Computation and Language · Computer Science 2024-10-08 Shi-Qi Yan , Jia-Chen Gu , Yun Zhu , Zhen-Hua Ling

TagRAG: Tag-guided Hierarchical Knowledge Graph Retrieval-Augmented Generation

Retrieval-Augmented Generation enhances language models by retrieving external knowledge to support informed and grounded responses. However, traditional RAG methods rely on fragment-level retrieval, limiting their ability to address…

Information Retrieval · Computer Science 2026-05-05 Wenbiao Tao , Xinyuan Li , Yunshi Lan , Weining Qian

Evaluating Retrieval-Augmented Generation Variants for Natural Language-Based SQL and API Call Generation

Enterprise systems increasingly require natural language interfaces that can translate user requests into structured operations such as SQL queries and REST API calls. While large language models (LLMs) show promise for code generation…

Software Engineering · Computer Science 2026-02-10 Michael Marketsmüller , Simon Martin , Tim Schlippe

VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos

Retrieval-Augmented Generation (RAG) has demonstrated remarkable success in enhancing Large Language Models (LLMs) through external knowledge integration, yet its application has primarily focused on textual content, leaving the rich domain…

Information Retrieval · Computer Science 2025-02-04 Xubin Ren , Lingrui Xu , Long Xia , Shuaiqiang Wang , Dawei Yin , Chao Huang

Aligning LLMs for the Classroom with Knowledge-Based Retrieval -- A Comparative RAG Study

Large language models like ChatGPT are increasingly used in classrooms, but they often provide outdated or fabricated information that can mislead students. Retrieval Augmented Generation (RAG) improves reliability of LLMs by grounding…

Artificial Intelligence · Computer Science 2025-09-10 Amay Jain , Liu Cui , Si Chen

RobustVisRAG: Causality-Aware Vision-Based Retrieval-Augmented Generation under Visual Degradations

Vision-based Retrieval-Augmented Generation (VisRAG) leverages vision-language models (VLMs) to jointly retrieve relevant visual documents and generate grounded answers based on multimodal evidence. However, existing VisRAG models degrade…

Computer Vision and Pattern Recognition · Computer Science 2026-03-27 I-Hsiang Chen , Yu-Wei Liu , Tse-Yu Wu , Yu-Chien Chiang , Jen-Chien Yang , Wei-Ting Chen

Use Graph When It Needs: Efficiently and Adaptively Integrating Retrieval-Augmented Generation with Graphs

Large language models (LLMs) often struggle with knowledge-intensive tasks due to hallucinations and outdated parametric knowledge. While Retrieval-Augmented Generation (RAG) addresses this by integrating external corpora, its effectiveness…

Computation and Language · Computer Science 2026-02-04 Su Dong , Qinggang Zhang , Yilin Xiao , Shengyuan Chen , Chuang Zhou , Xiao Huang

HKRAG: Holistic Knowledge Retrieval-Augmented Generation Over Visually-Rich Documents

Existing multimodal Retrieval-Augmented Generation (RAG) methods for visually rich documents (VRD) are often biased towards retrieving salient knowledge(e.g., prominent text and visual elements), while largely neglecting the critical…

Information Retrieval · Computer Science 2025-11-26 Anyang Tong , Xiang Niu , ZhiPing Liu , Chang Tian , Yanyan Wei , Zenglin Shi , Meng Wang

TableRAG: A Retrieval Augmented Generation Framework for Heterogeneous Document Reasoning

Retrieval-Augmented Generation (RAG) has demonstrated considerable effectiveness in open-domain question answering. However, when applied to heterogeneous documents, comprising both textual and tabular components, existing RAG approaches…

Computation and Language · Computer Science 2025-10-01 Xiaohan Yu , Pu Jian , Chong Chen