English
Related papers

Related papers: VersionRAG: Version-Aware Retrieval-Augmented Gene…

200 papers

Document Visual Question Answering (Document VQA) must cope with documents that span dozens of pages, yet leading systems still concatenate every page or rely on very large vision-language models, both of which are memory-hungry.…

Computer Vision and Pattern Recognition · Computer Science 2025-08-29 Eric López , Artemis Llabrés , Ernest Valveny

Multi-modal Retrieval-Augmented Generation (RAG) has become a critical method for empowering LLMs by leveraging candidate visual documents. However, current methods consider the entire document as the basic retrieval unit, introducing…

Computer Vision and Pattern Recognition · Computer Science 2025-12-23 Yinglu Li , Zhiying Lu , Zhihang Liu , Yiwei Sun , Chuanbin Liu , Hongtao Xie

Retrieval-Augmented Generation (RAG) is an effective approach to enhance the factual accuracy of large language models (LLMs) by retrieving information from external databases, which are typically composed of diverse sources, to supplement…

Machine Learning · Computer Science 2025-10-15 Jeongyeon Hwang , Junyoung Park , Hyejin Park , Dongwoo Kim , Sangdon Park , Jungseul Ok

Retrieval-Augmented Generation (RAG) has emerged as a powerful paradigm for enhancing the capabilities of large language models. However, existing RAG evaluation predominantly focuses on text retrieval and relies on opaque, end-to-end…

Information Retrieval · Computer Science 2025-05-19 Chuan Xu , Qiaosheng Chen , Yutong Feng , Gong Cheng

Retrieval-augmented generation (RAG) is an effective technique that enables large language models (LLMs) to utilize external knowledge sources for generation. However, current RAG systems are solely based on text, rendering it impossible to…

Information Retrieval · Computer Science 2025-03-04 Shi Yu , Chaoyue Tang , Bokai Xu , Junbo Cui , Junhao Ran , Yukun Yan , Zhenghao Liu , Shuo Wang , Xu Han , Zhiyuan Liu , Maosong Sun

We aim to develop a retrieval-augmented generation (RAG) framework that answers questions over a corpus of visually-rich documents presented in mixed modalities (e.g., charts, tables) and diverse formats (e.g., PDF, PPTX). In this paper, we…

Computation and Language · Computer Science 2025-04-15 Ryota Tanaka , Taichi Iki , Taku Hasegawa , Kyosuke Nishida , Kuniko Saito , Jun Suzuki

Retrieval-Augmented Generation (RAG) is a powerful strategy for improving the factual accuracy of models by retrieving external knowledge relevant to queries and incorporating it into the generation process. However, existing approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-05-30 Soyeong Jeong , Kangsan Kim , Jinheon Baek , Sung Ju Hwang

Retrieval-augmented generation (RAG) typically relies on a flat retrieval paradigm that maps queries directly to static, isolated text segments. This approach struggles with more complex tasks that require the conditional retrieval and…

Computation and Language · Computer Science 2026-05-19 Jihao Dai , Dingjun Wu , Yuxuan Chen , Zheni Zeng , Yukun Yan , Zhenghao Liu , Maosong Sun

Retrieval-Augmented Generation (RAG) has recently demonstrated the performance of Large Language Models (LLMs) in the knowledge-intensive tasks such as Question-Answering (QA). RAG expands the query context by incorporating external…

Machine Learning · Computer Science 2024-06-18 Zijian Hei , Weiling Liu , Wenjie Ou , Juyi Qiao , Junming Jiao , Guowen Song , Ting Tian , Yi Lin

Retrieval-Augmented Generation (RAG) has become a core paradigm in document question answering tasks. However, existing methods have limitations when dealing with multimodal documents: one category of methods relies on layout analysis and…

Computation and Language · Computer Science 2026-03-09 Wang Chen , Wenhan Yu , Guanqiang Qi , Weikang Li , Yang Li , Lei Sha , Deguo Xia , Jizhou Huang

Retrieval-Augmented Generation (RAG) systems enhance large language models (LLMs) by integrating external knowledge sources, enabling more accurate and contextually relevant responses tailored to user needs. However, existing RAG systems…

Information Retrieval · Computer Science 2025-04-29 Zirui Guo , Lianghao Xia , Yanhua Yu , Tu Ao , Chao Huang

Large language models (LLMs) inevitably exhibit hallucinations since the accuracy of generated texts cannot be secured solely by the parametric knowledge they encapsulate. Although retrieval-augmented generation (RAG) is a practicable…

Computation and Language · Computer Science 2024-10-08 Shi-Qi Yan , Jia-Chen Gu , Yun Zhu , Zhen-Hua Ling

Retrieval-Augmented Generation enhances language models by retrieving external knowledge to support informed and grounded responses. However, traditional RAG methods rely on fragment-level retrieval, limiting their ability to address…

Information Retrieval · Computer Science 2026-05-05 Wenbiao Tao , Xinyuan Li , Yunshi Lan , Weining Qian

Enterprise systems increasingly require natural language interfaces that can translate user requests into structured operations such as SQL queries and REST API calls. While large language models (LLMs) show promise for code generation…

Software Engineering · Computer Science 2026-02-10 Michael Marketsmüller , Simon Martin , Tim Schlippe

Retrieval-Augmented Generation (RAG) has demonstrated remarkable success in enhancing Large Language Models (LLMs) through external knowledge integration, yet its application has primarily focused on textual content, leaving the rich domain…

Information Retrieval · Computer Science 2025-02-04 Xubin Ren , Lingrui Xu , Long Xia , Shuaiqiang Wang , Dawei Yin , Chao Huang

Large language models like ChatGPT are increasingly used in classrooms, but they often provide outdated or fabricated information that can mislead students. Retrieval Augmented Generation (RAG) improves reliability of LLMs by grounding…

Artificial Intelligence · Computer Science 2025-09-10 Amay Jain , Liu Cui , Si Chen

Vision-based Retrieval-Augmented Generation (VisRAG) leverages vision-language models (VLMs) to jointly retrieve relevant visual documents and generate grounded answers based on multimodal evidence. However, existing VisRAG models degrade…

Computer Vision and Pattern Recognition · Computer Science 2026-03-27 I-Hsiang Chen , Yu-Wei Liu , Tse-Yu Wu , Yu-Chien Chiang , Jen-Chien Yang , Wei-Ting Chen

Large language models (LLMs) often struggle with knowledge-intensive tasks due to hallucinations and outdated parametric knowledge. While Retrieval-Augmented Generation (RAG) addresses this by integrating external corpora, its effectiveness…

Computation and Language · Computer Science 2026-02-04 Su Dong , Qinggang Zhang , Yilin Xiao , Shengyuan Chen , Chuang Zhou , Xiao Huang

Existing multimodal Retrieval-Augmented Generation (RAG) methods for visually rich documents (VRD) are often biased towards retrieving salient knowledge(e.g., prominent text and visual elements), while largely neglecting the critical…

Information Retrieval · Computer Science 2025-11-26 Anyang Tong , Xiang Niu , ZhiPing Liu , Chang Tian , Yanyan Wei , Zenglin Shi , Meng Wang

Retrieval-Augmented Generation (RAG) has demonstrated considerable effectiveness in open-domain question answering. However, when applied to heterogeneous documents, comprising both textual and tabular components, existing RAG approaches…

Computation and Language · Computer Science 2025-10-01 Xiaohan Yu , Pu Jian , Chong Chen
‹ Prev 1 2 3 10 Next ›