English

DeepNote: Note-Centric Deep Retrieval-Augmented Generation

Computation and Language 2025-04-08 v2

Abstract

Retrieval-Augmented Generation (RAG) mitigates factual errors and hallucinations in Large Language Models (LLMs) for question-answering (QA) by incorporating external knowledge. However, existing adaptive RAG methods rely on LLMs to predict retrieval timing and directly use retrieved information for generation, often failing to reflect real information needs and fully leverage retrieved knowledge. We develop DeepNote, an adaptive RAG framework that achieves in-depth and robust exploration of knowledge sources through note-centric adaptive retrieval. DeepNote employs notes as carriers for refining and accumulating knowledge. During in-depth exploration, it uses these notes to determine retrieval timing, formulate retrieval queries, and iteratively assess knowledge growth, ultimately leveraging the best note for answer generation. Extensive experiments and analyses demonstrate that DeepNote significantly outperforms all baselines (+10.2% to +20.1%) and exhibits the ability to gather knowledge with both high density and quality. Additionally, DPO further improves the performance of DeepNote. The code and data are available at https://github.com/thunlp/DeepNote.

Keywords

Cite

@article{arxiv.2410.08821,
  title  = {DeepNote: Note-Centric Deep Retrieval-Augmented Generation},
  author = {Ruobing Wang and Qingfei Zhao and Yukun Yan and Daren Zha and Yuxuan Chen and Shi Yu and Zhenghao Liu and Yixuan Wang and Shuo Wang and Xu Han and Zhiyuan Liu and Maosong Sun},
  journal= {arXiv preprint arXiv:2410.08821},
  year   = {2025}
}

Comments

28 pages, 6 figures, 21 tables

R2 v1 2026-06-28T19:17:50.789Z