Related papers: Deep Tabular Research via Continual Experience-Dri…

DTRec: Learning Dynamic Reasoning Trajectories for Sequential Recommendation

Inspired by advances in LLMs, reasoning-enhanced sequential recommendation performs multi-step deliberation before making final predictions, unlocking greater potential for capturing user preferences. However, current methods are…

Information Retrieval · Computer Science 2025-12-17 Yifan Shao , Peilin Zhou , Shoujin Wang , Weizhi Zhang , Xu Cai , Sunghun Kim

Deep Research: A Systematic Survey

Large language models (LLMs) have rapidly evolved from text generators into powerful problem solvers. Yet, many open tasks demand critical thinking, multi-source, and verifiable outputs, which are beyond single-shot prompting or standard…

Computation and Language · Computer Science 2025-12-03 Zhengliang Shi , Yiqun Chen , Haitao Li , Weiwei Sun , Shiyu Ni , Yougang Lyu , Run-Ze Fan , Bowen Jin , Yixuan Weng , Minjun Zhu , Qiujie Xie , Xinyu Guo , Qu Yang , Jiayi Wu , Jujia Zhao , Xiaqiang Tang , Xinbei Ma , Cunxiang Wang , Jiaxin Mao , Qingyao Ai , Jen-Tse Huang , Wenxuan Wang , Yue Zhang , Yiming Yang , Zhaopeng Tu , Zhaochun Ren

Deep Research Agents: A Systematic Examination And Roadmap

The rapid progress of Large Language Models (LLMs) has given rise to a new category of autonomous AI systems, referred to as Deep Research (DR) agents. These agents are designed to tackle complex, multi-turn informational research tasks by…

Artificial Intelligence · Computer Science 2025-09-04 Yuxuan Huang , Yihang Chen , Haozheng Zhang , Kang Li , Huichi Zhou , Meng Fang , Linyi Yang , Xiaoguang Li , Lifeng Shang , Songcen Xu , Jianye Hao , Kun Shao , Jun Wang

Deep Researcher with Test-Time Diffusion

Deep research agents, powered by Large Language Models (LLMs), are rapidly advancing; yet, their performance often plateaus when generating complex, long-form research reports using generic test-time scaling algorithms. Drawing inspiration…

Computation and Language · Computer Science 2025-07-23 Rujun Han , Yanfei Chen , Zoey CuiZhu , Lesly Miculicich , Guan Sun , Yuanjun Bi , Weiming Wen , Hui Wan , Chunfeng Wen , Solène Maître , George Lee , Vishy Tirumalashetty , Emily Xue , Zizhao Zhang , Salem Haykal , Burak Gokturk , Tomas Pfister , Chen-Yu Lee

Hierarchical Deep Research with Local-Web RAG: Toward Automated System-Level Materials Discovery

We present a long-horizon, hierarchical deep research (DR) agent designed for complex materials and device discovery problems that exceed the scope of existing Machine Learning (ML) surrogates and closed-source commercial agents. Our…

Machine Learning · Computer Science 2025-12-04 Rui Ding , Rodrigo Pires Ferreira , Yuxin Chen , Junhong Chen

Efficient Tree-Structured Deep Research with Adaptive Resource Allocation

Deep research agents, which synthesize information across diverse sources, are significantly constrained by the sequential nature of reasoning. This bottleneck results in high latency, poor runtime adaptability, and inefficient resource…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-03-31 Lunyiu Nie , Nedim Lipka , Ryan A. Rossi , Swarat Chaudhuri

Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models

The agency expected of Agentic Large Language Models goes beyond answering correctly, requiring autonomy to set goals and decide what to explore. We term this investigatory intelligence, distinguishing it from executional intelligence,…

Artificial Intelligence · Computer Science 2026-05-19 Wei Liu , Peijie Yu , Michele Orini , Yali Du , Yulan He

Deep Reasoning in General Purpose Agents via Structured Meta-Cognition

Humans intuitively solve complex problems by flexibly shifting among reasoning modes: they plan, execute, revise intermediate goals, resolve ambiguity through associative judgment, and apply formal procedures to well-specified subproblems.…

Computation and Language · Computer Science 2026-05-13 Dean Light , Michael Theologitis , Kshitish Ghate , Shuyue Stella Li , Benjamin Newman , Chirag Shah , Aylin Caliskan , Pang Wei Koh , Dan Suciu , Yulia Tsvetkov

Beyond Static Retrieval: Opportunities and Pitfalls of Iterative Retrieval in GraphRAG

Retrieval-augmented generation (RAG) is a powerful paradigm for improving large language models (LLMs) on knowledge-intensive question answering. Graph-based RAG (GraphRAG) leverages entity-relation graphs to support multi-hop reasoning,…

Artificial Intelligence · Computer Science 2025-10-01 Kai Guo , Xinnan Dai , Shenglai Zeng , Harry Shomer , Haoyu Han , Yu Wang , Jiliang Tang

SDRL: Interpretable and Data-efficient Deep Reinforcement Learning Leveraging Symbolic Planning

Deep reinforcement learning (DRL) has gained great success by learning directly from high-dimensional sensory inputs, yet is notorious for the lack of interpretability. Interpretability of the subtasks is critical in hierarchical…

Artificial Intelligence · Computer Science 2019-03-01 Daoming Lyu , Fangkai Yang , Bo Liu , Steven Gustafson

HiRA: A Hierarchical Reasoning Framework for Decoupled Planning and Execution in Deep Search

Complex information needs in real-world search scenarios demand deep reasoning and knowledge synthesis across diverse sources, which traditional retrieval-augmented generation (RAG) pipelines struggle to address effectively. Current…

Artificial Intelligence · Computer Science 2025-11-03 Jiajie Jin , Xiaoxi Li , Guanting Dong , Yuyao Zhang , Yutao Zhu , Yang Zhao , Hongjin Qian , Zhicheng Dou

Multimodal Tabular Reasoning with Privileged Structured Information

Tabular reasoning involves multi-step information extraction and logical inference over tabular data. While recent advances have leveraged large language models (LLMs) for reasoning over structured tables, such high-quality textual…

Machine Learning · Computer Science 2025-06-05 Jun-Peng Jiang , Yu Xia , Hai-Long Sun , Shiyin Lu , Qing-Guo Chen , Weihua Luo , Kaifu Zhang , De-Chuan Zhan , Han-Jia Ye

RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents

LLM-based deep research agents are largely built on the ReAct framework. This linear design makes it difficult to revisit earlier states, branch into alternative search directions, or maintain global awareness under long contexts, often…

Computation and Language · Computer Science 2026-02-03 Jialiang Zhu , Gongrui Zhang , Xiaolong Ma , Lin Xu , Miaosen Zhang , Ruiqi Yang , Song Wang , Kai Qiu , Zhirong Wu , Qi Dai , Ruichun Ma , Bei Liu , Yifan Yang , Chong Luo , Zhengyuan Yang , Linjie Li , Lijuan Wang , Weizhu Chen , Xin Geng , Baining Guo

Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction

Efficient retrieval of external knowledge bases and web pages is crucial for enhancing the reasoning abilities of LLMs. Previous works on training LLMs to leverage external retrievers for solving complex problems have predominantly employed…

Artificial Intelligence · Computer Science 2025-11-17 Jun Xu , Xinkai Du , Yu Ao , Peilong Zhao , Yang Li , Ling Zhong , Lin Yuan , Zhongpu Bo , Xiaorui Wang , Mengshu Sun , Zhengke Gui , Dalong Zhang , Zhaoyang Wang , Qiwei Wang , Yangyang Hou , Zhiying Yin , Haofen Wang , Huajun Chen , Lei Liang , Jun Zhou

IntentRL: Training Proactive User-intent Agents for Open-ended Deep Research via Reinforcement Learning

Deep Research (DR) agents extend Large Language Models (LLMs) beyond parametric knowledge by autonomously retrieving and synthesizing evidence from large web corpora into long-form reports, enabling a long-horizon agentic paradigm. However,…

Artificial Intelligence · Computer Science 2026-02-04 Haohao Luo , Zexi Li , Yuexiang Xie , Wenhao Zhang , Yaliang Li , Ying Shen

Test-Time Deep Thinking to Explore Implicit Rules

With the continuous advancement of Large Language Models (LLMs), intelligent agents are becoming increasingly vital. However, these agents often fail in environments governed by implicit rules--hidden constraints that cannot be observed…

Artificial Intelligence · Computer Science 2026-05-26 Wentong Chen , Xin Cong , Zhong Zhang , Yaxi Lu , Siyuan Zhao , Yesai Wu , Qinyu Luo , Haotian Chen , Yankai Lin , Zhiyuan Liu , Maosong Sun

Bidirectional Recursive Neural Networks for Token-Level Labeling with Structure

Recently, deep architectures, such as recurrent and recursive neural networks have been successfully applied to various natural language processing tasks. Inspired by bidirectional recurrent neural networks which use representations that…

Machine Learning · Computer Science 2013-12-03 Ozan İrsoy , Claire Cardie

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Deep research systems are widely used for multi-step web research, analysis, and cross-source synthesis, yet their evaluation remains challenging. Existing benchmarks often require annotation-intensive task construction, rely on static…

Computation and Language · Computer Science 2026-01-15 Yibo Wang , Lei Wang , Yue Deng , Keming Wu , Yao Xiao , Huanjin Yao , Liwei Kang , Hai Ye , Yongcheng Jing , Lidong Bing

Extensive Exploration in Complex Traffic Scenarios using Hierarchical Reinforcement Learning

Developing an automated driving system capable of navigating complex traffic environments remains a formidable challenge. Unlike rule-based or supervised learning-based methods, Deep Reinforcement Learning (DRL) based controllers eliminate…

Machine Learning · Computer Science 2025-01-28 Zhihao Zhang , Ekim Yurtsever , Keith A. Redmill

Dense Retrieval as Indirect Supervision for Large-space Decision Making

Many discriminative natural language understanding (NLU) tasks have large label spaces. Learning such a process of large-space decision making is particularly challenging due to the lack of training instances per label and the difficulty of…

Computation and Language · Computer Science 2023-10-31 Nan Xu , Fei Wang , Mingtao Dong , Muhao Chen