Related papers: ParallelMuse: Agentic Parallel Thinking for Deep I…

Efficient Tree-Structured Deep Research with Adaptive Resource Allocation

Deep research agents, which synthesize information across diverse sources, are significantly constrained by the sequential nature of reasoning. This bottleneck results in high latency, poor runtime adaptability, and inefficient resource…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-03-31 Lunyiu Nie , Nedim Lipka , Ryan A. Rossi , Swarat Chaudhuri

A Survey on Parallel Reasoning

With the increasing capabilities of Large Language Models (LLMs), parallel reasoning has emerged as a new inference paradigm that enhances reasoning robustness by concurrently exploring multiple lines of thought before converging on a final…

Computation and Language · Computer Science 2025-10-15 Ziqi Wang , Boye Niu , Zipeng Gao , Zhi Zheng , Tong Xu , Linghui Meng , Zhongli Li , Jing Liu , Yilong Chen , Chen Zhu , Hua Wu , Haifeng Wang , Enhong Chen

ParallelSearch: Train your LLMs to Decompose Query and Search Sub-queries in Parallel with Reinforcement Learning

Reasoning-augmented search agents such as Search-R1, trained via reinforcement learning with verifiable rewards (RLVR), demonstrate remarkable capabilities in multi-step information retrieval from external knowledge sources. These agents…

Computation and Language · Computer Science 2025-08-14 Shu Zhao , Tan Yu , Anbang Xu , Japinder Singh , Aaditya Shukla , Rama Akkiraju

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Parallel thinking has emerged as a promising paradigm for reasoning, yet it imposes significant computational burdens. Existing efficiency methods primarily rely on local, per-trajectory signals and lack principled mechanisms to exploit…

Computation and Language · Computer Science 2026-02-12 Tong Zheng , Chengsong Huang , Runpeng Dai , Yun He , Rui Liu , Xin Ni , Huiwen Bao , Kaishen Wang , Hongtu Zhu , Jiaxin Huang , Furong Huang , Heng Huang

Dynamic Parallel Tree Search for Efficient LLM Reasoning

Tree of Thoughts (ToT) enhances Large Language Model (LLM) reasoning by structuring problem-solving as a spanning tree. However, recent methods focus on search accuracy while overlooking computational efficiency. The challenges of…

Artificial Intelligence · Computer Science 2025-02-28 Yifu Ding , Wentao Jiang , Shunyu Liu , Yongcheng Jing , Jinyang Guo , Yingjie Wang , Jing Zhang , Zengmao Wang , Ziwei Liu , Bo Du , Xianglong Liu , Dacheng Tao

Diffuse Thinking: Exploring Diffusion Language Models as Efficient Thought Proposers for Reasoning

In recent years, large language models (LLMs) have witnessed remarkable advancements, with the test-time scaling law consistently enhancing the reasoning capabilities. Through systematic evaluation and exploration of a diverse spectrum of…

Computation and Language · Computer Science 2025-11-03 Chenyang Shao , Sijian Ren , Fengli Xu , Yong Li

Argus: Evidence Assembly for Scalable Deep Research Agents

Deep research agents have achieved remarkable progress on complex information seeking tasks. Even long ReAct style rollouts explore only a single trajectory, while recent state of the art systems scale inference time compute via parallel…

Computation and Language · Computer Science 2026-05-21 Zhen Zhang , Liangcai Su , Zhuo Chen , Xiang Lin , Haotian Xu , Simon Shaolei Du , Kaiyu Yang , Bo An , Lidong Bing , Xinyu Wang

Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling

Test-Time Scaling (TTS) enhances the reasoning capabilities of large language models by allocating additional inference compute to explore the solution space. However, existing parallel TTS methods typically keep branches isolated during…

Computation and Language · Computer Science 2026-05-27 Xinglin Wang , Hao Lin , Shaoxiong Feng , Peiwen Yuan , Yiwei Li , Jiayi Shi , Yueqi Zhang , Chuyi Tan , Ji Zhang , Boyuan Pan , Yao Hu , Kan Li

DeepPrune: Parallel Scaling without Inter-trace Redundancy

Parallel scaling has emerged as a powerful paradigm to enhance reasoning capabilities in large language models (LLMs) by generating multiple Chain-of-Thought (CoT) traces simultaneously. However, this approach introduces significant…

Computation and Language · Computer Science 2026-04-17 Shangqing Tu , Yaxuan Li , Yushi Bai , Lei Hou , Juanzi Li

Hybrid Deep Searcher: Scalable Parallel and Sequential Search Reasoning

Large reasoning models (LRMs) combined with retrieval-augmented generation (RAG) have enabled deep research agents capable of multi-step reasoning with external knowledge retrieval. However, we find that existing approaches rarely…

Artificial Intelligence · Computer Science 2026-05-26 Dayoon Ko , Jihyuk Kim , Haeju Park , Sohyeon Kim , Dahyun Lee , Yongrae Jo , Gunhee Kim , Moontae Lee , Kyungjae Lee

Scaling Retrieval-Augmented Reasoning with Parallel Search and Explicit Merging

Deep search agents have proven effective in enhancing LLMs by retrieving external knowledge during multi-step reasoning. However, existing methods often generate a single query for retrieval at each reasoning step, limiting information…

Artificial Intelligence · Computer Science 2026-05-14 Jiabei Liu , Wenyu Mao , Junfei Tan , Chunxu Shen , Lingling Yi , Jiancan Wu , Xiang Wang

Self-Manager: Parallel Agent Loop for Long-form Deep Research

Long-form deep research requires multi-faceted investigations over extended horizons to get a comprehensive report. When handling such complex tasks, existing agents manage context at the subtask level to overcome linear context…

Computation and Language · Computer Science 2026-01-27 Yilong Xu , Zhi Zheng , Xiang Long , Yujun Cai , Yiwei Wang

Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity

Recent advances in large language models (LLMs) have demonstrated the power of reasoning through self-generated chains of thought. Multiple reasoning agents can collaborate to raise joint reasoning quality above individual outcomes.…

Artificial Intelligence · Computer Science 2025-05-19 Chan-Jan Hsu , Davide Buffelli , Jamie McGowan , Feng-Ting Liao , Yi-Chang Chen , Sattar Vakili , Da-shan Shiu

DLLM-Searcher: Adapting Diffusion Large Language Model for Search Agents

Recently, Diffusion Large Language Models (dLLMs) have demonstrated unique efficiency advantages, enabled by their inherently parallel decoding mechanism and flexible generation paradigm. Meanwhile, despite the rapid advancement of Search…

Artificial Intelligence · Computer Science 2026-02-10 Jiahao Zhao , Shaoxuan Xu , Zhongxiang Sun , Fengqi Zhu , Jingyang Ou , Yuling Shi , Chongxuan Li , Xiao Zhang , Jun Xu

Parallel symbolic state-space exploration is difficult, but what is the alternative?

State-space exploration is an essential step in many modeling and analysis problems. Its goal is to find the states reachable from the initial state of a discrete-state model described. The state space can used to answer important…

Logic in Computer Science · Computer Science 2009-12-16 Gianfranco Ciardo , Yang Zhao , Xiaoqing Jin

BrowseMaster: Towards Scalable Web Browsing via Tool-Augmented Programmatic Agent Pair

Effective information seeking in the vast and ever-growing digital landscape requires balancing expansive search with strategic reasoning. Current large language model (LLM)-based agents struggle to achieve this balance due to limitations…

Artificial Intelligence · Computer Science 2025-08-13 Xianghe Pang , Shuo Tang , Rui Ye , Yuwen Du , Yaxin Du , Siheng Chen

Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution

Large language models (LLMs) have demonstrated remarkable capabilities in complex reasoning tasks when equipped with external tools. However, current frameworks predominantly rely on sequential processing, leading to inefficient execution…

Artificial Intelligence · Computer Science 2025-10-01 Tianrui Qin , Qianben Chen , Sinuo Wang , He Xing , King Zhu , He Zhu , Dingfeng Shi , Xinxin Liu , Ge Zhang , Jiaheng Liu , Yuchen Eleanor Jiang , Xitong Gao , Wangchunshu Zhou

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

Recent deep research agents primarily improve performance by scaling reasoning depth, but this leads to high inference cost and latency in search-intensive scenarios. Moreover, generalization across heterogeneous research settings remains…

Computation and Language · Computer Science 2026-03-02 Qianben Chen , Tianrui Qin , King Zhu , Qiexiang Wang , Chengjun Yu , Shu Xu , Jiaqi Wu , Jiayu Zhang , Xinpeng Liu , Xin Gui , Jingyi Cao , Piaohong Wang , Dingfeng Shi , He Zhu , Tiannan Wang , Yuqing Wang , Maojia Song , Tianyu Zheng , Ge Zhang , Jian Yang , Jiaheng Liu , Minghao Liu , Yuchen Eleanor Jiang , Wangchunshu Zhou

Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence

Recent advances in reasoning models have demonstrated significant improvements in accuracy by employing detailed and comprehensive reasoning processes. However, generating these lengthy reasoning sequences is computationally expensive and…

Computation and Language · Computer Science 2025-08-27 Yijiong Yu

Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks

We study parallel test-time scaling for long-horizon agentic tasks such as agentic search and deep research, where multiple rollouts are generated in parallel and aggregated into a final response. While such scaling has proven effective for…

Computation and Language · Computer Science 2026-04-14 Yoonsang Lee , Howard Yen , Xi Ye , Danqi Chen