Related papers: Adaptive Deep Reasoning: Triggering Deep Thinking …

Think Smart, Not Hard: Difficulty Adaptive Reasoning for Large Audio Language Models

Large Audio Language Models (LALMs), powered by the chain-of-thought (CoT) paradigm, have shown remarkable reasoning capabilities. Intuitively, different problems often require varying depths of reasoning. While some methods can determine…

Machine Learning · Computer Science 2025-11-20 Zhichao Sheng , Shilin Zhou , Chen Gong , Zhenghua Li

Correct, Concise and Complete: Multi-stage Training For Adaptive Reasoning

The reasoning capabilities of large language models (LLMs) have improved substantially through increased test-time computation, typically in the form of intermediate tokens known as chain-of-thought (CoT). However, CoT often becomes…

Computation and Language · Computer Science 2026-01-07 Nathanaël Carraz Rakotonirina , Ren Pang , Neha Anna John , Michael Bohlke-Schneider , Momchil Hardalov

AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning

Large Language Models (LLMs) have demonstrated remarkable capabilities but often face challenges with tasks requiring sophisticated reasoning. While Chain-of-Thought (CoT) prompting significantly enhances reasoning, it indiscriminately…

Machine Learning · Computer Science 2025-05-27 Chenwei Lou , Zewei Sun , Xinnian Liang , Meng Qu , Wei Shen , Wenqi Wang , Yuntao Li , Qingping Yang , Shuangzhi Wu

Interleaved Reasoning for Large Language Models via Reinforcement Learning

Long chain-of-thought (CoT) significantly enhances the reasoning capabilities of large language models (LLMs). However, extensive reasoning traces lead to inefficiencies and increased time-to-first-token (TTFT). We propose a training…

Computation and Language · Computer Science 2026-01-08 Roy Xie , David Qiu , Deepak Gopinath , Dong Lin , Yanchao Sun , Chong Wang , Saloni Potdar , Bhuwan Dhingra

Think in Blocks: Adaptive Reasoning from Direct Response to Deep Reasoning

Large Language Models (LLMs) with chains-of-thought have demonstrated strong performance on an increasing range of tasks, particularly those involving complex logical reasoning. However, excessively long chains can lead to overthinking,…

Artificial Intelligence · Computer Science 2025-08-22 Yekun Zhu , Guang Chen , Chengjun Mao

Draft-Thinking: Learning Efficient Reasoning in Long Chain-of-Thought LLMs

Long chain-of-thought~(CoT) has become a dominant paradigm for enhancing the reasoning capability of large reasoning models~(LRMs); however, the performance gains often come with a substantial increase in reasoning budget. Recent studies…

Artificial Intelligence · Computer Science 2026-03-03 Jie Cao , Tianwei Lin , Zhenxuan Fan , Bo Yuan , Ziyuan Zhao , Rolan Yan , Wenqiao Zhang , Siliang Tang

A State-Transition Framework for Efficient LLM Reasoning

While Long Chain-of-Thought (CoT) reasoning significantly improves Large Language Models (LLMs) performance on complex reasoning tasks, the substantial computational and memory costs of generating long CoT sequences limit their efficiency…

Artificial Intelligence · Computer Science 2026-02-03 Liang Zhang , Yu Zhao , Longyue Wang , Tianqi Shi , Weihua Luo , Kaifu Zhang , Jinsong Su

Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs

Requiring a large language model (LLM) to generate intermediary reasoning steps, known as Chain of Thought (CoT), has been shown to be an effective way of boosting performance. Previous approaches have focused on generating multiple…

Computation and Language · Computer Science 2025-05-28 Haritz Puerto , Tilek Chubakov , Xiaodan Zhu , Harish Tayyar Madabushi , Iryna Gurevych

SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs

Chain-of-Thought (CoT) reasoning enables Large Language Models (LLMs) to solve complex reasoning tasks by generating intermediate reasoning steps. However, most existing approaches focus on hard token decoding, which constrains reasoning…

Computation and Language · Computer Science 2025-05-28 Yige Xu , Xu Guo , Zhiwei Zeng , Chunyan Miao

The Art of Efficient Reasoning: Data, Reward, and Optimization

Large Language Models (LLMs) consistently benefit from scaled Chain-of-Thought (CoT) reasoning, but also suffer from heavy computational overhead. To address this issue, efficient reasoning aims to incentivize short yet accurate thinking…

Computation and Language · Computer Science 2026-03-23 Taiqiang Wu , Zenan Xu , Bo Zhou , Ngai Wong

Long or short CoT? Investigating Instance-level Switch of Large Reasoning Models

With the rapid advancement of large reasoning models, long Chain-of-Thought (CoT) prompting has demonstrated strong performance on complex tasks. However, this often comes with a significant increase in token usage. In this paper, we…

Computation and Language · Computer Science 2025-06-05 Ruiqi Zhang , Changyi Xiao , Yixin Cao

Demystifying Long Chain-of-Thought Reasoning in LLMs

Scaling inference compute enhances reasoning in large language models (LLMs), with long chains-of-thought (CoTs) enabling strategies like backtracking and error correction. Reinforcement learning (RL) has emerged as a crucial method for…

Computation and Language · Computer Science 2025-02-06 Edward Yeo , Yuxuan Tong , Morry Niu , Graham Neubig , Xiang Yue

Not All Thoughts are Generated Equal: Efficient LLM Reasoning via Multi-Turn Reinforcement Learning

Compressing long chain-of-thought (CoT) from large language models (LLMs) is an emerging strategy to improve the reasoning efficiency of LLMs. Despite its promising benefits, existing studies equally compress all thoughts within a long CoT,…

Computation and Language · Computer Science 2025-05-27 Yansong Ning , Wei Li , Jun Fang , Naiqiang Tan , Hao Liu

Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models

Recent advancements in reasoning with large language models (RLLMs), such as OpenAI-O1 and DeepSeek-R1, have demonstrated their impressive capabilities in complex domains like mathematics and coding. A central factor in their success lies…

Artificial Intelligence · Computer Science 2025-07-21 Qiguang Chen , Libo Qin , Jinhao Liu , Dengyun Peng , Jiannan Guan , Peng Wang , Mengkang Hu , Yuhang Zhou , Te Gao , Wanxiang Che

The Impact of Reasoning Step Length on Large Language Models

Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correlation between the effectiveness of CoT and the length of reasoning steps in prompts remains largely unknown. To…

Computation and Language · Computer Science 2024-06-25 Mingyu Jin , Qinkai Yu , Dong Shu , Haiyan Zhao , Wenyue Hua , Yanda Meng , Yongfeng Zhang , Mengnan Du

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Large Language Models (LLMs) have demonstrated remarkable capabilities in complex tasks. Recent advancements in Large Reasoning Models (LRMs), such as OpenAI o1 and DeepSeek-R1, have further improved performance in System-2 reasoning…

Computation and Language · Computer Science 2025-08-25 Yang Sui , Yu-Neng Chuang , Guanchu Wang , Jiamu Zhang , Tianyi Zhang , Jiayi Yuan , Hongyi Liu , Andrew Wen , Shaochen Zhong , Na Zou , Hanjie Chen , Xia Hu

Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective

Recent studies have discovered that Chain-of-Thought prompting (CoT) can dramatically improve the performance of Large Language Models (LLMs), particularly when dealing with complex tasks involving mathematics or reasoning. Despite the…

Machine Learning · Computer Science 2023-12-27 Guhao Feng , Bohang Zhang , Yuntian Gu , Haotian Ye , Di He , Liwei Wang

Automatic Chain of Thought Prompting in Large Language Models

Large language models (LLMs) can perform complex reasoning by generating intermediate reasoning steps. Providing these steps for prompting demonstrations is called chain-of-thought (CoT) prompting. CoT prompting has two major paradigms. One…

Computation and Language · Computer Science 2022-10-10 Zhuosheng Zhang , Aston Zhang , Mu Li , Alex Smola

Short-Path Prompting in LLMs: Analyzing Reasoning Instability and Solutions for Robust Performance

Recent years have witnessed significant progress in large language models' (LLMs) reasoning, which is largely due to the chain-of-thought (CoT) approaches, allowing models to generate intermediate reasoning steps before reaching the final…

Computation and Language · Computer Science 2025-04-15 Zuoli Tang , Junjie Ou , Kaiqin Hu , Chunwei Wu , Zhaoxin Huan , Chilin Fu , Xiaolu Zhang , Jun Zhou , Chenliang Li

Rethinking Chain-of-Thought from the Perspective of Self-Training

Chain-of-thought (CoT) reasoning has emerged as an effective approach for activating latent capabilities in LLMs. Interestingly, we observe that both CoT reasoning and self-training share the core objective: iteratively leveraging…

Computation and Language · Computer Science 2025-05-27 Zongqian Wu , Baoduo Xu , Ruochen Cui , Mengmeng Zhan , Xiaofeng Zhu , Lei Feng