Related papers: Efficient Reasoning Models: A Survey

Enhancing Chain of Thought Prompting in Large Language Models via Reasoning Patterns

Chain of Thought (CoT) prompting can encourage language models to engage in multi-step logical reasoning. The quality of the provided demonstrations significantly influences the success of downstream inference tasks. Current unsupervised…

Computation and Language · Computer Science 2025-05-27 Yufeng Zhang , Xuepeng Wang , Lingxiang Wu , Jinqiao Wang

Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models

Recent advancements in reasoning with large language models (RLLMs), such as OpenAI-O1 and DeepSeek-R1, have demonstrated their impressive capabilities in complex domains like mathematics and coding. A central factor in their success lies…

Artificial Intelligence · Computer Science 2025-07-21 Qiguang Chen , Libo Qin , Jinhao Liu , Dengyun Peng , Jiannan Guan , Peng Wang , Mengkang Hu , Yuhang Zhou , Te Gao , Wanxiang Che

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Recent Large Reasoning Models (LRMs), such as DeepSeek-R1 and OpenAI o1, have demonstrated strong performance gains by scaling up the length of Chain-of-Thought (CoT) reasoning during inference. However, a growing concern lies in their…

Computation and Language · Computer Science 2026-01-01 Xiaoye Qu , Yafu Li , Zhao-Chen Su , Weigao Sun , Jianhao Yan , Dongrui Liu , Ganqu Cui , Daizong Liu , Shuxian Liang , Junxian He , Peng Li , Wei Wei , Jing Shao , Chaochao Lu , Yue Zhang , Xian-Sheng Hua , Bowen Zhou , Yu Cheng

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Large Language Models (LLMs) have demonstrated remarkable capabilities in complex tasks. Recent advancements in Large Reasoning Models (LRMs), such as OpenAI o1 and DeepSeek-R1, have further improved performance in System-2 reasoning…

Computation and Language · Computer Science 2025-08-25 Yang Sui , Yu-Neng Chuang , Guanchu Wang , Jiamu Zhang , Tianyi Zhang , Jiayi Yuan , Hongyi Liu , Andrew Wen , Shaochen Zhong , Na Zou , Hanjie Chen , Xia Hu

Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Chain-of-thought (CoT) decoding enables language models to improve reasoning performance at the cost of high generation latency in decoding. Recent proposals have explored variants of contemplation tokens, a term we introduce that refers to…

Computation and Language · Computer Science 2024-12-18 Jeffrey Cheng , Benjamin Van Durme

Self-Training Elicits Concise Reasoning in Large Language Models

Chain-of-thought (CoT) reasoning has enabled large language models (LLMs) to utilize additional computation through intermediate tokens to solve complex tasks. However, we posit that typical reasoning traces contain many redundant tokens,…

Computation and Language · Computer Science 2025-06-11 Tergel Munkhbat , Namgyu Ho , Seo Hyun Kim , Yongjin Yang , Yujin Kim , Se-Young Yun

Efficient Inference for Large Reasoning Models: A Survey

Large Reasoning Models (LRMs) significantly improve the reasoning ability of Large Language Models (LLMs) by learning to reason, exhibiting promising performance in solving complex tasks. However, their deliberative reasoning process leads…

Computation and Language · Computer Science 2025-08-14 Yue Liu , Jiaying Wu , Yufei He , Ruihan Gong , Jun Xia , Liang Li , Hongcheng Gao , Hongyu Chen , Baolong Bi , Jiaheng Zhang , Zhiqi Huang , Bryan Hooi , Stan Z. Li , Keqin Li

Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning

Large Language Models (LLMs) have shown impressive performance on complex tasks through Chain-of-Thought (CoT) reasoning. However, conventional CoT relies on explicitly verbalized intermediate steps, which constrains its broader…

Computation and Language · Computer Science 2025-11-04 Xinghao Chen , Anhao Zhao , Heming Xia , Xuan Lu , Hanlin Wang , Yanjun Chen , Wei Zhang , Jian Wang , Wenjie Li , Xiaoyu Shen

Draft-Thinking: Learning Efficient Reasoning in Long Chain-of-Thought LLMs

Long chain-of-thought~(CoT) has become a dominant paradigm for enhancing the reasoning capability of large reasoning models~(LRMs); however, the performance gains often come with a substantial increase in reasoning budget. Recent studies…

Artificial Intelligence · Computer Science 2026-03-03 Jie Cao , Tianwei Lin , Zhenxuan Fan , Bo Yuan , Ziyuan Zhao , Rolan Yan , Wenqiao Zhang , Siliang Tang

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

Long chain-of-thought (CoT) is an essential ingredient in effective usage of modern large language models, but our understanding of the reasoning strategies underlying these capabilities remains limited. While some prior works have…

Computation and Language · Computer Science 2025-05-16 Seongyun Lee , Seungone Kim , Minju Seo , Yongrae Jo , Dongyoung Go , Hyeonbin Hwang , Jinho Park , Xiang Yue , Sean Welleck , Graham Neubig , Moontae Lee , Minjoon Seo

Rethinking Chain-of-Thought Reasoning for Videos

Chain-of-thought (CoT) reasoning has been highly successful in solving complex tasks in natural language processing, and recent multimodal large language models (MLLMs) have extended this paradigm to video reasoning. However, these models…

Computer Vision and Pattern Recognition · Computer Science 2025-12-11 Yiwu Zhong , Zi-Yuan Hu , Yin Li , Liwei Wang

Early Stopping Chain-of-thoughts in Large Language Models

Reasoning large language models (LLMs) have demonstrated superior capacities in solving complicated problems by generating long chain-of-thoughts (CoT), but such a lengthy CoT incurs high inference costs. Previous methods on inference-stage…

Computation and Language · Computer Science 2026-05-19 Minjia Mao , Bowen Yin , Yu Zhu , Xiao Fang

Answer Convergence as a Signal for Early Stopping in Reasoning

Chain-of-thought (CoT) prompting enhances reasoning in large language models (LLMs) but often leads to verbose and redundant outputs, thus increasing inference cost. We hypothesize that many reasoning steps are unnecessary for producing…

Computation and Language · Computer Science 2025-09-30 Xin Liu , Lu Wang

The Art of Efficient Reasoning: Data, Reward, and Optimization

Large Language Models (LLMs) consistently benefit from scaled Chain-of-Thought (CoT) reasoning, but also suffer from heavy computational overhead. To address this issue, efficient reasoning aims to incentivize short yet accurate thinking…

Computation and Language · Computer Science 2026-03-23 Taiqiang Wu , Zenan Xu , Bo Zhou , Ngai Wong

Effective Reasoning Chains Reduce Intrinsic Dimensionality

Chain-of-thought (CoT) reasoning and its variants have substantially improved the performance of language models on complex reasoning tasks, yet the precise mechanisms by which different strategies facilitate generalization remain poorly…

Computation and Language · Computer Science 2026-02-11 Archiki Prasad , Mandar Joshi , Kenton Lee , Mohit Bansal , Peter Shaw

Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding

Large language models (LLMs) have demonstrated remarkable capabilities in tasks requiring reasoning and multi-step problem-solving through the use of chain-of-thought (CoT) prompting. However, generating the full CoT process results in…

Computation and Language · Computer Science 2024-09-16 Tianqiao Liu , Zui Chen , Zitao Liu , Mi Tian , Weiqi Luo

Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information

Recent large language models achieve strong reasoning performance by generating detailed chain-of-thought traces, but this often leads to excessive token use and high inference latency. Existing efficiency approaches typically focus on…

Computation and Language · Computer Science 2025-12-01 Lukas Struppek , Dominik Hintersdorf , Hannah Struppek , Daniel Neider , Kristian Kersting

Efficient Reasoning via Chain of Unconscious Thought

Large Reasoning Models (LRMs) achieve promising performance but compromise token efficiency due to verbose reasoning processes. Unconscious Thought Theory (UTT) posits that complex problems can be solved more efficiently through…

Computation and Language · Computer Science 2025-05-27 Ruihan Gong , Yue Liu , Wenjie Qu , Mingzhe Du , Yufei He , Yingwei Ma , Yulin Chen , Xiang Liu , Yi Wen , Xinfeng Li , Ruidong Wang , Xinzhong Zhu , Bryan Hooi , Jiaheng Zhang

Fast Thinking for Large Language Models

Reasoning-oriented Large Language Models (LLMs) often rely on generating explicit tokens step by step, and their effectiveness typically hinges on large-scale supervised fine-tuning or reinforcement learning. While Chain-of-Thought (CoT)…

Computation and Language · Computer Science 2025-09-30 Haoyu Zheng , Zhuonan Wang , Yuqian Yuan , Tianwei Lin , Wenqiao Zhang , Zheqi Lv , Juncheng Li , Siliang Tang , Yueting Zhuang , Hongyang He

SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs

Chain-of-Thought (CoT) reasoning enables Large Language Models (LLMs) to solve complex reasoning tasks by generating intermediate reasoning steps. However, most existing approaches focus on hard token decoding, which constrains reasoning…

Computation and Language · Computer Science 2025-05-28 Yige Xu , Xu Guo , Zhiwei Zeng , Chunyan Miao