Related papers: Efficient Reasoning Models: A Survey

200 papers

Chain of Thought (CoT) prompting can encourage language models to engage in multi-step logical reasoning. The quality of the provided demonstrations significantly influences the success of downstream inference tasks. Current unsupervised…

Computation and Language · Computer Science · 2025-05-27 · Yufeng Zhang, Xuepeng Wang, Lingxiang Wu, Jinqiao Wang

Recent advancements in reasoning with large language models (RLLMs), such as OpenAI-O1 and DeepSeek-R1, have demonstrated their impressive capabilities in complex domains like mathematics and coding. A central factor in their success lies…

Artificial Intelligence · Computer Science · 2025-07-21 · Qiguang Chen, Libo Qin, Jinhao Liu, Dengyun Peng, Jiannan Guan, Peng Wang, Mengkang Hu, Yuhang Zhou, Te Gao, Wanxiang Che

Recent Large Reasoning Models (LRMs), such as DeepSeek-R1 and OpenAI o1, have demonstrated strong performance gains by scaling up the length of Chain-of-Thought (CoT) reasoning during inference. However, a growing concern lies in their…

Large Language Models (LLMs) have demonstrated remarkable capabilities in complex tasks. Recent advancements in Large Reasoning Models (LRMs), such as OpenAI o1 and DeepSeek-R1, have further improved performance in System-2 reasoning…

Computation and Language · Computer Science · 2025-08-25 · Yang Sui, Yu-Neng Chuang, Guanchu Wang, Jiamu Zhang, Tianyi Zhang, Jiayi Yuan, Hongyi Liu, Andrew Wen, Shaochen Zhong, Na Zou, Hanjie Chen, Xia Hu

Chain-of-thought (CoT) decoding enables language models to improve reasoning performance at the cost of high generation latency in decoding. Recent proposals have explored variants of contemplation tokens, a term we introduce that refers to…

Computation and Language · Computer Science · 2024-12-18 · Jeffrey Cheng, Benjamin Van Durme

Chain-of-thought (CoT) reasoning has enabled large language models (LLMs) to utilize additional computation through intermediate tokens to solve complex tasks. However, we posit that typical reasoning traces contain many redundant tokens,…

Computation and Language · Computer Science · 2025-06-11 · Tergel Munkhbat, Namgyu Ho, Seo Hyun Kim, Yongjin Yang, Yujin Kim, Se-Young Yun

Large Reasoning Models (LRMs) significantly improve the reasoning ability of Large Language Models (LLMs) by learning to reason, exhibiting promising performance in solving complex tasks. However, their deliberative reasoning process leads…

Computation and Language · Computer Science · 2025-08-14 · Yue Liu, Jiaying Wu, Yufei He, Ruihan Gong, Jun Xia, Liang Li, Hongcheng Gao, Hongyu Chen, Baolong Bi, Jiaheng Zhang, Zhiqi Huang, Bryan Hooi, Stan Z. Li, Keqin Li

Large Language Models (LLMs) have shown impressive performance on complex tasks through Chain-of-Thought (CoT) reasoning. However, conventional CoT relies on explicitly verbalized intermediate steps, which constrains its broader…

Computation and Language · Computer Science · 2025-11-04 · Xinghao Chen, Anhao Zhao, Heming Xia, Xuan Lu, Hanlin Wang, Yanjun Chen, Wei Zhang, Jian Wang, Wenjie Li, Xiaoyu Shen

Long chain-of-thought (CoT) has become a dominant paradigm for enhancing the reasoning capability of large reasoning models (LRMs); however, the performance gains often come with a substantial increase in reasoning budget. Recent studies…

Artificial Intelligence · Computer Science · 2026-03-03 · Jie Cao, Tianwei Lin, Zhenxuan Fan, Bo Yuan, Ziyuan Zhao, Rolan Yan, Wenqiao Zhang, Siliang Tang

Long chain-of-thought (CoT) is an essential ingredient in effective usage of modern large language models, but our understanding of the reasoning strategies underlying these capabilities remains limited. While some prior works have…

Computation and Language · Computer Science · 2025-05-16 · Seongyun Lee, Seungone Kim, Minju Seo, Yongrae Jo, Dongyoung Go, Hyeonbin Hwang, Jinho Park, Xiang Yue, Sean Welleck, Graham Neubig, Moontae Lee, Minjoon Seo

Chain-of-thought (CoT) reasoning has been highly successful in solving complex tasks in natural language processing, and recent multimodal large language models (MLLMs) have extended this paradigm to video reasoning. However, these models…

Computer Vision and Pattern Recognition · Computer Science · 2025-12-11 · Yiwu Zhong, Zi-Yuan Hu, Yin Li, Liwei Wang

Reasoning large language models (LLMs) have demonstrated superior capacities in solving complicated problems by generating long chain-of-thoughts (CoT), but such a lengthy CoT incurs high inference costs. Previous methods on inference-stage…

Computation and Language · Computer Science · 2026-05-19 · Minjia Mao, Bowen Yin, Yu Zhu, Xiao Fang

Chain-of-thought (CoT) prompting enhances reasoning in large language models (LLMs) but often leads to verbose and redundant outputs, thus increasing inference cost. We hypothesize that many reasoning steps are unnecessary for producing…

Computation and Language · Computer Science · 2025-09-30 · Xin Liu, Lu Wang

Large Language Models (LLMs) consistently benefit from scaled Chain-of-Thought (CoT) reasoning, but also suffer from heavy computational overhead. To address this issue, efficient reasoning aims to incentivize short yet accurate thinking…

Computation and Language · Computer Science · 2026-03-23 · Taiqiang Wu, Zenan Xu, Bo Zhou, Ngai Wong

Chain-of-thought (CoT) reasoning and its variants have substantially improved the performance of language models on complex reasoning tasks, yet the precise mechanisms by which different strategies facilitate generalization remain poorly…

Computation and Language · Computer Science · 2026-02-11 · Archiki Prasad, Mandar Joshi, Kenton Lee, Mohit Bansal, Peter Shaw

Large language models (LLMs) have demonstrated remarkable capabilities in tasks requiring reasoning and multi-step problem-solving through the use of chain-of-thought (CoT) prompting. However, generating the full CoT process results in…

Computation and Language · Computer Science · 2024-09-16 · Tianqiao Liu, Zui Chen, Zitao Liu, Mi Tian, Weiqi Luo

Recent large language models achieve strong reasoning performance by generating detailed chain-of-thought traces, but this often leads to excessive token use and high inference latency. Existing efficiency approaches typically focus on…

Computation and Language · Computer Science · 2025-12-01 · Lukas Struppek, Dominik Hintersdorf, Hannah Struppek, Daniel Neider, Kristian Kersting

Large Reasoning Models (LRMs) achieve promising performance but compromise token efficiency due to verbose reasoning processes. Unconscious Thought Theory (UTT) posits that complex problems can be solved more efficiently through…

Computation and Language · Computer Science · 2025-05-27 · Ruihan Gong, Yue Liu, Wenjie Qu, Mingzhe Du, Yufei He, Yingwei Ma, Yulin Chen, Xiang Liu, Yi Wen, Xinfeng Li, Ruidong Wang, Xinzhong Zhu, Bryan Hooi, Jiaheng Zhang

Reasoning-oriented Large Language Models (LLMs) often rely on generating explicit tokens step by step, and their effectiveness typically hinges on large-scale supervised fine-tuning or reinforcement learning. While Chain-of-Thought (CoT)…

Computation and Language · Computer Science · 2025-09-30 · Haoyu Zheng, Zhuonan Wang, Yuqian Yuan, Tianwei Lin, Wenqiao Zhang, Zheqi Lv, Juncheng Li, Siliang Tang, Yueting Zhuang, Hongyang He

Chain-of-Thought (CoT) reasoning enables Large Language Models (LLMs) to solve complex reasoning tasks by generating intermediate reasoning steps. However, most existing approaches focus on hard token decoding, which constrains reasoning…

Computation and Language · Computer Science · 2025-05-28 · Yige Xu, Xu Guo, Zhiwei Zeng, Chunyan Miao