Related papers: Efficient Reasoning with Hidden Thinking

Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding

Large language models (LLMs) have demonstrated remarkable capabilities in tasks requiring reasoning and multi-step problem-solving through the use of chain-of-thought (CoT) prompting. However, generating the full CoT process results in…

Computation and Language · Computer Science 2024-09-16 Tianqiao Liu , Zui Chen , Zitao Liu , Mi Tian , Weiqi Luo

Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning

Large Language Models (LLMs) have shown impressive performance on complex tasks through Chain-of-Thought (CoT) reasoning. However, conventional CoT relies on explicitly verbalized intermediate steps, which constrains its broader…

Computation and Language · Computer Science 2025-11-04 Xinghao Chen , Anhao Zhao , Heming Xia , Xuan Lu , Hanlin Wang , Yanjun Chen , Wei Zhang , Jian Wang , Wenjie Li , Xiaoyu Shen

Efficient Reasoning via Chain of Unconscious Thought

Large Reasoning Models (LRMs) achieve promising performance but compromise token efficiency due to verbose reasoning processes. Unconscious Thought Theory (UTT) posits that complex problems can be solved more efficiently through…

Computation and Language · Computer Science 2025-05-27 Ruihan Gong , Yue Liu , Wenjie Qu , Mingzhe Du , Yufei He , Yingwei Ma , Yulin Chen , Xiang Liu , Yi Wen , Xinfeng Li , Ruidong Wang , Xinzhong Zhu , Bryan Hooi , Jiaheng Zhang

Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

Chain-of-Thought (CoT) reasoning successfully enhances the reasoning capabilities of Large Language Models (LLMs), yet it incurs substantial computational overhead for inference. Existing CoT compression methods often suffer from a critical…

Machine Learning · Computer Science 2026-05-26 Yuntian Tang , Bohan Jia , Wenxuan Huang , Lianyue Zhang , Jiao Xie , Wenxi Li , Wei Li , Jie Hu , Xinghao Chen Rongrong Ji , Shaohui Lin

Think Consistently, Reason Efficiently: Energy-Based Calibration for Implicit Chain-of-Thought

Large Language Models (LLMs) have demonstrated strong reasoning capabilities through \emph{Chain-of-Thought} (CoT) prompting, which enables step-by-step intermediate reasoning. However, explicit CoT methods rely on discrete token-level…

Computation and Language · Computer Science 2025-11-11 Zhikang Chen , Sen Cui , Deheng Ye , Yu Zhang , Yatao Bian , Tingting Zhu

Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains

Large Language Models (LLMs) achieve superior performance through Chain-of-Thought (CoT) reasoning, but these token-level reasoning chains are computationally expensive and inefficient. In this paper, we introduce Compressed Latent…

Computation and Language · Computer Science 2026-02-04 Wenhui Tan , Jiaze Li , Jianzhong Ju , Zhenbo Luo , Ruihua Song , Jian Luan

A State-Transition Framework for Efficient LLM Reasoning

While Long Chain-of-Thought (CoT) reasoning significantly improves Large Language Models (LLMs) performance on complex reasoning tasks, the substantial computational and memory costs of generating long CoT sequences limit their efficiency…

Artificial Intelligence · Computer Science 2026-02-03 Liang Zhang , Yu Zhao , Longyue Wang , Tianqi Shi , Weihua Luo , Kaifu Zhang , Jinsong Su

A Survey on Latent Reasoning

Large Language Models (LLMs) have demonstrated impressive reasoning capabilities, especially when guided by explicit chain-of-thought (CoT) reasoning that verbalizes intermediate steps. While CoT improves both interpretability and accuracy,…

Computation and Language · Computer Science 2025-07-11 Rui-Jie Zhu , Tianhao Peng , Tianhao Cheng , Xingwei Qu , Jinfa Huang , Dawei Zhu , Hao Wang , Kaiwen Xue , Xuanliang Zhang , Yong Shan , Tianle Cai , Taylor Kergan , Assel Kembay , Andrew Smith , Chenghua Lin , Binh Nguyen , Yuqi Pan , Yuhong Chou , Zefan Cai , Zhenhe Wu , Yongchi Zhao , Tianyu Liu , Jian Yang , Wangchunshu Zhou , Chujie Zheng , Chongxuan Li , Yuyin Zhou , Zhoujun Li , Zhaoxiang Zhang , Jiaheng Liu , Ge Zhang , Wenhao Huang , Jason Eshraghian

Efficient Inference for Large Reasoning Models: A Survey

Large Reasoning Models (LRMs) significantly improve the reasoning ability of Large Language Models (LLMs) by learning to reason, exhibiting promising performance in solving complex tasks. However, their deliberative reasoning process leads…

Computation and Language · Computer Science 2025-08-14 Yue Liu , Jiaying Wu , Yufei He , Ruihan Gong , Jun Xia , Liang Li , Hongcheng Gao , Hongyu Chen , Baolong Bi , Jiaheng Zhang , Zhiqi Huang , Bryan Hooi , Stan Z. Li , Keqin Li

Selective Latent Thinking: Adaptive Compression of LLM Reasoning Chains

Explicit chain-of-thought (CoT) reasoning substantially improves the reasoning ability of large language models (LLMs), but incurs high inference cost due to lengthy autoregressive traces. Existing latent reasoning methods offer a promising…

Computation and Language · Computer Science 2026-05-26 Hui Xie , Jie Liu , Ziyue Qiao , Joaquin Vanschore

Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning

Chain of Thought (CoT) reasoning has demonstrated remarkable deep reasoning capabilities in both large language models (LLMs) and multimodal large language models (MLLMs). However, its reliability is often undermined by the accumulation of…

Artificial Intelligence · Computer Science 2025-11-26 Zijun Chen , Wenbo Hu , Richang Hong

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Large Language Models (LLMs) excel at reasoning and planning when trained on chainof-thought (CoT) data, where the step-by-step thought process is explicitly outlined by text tokens. However, this results in lengthy inputs where many words…

Computation and Language · Computer Science 2025-09-03 DiJia Su , Hanlin Zhu , Yingchen Xu , Jiantao Jiao , Yuandong Tian , Qinqing Zheng

R-Capsule: Compressing High-Level Plans for Efficient Large Language Model Reasoning

Chain-of-Thought (CoT) prompting helps Large Language Models (LLMs) tackle complex reasoning by eliciting explicit step-by-step rationales. However, CoT's verbosity increases latency and memory usage and may propagate early errors across…

Computation and Language · Computer Science 2025-09-30 Hongyu Shan , Mingyang Song , Chang Dai , Di Liang , Han Chen

Eliciting Chain-of-Thought in Base LLMs via Gradient-Based Representation Optimization

Chain-of-Thought (CoT) reasoning is a critical capability for large language models (LLMs), enabling them to tackle com- plex multi-step tasks. While base LLMs, pre-trained on general text corpora, often struggle with reasoning due to a…

Computation and Language · Computer Science 2025-11-25 Zijian Wang , Yanxiang Ma , Chang Xu

Making Slow Thinking Faster: Compressing LLM Chain-of-Thought via Step Entropy

Large Language Models (LLMs) using Chain-of-Thought (CoT) prompting excel at complex reasoning but generate verbose thought processes with considerable redundancy, leading to increased inference costs and reduced efficiency. We introduce a…

Artificial Intelligence · Computer Science 2026-02-17 Zeju Li , Jianyuan Zhong , Ziyang Zheng , Xiangyu Wen , Zhijian Xu , Yingying Cheng , Fan Zhang , Qiang Xu

Bridging Efficiency and Transparency: Explainable CoT Compression in Multimodal Large Reasoning Models

Long chains of thought (Long CoTs) are widely employed in multimodal reasoning models to tackle complex tasks by capturing detailed visual information. However, these Long CoTs are often excessively lengthy and contain redundant reasoning…

Artificial Intelligence · Computer Science 2026-02-11 Yizhi Wang , Linan Yue , Min-Ling Zhang

Not All Thoughts are Generated Equal: Efficient LLM Reasoning via Multi-Turn Reinforcement Learning

Compressing long chain-of-thought (CoT) from large language models (LLMs) is an emerging strategy to improve the reasoning efficiency of LLMs. Despite its promising benefits, existing studies equally compress all thoughts within a long CoT,…

Computation and Language · Computer Science 2025-05-27 Yansong Ning , Wei Li , Jun Fang , Naiqiang Tan , Hao Liu

Fast Thinking for Large Language Models

Reasoning-oriented Large Language Models (LLMs) often rely on generating explicit tokens step by step, and their effectiveness typically hinges on large-scale supervised fine-tuning or reinforcement learning. While Chain-of-Thought (CoT)…

Computation and Language · Computer Science 2025-09-30 Haoyu Zheng , Zhuonan Wang , Yuqian Yuan , Tianwei Lin , Wenqiao Zhang , Zheqi Lv , Juncheng Li , Siliang Tang , Yueting Zhuang , Hongyang He

How Far Ahead Do LLMs Plan? Uncovering the Latent Horizon in Chain-of-Thought Reasoning

Chain-of-thought (CoT) reasoning has become a central mechanism for eliciting multi-step reasoning in Large Language Models (LLMs). Yet recent evidence presents a tension: hidden states appear to already encode future reasoning before CoT…

Machine Learning · Computer Science 2026-05-29 Liyan Xu , Mo Yu , Fandong Meng , Jie Zhou

ImgCoT: Compressing Long Chain of Thought into Compact Visual Tokens for Efficient Reasoning of Large Language Model

Compressing long chains of thought (CoT) into compact latent tokens is crucial for efficient reasoning with large language models (LLMs). Recent studies employ autoencoders to achieve this by reconstructing textual CoT from latent tokens,…

Computer Vision and Pattern Recognition · Computer Science 2026-02-02 Xiaoshu Chen , Sihang Zhou , Ke Liang , Taichun Zhou , Xinwang Liu