Related papers: DiffAdapt: Difficulty-Adaptive Reasoning for Token…

Efficient Inference for Large Reasoning Models: A Survey

Large Reasoning Models (LRMs) significantly improve the reasoning ability of Large Language Models (LLMs) by learning to reason, exhibiting promising performance in solving complex tasks. However, their deliberative reasoning process leads…

Computation and Language · Computer Science 2025-08-14 Yue Liu, Jiaying Wu, Yufei He, Ruihan Gong, Jun Xia, Liang Li, Hongcheng Gao, Hongyu Chen, Baolong Bi, Jiaheng Zhang, Zhiqi Huang, Bryan Hooi, Stan Z. Li, Keqin Li

Making Slow Thinking Faster: Compressing LLM Chain-of-Thought via Step Entropy

Large Language Models (LLMs) using Chain-of-Thought (CoT) prompting excel at complex reasoning but generate verbose thought processes with considerable redundancy, leading to increased inference costs and reduced efficiency. We introduce a…

Artificial Intelligence · Computer Science 2026-02-17 Zeju Li, Jianyuan Zhong, Ziyang Zheng, Xiangyu Wen, Zhijian Xu, Yingying Cheng, Fan Zhang, Qiang Xu

From Efficiency to Adaptivity: A Deeper Look at Adaptive Reasoning in Large Language Models

Recent advances in large language models (LLMs) have made reasoning a central benchmark for evaluating intelligence. While prior surveys focus on efficiency by examining how to shorten reasoning chains or reduce computation, this view…

Artificial Intelligence · Computer Science 2026-04-01 Chao Wu, Baoheng Li, Mingchen Gao, Yu Tian, Zhenyi Wang

ENTRA: Entropy-Based Redundancy Avoidance in Large Language Model Reasoning

Large Reasoning Models (LRMs) often suffer from overthinking, generating unnecessarily long reasoning chains even for simple tasks. This leads to substantial computational overhead with limited performance gain, primarily due to redundant…

Artificial Intelligence · Computer Science 2026-01-13 Ruichu Cai, Haopeng Du, Qingwen Lin, Yutong Chen, Zijian Li, Boyan Xu

Think Twice Before You Write -- an Entropy-based Decoding Strategy to Enhance LLM Reasoning

Decoding strategies play a central role in shaping the reasoning ability of large language models (LLMs). Traditional methods such as greedy decoding and beam search often suffer from error propagation, while sampling-based approaches…

Computation and Language · Computer Science 2026-04-02 Jiashu He, Meizhu Liu, Olaitan P Olaleye, Amit Agarwal, M. Avendi, Yassi Abbasi, Matthew Rowe, Hitesh Laxmichand Patel, Paul Li, Tao Sheng, Sujith Ravi, Dan Roth

Think or Not? Exploring Thinking Efficiency in Large Reasoning Models via an Information-Theoretic Lens

The recent rise of Large Reasoning Models (LRMs) has significantly improved multi-step reasoning performance, but often at the cost of generating excessively long reasoning chains. This paper revisits the efficiency of such reasoning…

Computation and Language · Computer Science 2025-05-27 Xixian Yong, Xiao Zhou, Yingying Zhang, Jinlin Li, Yefeng Zheng, Xian Wu

Probing the Difficulty Perception Mechanism of Large Language Models

Large language models (LLMs) are increasingly deployed on complex reasoning tasks, yet little is known about their ability to internally evaluate problem difficulty, which is an essential capability for adaptive reasoning and efficient…

Computation and Language · Computer Science 2025-10-14 Sunbowen Lee, Qingyu Yin, Chak Tou Leong, Jialiang Zhang, Yicheng Gong, Shiwen Ni, Min Yang, Xiaoyu Shen

DART: Difficulty-Adaptive Reasoning Truncation for Efficient Large Language Models

Adaptive reasoning is essential for aligning the computational effort of large language models (LLMs) with the intrinsic difficulty of problems. Current chain-of-thought methods boost reasoning ability but indiscriminately generate long…

Artificial Intelligence · Computer Science 2025-12-17 Ruofan Zhang, Bin Xia, Zhen Cheng, Cairen Jian, Minglun Yang, Ngai Wong, Yuan Cheng

Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models

Large Language Models (LLMs) often struggle with computational efficiency and error propagation in multi-step reasoning tasks. While recent advancements on prompting and post-training have enabled LLMs to perform step-wise reasoning, they…

Artificial Intelligence · Computer Science 2026-05-08 Yuan Sui, Yufei He, Tri Cao, Simeng Han, Yulin Chen, Bryan Hooi

Enhancing reasoning accuracy in large language models during inference time

Large Language Models (LLMs) often exhibit strong linguistic abilities while remaining unreliable on multi-step reasoning tasks, particularly when deployed without additional training or fine-tuning. In this work, we study inference-time…

Computation and Language · Computer Science 2026-03-24 Vinay Sharma, Manish Jain

LLMs for Game Theory: Entropy-Guided In-Context Learning and Adaptive CoT Reasoning

We propose a novel LLM-based framework for reasoning in discrete, game-theoretic tasks, illustrated with Tic-Tac-Toe. The method integrates in-context learning with entropy-guided chain-of-thought (CoT) reasoning and adaptive context…

Computation and Language · Computer Science 2026-04-14 Tommaso Felice Banfi, Sashenka Gamage

InfoDensity: Rewarding Information-Dense Traces for Efficient Reasoning

Large Language Models (LLMs) with extended reasoning capabilities often generate verbose and redundant reasoning traces, incurring unnecessary computational cost. While existing reinforcement learning approaches address this by optimizing…

Artificial Intelligence · Computer Science 2026-03-19 Chengwei Wei, Jung-jae Kim, Longyin Zhang, Shengkai Chen, Nancy F. Chen

TL;DR: Too Long, Do Re-weighting for Efficient LLM Reasoning Compression

Large Language Models (LLMs) have recently achieved remarkable progress by leveraging Reinforcement Learning and extended Chain-of-Thought (CoT) techniques. However, the challenge of performing efficient language reasoning--especially…

Computation and Language · Computer Science 2025-06-17 Zhong-Zhi Li, Xiao Liang, Zihao Tang, Lei Ji, Peijie Wang, Haotian Xu, Xing W, Haizhen Huang, Weiwei Deng, Yeyun Gong, Zhijiang Guo, Xiao Liu, Fei Yin, Cheng-Lin Liu

Correct, Concise and Complete: Multi-stage Training For Adaptive Reasoning

The reasoning capabilities of large language models (LLMs) have improved substantially through increased test-time computation, typically in the form of intermediate tokens known as chain-of-thought (CoT). However, CoT often becomes…

Computation and Language · Computer Science 2026-01-07 Nathanaël Carraz Rakotonirina, Ren Pang, Neha Anna John, Michael Bohlke-Schneider, Momchil Hardalov

DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models

Recent advancements in slow thinking reasoning models have shown exceptional performance in complex reasoning tasks. However, these models often exhibit overthinking (generating redundant reasoning steps for simple problems), leading to…

Machine Learning · Computer Science 2026-01-13 Yi Shen, Jian Zhang, Jieyun Huang, Shuming Shi, Wenjing Zhang, Jiangze Yan, Ning Wang, Kai Wang, Zhaoxiang Liu, Shiguo Lian

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Recent Large Reasoning Models (LRMs), such as DeepSeek-R1 and OpenAI o1, have demonstrated strong performance gains by scaling up the length of Chain-of-Thought (CoT) reasoning during inference. However, a growing concern lies in their…

Computation and Language · Computer Science 2026-01-01 Xiaoye Qu, Yafu Li, Zhao-Chen Su, Weigao Sun, Jianhao Yan, Dongrui Liu, Ganqu Cui, Daizong Liu, Shuxian Liang, Junxian He, Peng Li, Wei Wei, Jing Shao, Chaochao Lu, Yue Zhang, Xian-Sheng Hua, Bowen Zhou, Yu Cheng

Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks

Large Language Models (LLMs), particularly smaller variants, still struggle with complex reasoning tasks. While inference-time prompting can guide reasoning, existing methods often rely on sequential queries. Ensemble approaches offer a…

Computation and Language · Computer Science 2025-10-28 Gregory Kang Ruey Lau, Wenyang Hu, Diwen Liu, Jizhuo Chen, See-Kiong Ng, Bryan Kian Hsiang Low

Learning Adaptive LLM Decoding

Decoding from large language models (LLMs) typically relies on fixed sampling hyperparameters (e.g., temperature, top-p), despite substantial variation in task difficulty and uncertainty across prompts and individual decoding steps. We…

Machine Learning · Computer Science 2026-03-17 Chloe H. Su, Zhe Ye, Samuel Tenka, Aidan Yang, Soonho Kong, Udaya Ghai

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Large Language Models (LLMs) have demonstrated remarkable capabilities in complex tasks. Recent advancements in Large Reasoning Models (LRMs), such as OpenAI o1 and DeepSeek-R1, have further improved performance in System-2 reasoning…

Computation and Language · Computer Science 2025-08-25 Yang Sui, Yu-Neng Chuang, Guanchu Wang, Jiamu Zhang, Tianyi Zhang, Jiayi Yuan, Hongyi Liu, Andrew Wen, Shaochen Zhong, Na Zou, Hanjie Chen, Xia Hu

Token-Efficient Prompt Injection Attack: Provoking Cessation in LLM Reasoning via Adaptive Token Compression

While reasoning large language models (LLMs) demonstrate remarkable performance across various tasks, they also contain notable security vulnerabilities. Recent research has uncovered a "thinking-stopped" vulnerability in DeepSeek-R1, where…

Cryptography and Security · Computer Science 2025-04-30 Yu Cui, Yujun Cai, Yiwei Wang