English
Related papers

Related papers: DiffAdapt: Difficulty-Adaptive Reasoning for Token…

200 papers

Large Reasoning Models (LRMs) significantly improve the reasoning ability of Large Language Models (LLMs) by learning to reason, exhibiting promising performance in solving complex tasks. However, their deliberative reasoning process leads…

Computation and Language · Computer Science 2025-08-14 Yue Liu , Jiaying Wu , Yufei He , Ruihan Gong , Jun Xia , Liang Li , Hongcheng Gao , Hongyu Chen , Baolong Bi , Jiaheng Zhang , Zhiqi Huang , Bryan Hooi , Stan Z. Li , Keqin Li

Large Language Models (LLMs) using Chain-of-Thought (CoT) prompting excel at complex reasoning but generate verbose thought processes with considerable redundancy, leading to increased inference costs and reduced efficiency. We introduce a…

Artificial Intelligence · Computer Science 2026-02-17 Zeju Li , Jianyuan Zhong , Ziyang Zheng , Xiangyu Wen , Zhijian Xu , Yingying Cheng , Fan Zhang , Qiang Xu

Recent advances in large language models (LLMs) have made reasoning a central benchmark for evaluating intelligence. While prior surveys focus on efficiency by examining how to shorten reasoning chains or reduce computation, this view…

Artificial Intelligence · Computer Science 2026-04-01 Chao Wu , Baoheng Li , Mingchen Gao , Yu Tian , Zhenyi Wang

Large Reasoning Models (LRMs) often suffer from overthinking, generating unnecessarily long reasoning chains even for simple tasks. This leads to substantial computational overhead with limited performance gain, primarily due to redundant…

Artificial Intelligence · Computer Science 2026-01-13 Ruichu Cai , Haopeng Du , Qingwen Lin , Yutong Chen , Zijian Li , Boyan Xu

Decoding strategies play a central role in shaping the reasoning ability of large language models (LLMs). Traditional methods such as greedy decoding and beam search often suffer from error propagation, while sampling-based approaches…

The recent rise of Large Reasoning Models (LRMs) has significantly improved multi-step reasoning performance, but often at the cost of generating excessively long reasoning chains. This paper revisits the efficiency of such reasoning…

Computation and Language · Computer Science 2025-05-27 Xixian Yong , Xiao Zhou , Yingying Zhang , Jinlin Li , Yefeng Zheng , Xian Wu

Large language models (LLMs) are increasingly deployed on complex reasoning tasks, yet little is known about their ability to internally evaluate problem difficulty, which is an essential capability for adaptive reasoning and efficient…

Computation and Language · Computer Science 2025-10-14 Sunbowen Lee , Qingyu Yin , Chak Tou Leong , Jialiang Zhang , Yicheng Gong , Shiwen Ni , Min Yang , Xiaoyu Shen

Adaptive reasoning is essential for aligning the computational effort of large language models (LLMs) with the intrinsic difficulty of problems. Current chain-of-thought methods boost reasoning ability but indiscriminately generate long…

Artificial Intelligence · Computer Science 2025-12-17 Ruofan Zhang , Bin Xia , Zhen Cheng , Cairen Jian , Minglun Yang , Ngai Wong , Yuan Cheng

Large Language Models (LLMs) often struggle with computational efficiency and error propagation in multi-step reasoning tasks. While recent advancements on prompting and post-training have enabled LLMs to perform step-wise reasoning, they…

Artificial Intelligence · Computer Science 2026-05-08 Yuan Sui , Yufei He , Tri Cao , Simeng Han , Yulin Chen , Bryan Hooi

Large Language Models (LLMs) often exhibit strong linguistic abilities while remaining unreliable on multi-step reasoning tasks, particularly when deployed without additional training or fine-tuning. In this work, we study inference-time…

Computation and Language · Computer Science 2026-03-24 Vinay Sharma , Manish Jain

We propose a novel LLM-based framework for reasoning in discrete, game-theoretic tasks, illustrated with \emph{Tic-Tac-Toe}. The method integrates in-context learning with entropy-guided chain-of-thought (CoT) reasoning and adaptive context…

Computation and Language · Computer Science 2026-04-14 Tommaso Felice Banfi , Sashenka Gamage

Large Language Models (LLMs) with extended reasoning capabilities often generate verbose and redundant reasoning traces, incurring unnecessary computational cost. While existing reinforcement learning approaches address this by optimizing…

Artificial Intelligence · Computer Science 2026-03-19 Chengwei Wei , Jung-jae Kim , Longyin Zhang , Shengkai Chen , Nancy F. Chen

Large Language Models (LLMs) have recently achieved remarkable progress by leveraging Reinforcement Learning and extended Chain-of-Thought (CoT) techniques. However, the challenge of performing efficient language reasoning--especially…

Computation and Language · Computer Science 2025-06-17 Zhong-Zhi Li , Xiao Liang , Zihao Tang , Lei Ji , Peijie Wang , Haotian Xu , Xing W , Haizhen Huang , Weiwei Deng , Yeyun Gong , Zhijiang Guo , Xiao Liu , Fei Yin , Cheng-Lin Liu

The reasoning capabilities of large language models (LLMs) have improved substantially through increased test-time computation, typically in the form of intermediate tokens known as chain-of-thought (CoT). However, CoT often becomes…

Computation and Language · Computer Science 2026-01-07 Nathanaël Carraz Rakotonirina , Ren Pang , Neha Anna John , Michael Bohlke-Schneider , Momchil Hardalov

Recent advancements in slow thinking reasoning models have shown exceptional performance in complex reasoning tasks. However, these models often exhibit overthinking (generating redundant reasoning steps for simple problems), leading to…

Machine Learning · Computer Science 2026-01-13 Yi Shen , Jian Zhang , Jieyun Huang , Shuming Shi , Wenjing Zhang , Jiangze Yan , Ning Wang , Kai Wang , Zhaoxiang Liu , Shiguo Lian

Recent Large Reasoning Models (LRMs), such as DeepSeek-R1 and OpenAI o1, have demonstrated strong performance gains by scaling up the length of Chain-of-Thought (CoT) reasoning during inference. However, a growing concern lies in their…

Large Language Models (LLMs), particularly smaller variants, still struggle with complex reasoning tasks. While inference-time prompting can guide reasoning, existing methods often rely on sequential queries. Ensemble approaches offer a…

Computation and Language · Computer Science 2025-10-28 Gregory Kang Ruey Lau , Wenyang Hu , Diwen Liu , Jizhuo Chen , See-Kiong Ng , Bryan Kian Hsiang Low

Decoding from large language models (LLMs) typically relies on fixed sampling hyperparameters (e.g., temperature, top-p), despite substantial variation in task difficulty and uncertainty across prompts and individual decoding steps. We…

Machine Learning · Computer Science 2026-03-17 Chloe H. Su , Zhe Ye , Samuel Tenka , Aidan Yang , Soonho Kong , Udaya Ghai

Large Language Models (LLMs) have demonstrated remarkable capabilities in complex tasks. Recent advancements in Large Reasoning Models (LRMs), such as OpenAI o1 and DeepSeek-R1, have further improved performance in System-2 reasoning…

Computation and Language · Computer Science 2025-08-25 Yang Sui , Yu-Neng Chuang , Guanchu Wang , Jiamu Zhang , Tianyi Zhang , Jiayi Yuan , Hongyi Liu , Andrew Wen , Shaochen Zhong , Na Zou , Hanjie Chen , Xia Hu

While reasoning large language models (LLMs) demonstrate remarkable performance across various tasks, they also contain notable security vulnerabilities. Recent research has uncovered a "thinking-stopped" vulnerability in DeepSeek-R1, where…

Cryptography and Security · Computer Science 2025-04-30 Yu Cui , Yujun Cai , Yiwei Wang
‹ Prev 1 2 3 10 Next ›