Related papers: Language Models as Compilers: Simulating Pseudocod…

Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs

In large language models (LLMs), code and reasoning reinforce each other: code offers an abstract, modular, and logic-driven structure that supports reasoning, while reasoning translates high-level goals into smaller, executable steps that…

Computation and Language · Computer Science 2025-02-27 Dayu Yang , Tianyang Liu , Daoan Zhang , Antoine Simoulin , Xiaoyi Liu , Yuwei Cao , Zhaopu Teng , Xin Qian , Grey Yang , Jiebo Luo , Julian McAuley

Chain of Code: Reasoning with a Language Model-Augmented Code Emulator

Code provides a general syntactic structure to build complex programs and perform precise computations when paired with a code interpreter - we hypothesize that language models (LMs) can leverage code-writing to improve Chain of Thought…

Computation and Language · Computer Science 2024-07-31 Chengshu Li , Jacky Liang , Andy Zeng , Xinyun Chen , Karol Hausman , Dorsa Sadigh , Sergey Levine , Li Fei-Fei , Fei Xia , Brian Ichter

Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning

Despite the remarkable success of large language models (LLMs) on traditional natural language processing tasks, their planning ability remains a critical bottleneck in tackling complex multi-step reasoning tasks. Existing approaches mainly…

Computation and Language · Computer Science 2024-10-07 Jiaxin Wen , Jian Guan , Hongning Wang , Wei Wu , Minlie Huang

Unveiling the Magic of Code Reasoning through Hypothesis Decomposition and Amendment

The reasoning abilities are one of the most enigmatic and captivating aspects of large language models (LLMs). Numerous studies are dedicated to exploring and expanding the boundaries of this reasoning capability. However, tasks that embody…

Artificial Intelligence · Computer Science 2025-02-27 Yuze Zhao , Tianyun Ji , Wenjun Feng , Zhenya Huang , Qi Liu , Zhiding Liu , Yixiao Ma , Kai Zhang , Enhong Chen

Preventing Language Models From Hiding Their Reasoning

Large language models (LLMs) often benefit from intermediate steps of reasoning to generate answers to complex problems. When these intermediate steps of reasoning are used to monitor the activity of the model, it is essential that this…

Machine Learning · Computer Science 2023-11-02 Fabien Roger , Ryan Greenblatt

Learning to Reason via Program Generation, Emulation, and Search

Program synthesis with language models (LMs) has unlocked a large set of reasoning abilities; code-tuned LMs have proven adept at generating programs that solve a wide variety of algorithmic symbolic manipulation tasks (e.g. word…

Computation and Language · Computer Science 2024-11-05 Nathaniel Weir , Muhammad Khalifa , Linlu Qiu , Orion Weller , Peter Clark

Demystifying Errors in LLM Reasoning Traces: An Empirical Study of Code Execution Simulation

Understanding a program's runtime reasoning behavior, meaning how intermediate states and control flows lead to final execution results, is essential for reliable code generation, debugging, and automated reasoning. Although large language…

Software Engineering · Computer Science 2025-12-02 Mohammad Abdollahi , Khandaker Rifah Tasnia , Soumit Kanti Saha , Jinqiu Yang , Song Wang , Hadi Hemmati

Code Execution as Grounded Supervision for LLM Reasoning

Training large language models (LLMs) with chain-of-thought (CoT) supervision has proven effective for enhancing their reasoning abilities. However, obtaining reliable and accurate reasoning supervision remains a significant challenge. We…

Computation and Language · Computer Science 2025-10-21 Dongwon Jung , Wenxuan Zhou , Muhao Chen

Advancing Reasoning in Large Language Models: Promising Methods and Approaches

Large Language Models (LLMs) have succeeded remarkably in various natural language processing (NLP) tasks, yet their reasoning capabilities remain a fundamental challenge. While LLMs exhibit impressive fluency and factual recall, their…

Computation and Language · Computer Science 2025-05-29 Avinash Patil , Aryan Jadon

LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning

Large Language Models (LLMs) have shown remarkable performance in various natural language processing tasks but face challenges in mathematical reasoning, where complex problem-solving requires both linguistic understanding and mathematical…

Computation and Language · Computer Science 2025-03-20 Shuguang Chen , Guang Lin

Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models

Neural-symbolic methods have demonstrated efficiency in enhancing the reasoning abilities of large language models (LLMs). However, existing methods mainly rely on syntactically mapping natural languages to complete formal languages like…

Computation and Language · Computer Science 2024-06-04 Yiming Wang , Zhuosheng Zhang , Pei Zhang , Baosong Yang , Rui Wang

Evaluating Prompting and Execution-Based Methods for Deterministic Computation in LLMs

Large Language Models (LLMs) have demonstrated strong capabilities in natural language understanding and reasoning. However, their ability to perform exact, deterministic computation remains unclear. In this work, we systematically evaluate…

Artificial Intelligence · Computer Science 2026-05-08 Hongkun Yu

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Large Language Models (LLMs) have demonstrated remarkable capabilities in complex tasks. Recent advancements in Large Reasoning Models (LRMs), such as OpenAI o1 and DeepSeek-R1, have further improved performance in System-2 reasoning…

Computation and Language · Computer Science 2025-08-25 Yang Sui , Yu-Neng Chuang , Guanchu Wang , Jiamu Zhang , Tianyi Zhang , Jiayi Yuan , Hongyi Liu , Andrew Wen , Shaochen Zhong , Na Zou , Hanjie Chen , Xia Hu

NExT: Teaching Large Language Models to Reason about Code Execution

A fundamental skill among human developers is the ability to understand and reason about program execution. As an example, a programmer can mentally simulate code execution in natural language to debug and repair code (aka. rubber duck…

Machine Learning · Computer Science 2024-04-24 Ansong Ni , Miltiadis Allamanis , Arman Cohan , Yinlin Deng , Kensen Shi , Charles Sutton , Pengcheng Yin

Implicit Reasoning in Large Language Models: A Comprehensive Survey

Large Language Models (LLMs) have demonstrated strong generalization across a wide range of tasks. Reasoning with LLMs is central to solving multi-step problems and complex decision-making. To support efficient reasoning, recent studies…

Computation and Language · Computer Science 2025-09-03 Jindong Li , Yali Fu , Li Fan , Jiahong Liu , Yao Shu , Chengwei Qin , Menglin Yang , Irwin King , Rex Ying

Unlocking Structured Thinking in Language Models with Cognitive Prompting

We propose cognitive prompting as a novel approach to guide problem-solving in large language models (LLMs) through structured, human-like cognitive operations, such as goal clarification, decomposition, filtering, abstraction, and pattern…

Computation and Language · Computer Science 2024-12-03 Oliver Kramer , Jill Baumann

Thought-Like-Pro: Enhancing Reasoning of Large Language Models through Self-Driven Prolog-based Chain-of-Thought

Large language models (LLMs) have shown exceptional performance as general-purpose assistants, excelling across a variety of reasoning tasks. This achievement represents a significant step toward achieving artificial general intelligence…

Artificial Intelligence · Computer Science 2024-08-13 Xiaoyu Tan , Yongxin Deng , Xihe Qiu , Weidi Xu , Chao Qu , Wei Chu , Yinghui Xu , Yuan Qi

Distilling Algorithmic Reasoning from LLMs via Explaining Solution Programs

Distilling explicit chain-of-thought reasoning paths has emerged as an effective method for improving the reasoning abilities of large language models (LLMs) across various tasks. However, when tackling complex tasks that pose significant…

Computation and Language · Computer Science 2024-04-15 Jierui Li , Raymond Mooney

Reasoning Like Program Executors

Reasoning over natural language is a long-standing goal for the research community. However, studies have shown that existing language models are inadequate in reasoning. To address the issue, we present POET, a novel reasoning pre-training…

Computation and Language · Computer Science 2022-10-25 Xinyu Pi , Qian Liu , Bei Chen , Morteza Ziyadi , Zeqi Lin , Qiang Fu , Yan Gao , Jian-Guang Lou , Weizhu Chen

Computational Thinking Reasoning in Large Language Models

While large language models (LLMs) have demonstrated remarkable reasoning capabilities, they often struggle with complex tasks that require specific thinking paradigms, such as divide-and-conquer and procedural deduction, \etc Previous…

Software Engineering · Computer Science 2025-06-05 Kechi Zhang , Ge Li , Jia Li , Huangzhao Zhang , Jingjing Xu , Hao Zhu , Lecheng Wang , Jia Li , Yihong Dong , Jing Mai , Bin Gu , Zhi Jin