Related papers: Learning Multi-Step Reasoning by Solving Arithmeti…

Multi-Step Reasoning with Large Language Models, a Survey

Large language models (LLMs) with billions of parameters exhibit in-context learning abilities, enabling few-shot learning on tasks that the model was not specifically trained for. Traditional models achieve breakthrough performance on…

Artificial Intelligence · Computer Science 2025-11-04 Aske Plaat , Annie Wong , Suzan Verberne , Joost Broekens , Niki van Stein , Thomas Back

A Survey on Large Language Models for Mathematical Reasoning

Mathematical reasoning has long represented one of the most fundamental and challenging frontiers in artificial intelligence research. In recent years, large language models (LLMs) have achieved significant advances in this area. This…

Artificial Intelligence · Computer Science 2025-06-11 Peng-Yuan Wang , Tian-Shuo Liu , Chenyang Wang , Yi-Di Wang , Shu Yan , Cheng-Xing Jia , Xu-Hui Liu , Xin-Wei Chen , Jia-Cheng Xu , Ziniu Li , Yang Yu

A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics

Recent progress in large language models (LLM) found chain-of-thought prompting strategies to improve the reasoning ability of LLMs by encouraging problem solving through multiple steps. Therefore, subsequent research aimed to integrate the…

Computation and Language · Computer Science 2025-02-21 Ting-Ruen Wei , Haowei Liu , Xuyang Wu , Yi Fang

LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning

Large Language Models (LLMs) have shown remarkable performance in various natural language processing tasks but face challenges in mathematical reasoning, where complex problem-solving requires both linguistic understanding and mathematical…

Computation and Language · Computer Science 2025-03-20 Shuguang Chen , Guang Lin

Markov Chain of Thought for Efficient Mathematical Reasoning

Chain of Thought (CoT) of multi-step benefits from the logical structure of the reasoning steps and task-specific actions, significantly enhancing the mathematical reasoning capabilities of large language models. As the prevalence of long…

Artificial Intelligence · Computer Science 2025-03-07 Wen Yang , Minpeng Liao , Kai Fan

Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models

Recent work has shown that language models (LMs) have strong multi-step (i.e., procedural) reasoning capabilities. However, it is unclear whether LMs perform these tasks by cheating with answers memorized from pretraining corpus, or, via a…

Computation and Language · Computer Science 2023-10-24 Yifan Hou , Jiaoda Li , Yu Fei , Alessandro Stolfo , Wangchunshu Zhou , Guangtao Zeng , Antoine Bosselut , Mrinmaya Sachan

LLMs Faithfully and Iteratively Compute Answers During CoT: A Systematic Analysis With Multi-step Arithmetics

This study investigates the internal information flow of large language models (LLMs) while performing chain-of-thought (CoT) style reasoning. Specifically, with a particular interest in the faithfulness of the CoT explanation to LLMs'…

Computation and Language · Computer Science 2026-03-20 Keito Kudo , Yoichi Aoki , Tatsuki Kuribayashi , Shusaku Sone , Masaya Taniguchi , Ana Brassard , Keisuke Sakaguchi , Kentaro Inui

Evaluating Step-by-Step Reasoning through Symbolic Verification

Pre-trained language models (LMs) have shown remarkable reasoning performance using explanations or chain-of-thoughts (CoT)) for in-context learning. On the other hand, these reasoning tasks are usually presumed to be more approachable for…

Computation and Language · Computer Science 2024-03-29 Yi-Fan Zhang , Hanlin Zhang , Li Erran Li , Eric Xing

Step Guided Reasoning: Improving Mathematical Reasoning using Guidance Generation and Step Reasoning

Mathematical reasoning has been challenging for large language models (LLMs), and the introduction of step-by-step Chain-of-Thought (CoT) inference has significantly advanced the mathematical capabilities of LLMs. However, current…

Artificial Intelligence · Computer Science 2025-09-23 Lang Cao , Yingtian Zou , Chao Peng , Renhong Chen , Wu Ning , Yitong Li

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Chain-of-Thought (CoT) significantly enhances formal reasoning capabilities in Large Language Models (LLMs) by training them to explicitly generate intermediate reasoning steps. While LLMs readily benefit from such techniques, improving…

Computation and Language · Computer Science 2025-05-01 Haoran Xu , Baolin Peng , Hany Awadalla , Dongdong Chen , Yen-Chun Chen , Mei Gao , Young Jin Kim , Yunsheng Li , Liliang Ren , Yelong Shen , Shuohang Wang , Weijian Xu , Jianfeng Gao , Weizhu Chen

Teaching LLMs to Think Mathematically: A Critical Study of Decision-Making via Optimization

This paper investigates the capabilities of large language models (LLMs) in formulating and solving decision-making problems using mathematical programming. We first conduct a systematic review and meta-analysis of recent literature to…

Artificial Intelligence · Computer Science 2025-08-26 Mohammad J. Abdel-Rahman , Yasmeen Alslman , Dania Refai , Amro Saleh , Malik A. Abu Loha , Mohammad Yahya Hamed

Mathematical Reasoning in Large Language Models: Benchmarks, Architectures, Evaluation, and Open Challenges

Mathematical reasoning is essential for problem-solving in education, science, and industry, serving as a crucial benchmark for evaluating artificial intelligence systems. As Large Language Models (LLMs) improve their reasoning…

Computation and Language · Computer Science 2026-05-20 Husnain Amjad , Raja Khurram Shahzad , Aamir Shahzad , Mehwish Fatima

The Impact of Reasoning Step Length on Large Language Models

Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correlation between the effectiveness of CoT and the length of reasoning steps in prompts remains largely unknown. To…

Computation and Language · Computer Science 2024-06-25 Mingyu Jin , Qinkai Yu , Dong Shu , Haiyan Zhao , Wenyue Hua , Yanda Meng , Yongfeng Zhang , Mengnan Du

Large Language Models for Mathematical Reasoning: Progresses and Challenges

Mathematical reasoning serves as a cornerstone for assessing the fundamental cognitive capabilities of human intelligence. In recent times, there has been a notable surge in the development of Large Language Models (LLMs) geared towards the…

Computation and Language · Computer Science 2024-09-18 Janice Ahn , Rishu Verma , Renze Lou , Di Liu , Rui Zhang , Wenpeng Yin

Faithful Reasoning Using Large Language Models

Although contemporary large language models (LMs) demonstrate impressive question-answering capabilities, their answers are typically the product of a single call to the model. This entails an unwelcome degree of opacity and compromises…

Artificial Intelligence · Computer Science 2022-08-31 Antonia Creswell , Murray Shanahan

Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters

Chain-of-Thought (CoT) prompting can dramatically improve the multi-step reasoning abilities of large language models (LLMs). CoT explicitly encourages the LLM to generate intermediate rationales for solving a problem, by providing a series…

Computation and Language · Computer Science 2023-06-02 Boshi Wang , Sewon Min , Xiang Deng , Jiaming Shen , You Wu , Luke Zettlemoyer , Huan Sun

CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning

Mathematical reasoning remains a significant challenge for large language models (LLMs), despite progress in prompting techniques such as Chain-of-Thought (CoT). We present **Chain of Mathematically Annotated Thought (CoMAT)**, which…

Artificial Intelligence · Computer Science 2026-01-16 Joshua Ong Jun Leang , Aryo Pradipta Gema , Shay B. Cohen

Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via Plan Augmentation

Multi-step reasoning ability of large language models is crucial in tasks such as math and tool utilization. Current researches predominantly focus on enhancing model performance in these multi-step reasoning tasks through fine-tuning with…

Computation and Language · Computer Science 2024-10-23 Yuli Qiu , Jiashu Yao , Heyan Huang , Yuhang Guo

Guiding Language Model Reasoning with Planning Tokens

Large language models (LLMs) have recently attracted considerable interest for their ability to perform complex reasoning tasks, such as chain-of-thought (CoT) reasoning. However, most of the existing approaches to enhance this ability rely…

Computation and Language · Computer Science 2024-08-08 Xinyi Wang , Lucas Caccia , Oleksiy Ostapenko , Xingdi Yuan , William Yang Wang , Alessandro Sordoni

Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding

Chain-of-Thought (CoT) is a technique that guides Large Language Models (LLMs) to decompose complex tasks into multi-step reasoning through intermediate steps in natural language form. Briefly, CoT enables LLMs to think step by step.…

Computation and Language · Computer Science 2023-10-19 Caoyun Fan , Jidong Tian , Yitian Li , Wenqing Chen , Hao He , Yaohui Jin