Related papers: Multi-tool Integration Application for Math Reason…

MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting

Large language models (LLMs) have achieved impressive performance on various reasoning tasks. To further improve the performance, we propose MultiTool-CoT, a novel framework that leverages chain-of-thought (CoT) prompting to incorporate…

Computation and Language · Computer Science 2023-05-29 Tatsuro Inaba, Hirokazu Kiyomaru, Fei Cheng, Sadao Kurohashi

MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning

Tool-augmented Large Language Models (TALMs) are known to enhance the skillset of large language models (LLMs), thereby leading to their improved reasoning abilities across many tasks. While TALMs have been successfully employed in…

Computation and Language · Computer Science 2024-04-04 Debrup Das, Debopriyo Banerjee, Somak Aditya, Ashish Kulkarni

Mathematical Reasoning in Large Language Models: Benchmarks, Architectures, Evaluation, and Open Challenges

Mathematical reasoning is essential for problem-solving in education, science, and industry, serving as a crucial benchmark for evaluating artificial intelligence systems. As Large Language Models (LLMs) improve their reasoning…

Computation and Language · Computer Science 2026-05-20 Husnain Amjad, Raja Khurram Shahzad, Aamir Shahzad, Mehwish Fatima

LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning

Large Language Models (LLMs) have shown remarkable performance in various natural language processing tasks but face challenges in mathematical reasoning, where complex problem-solving requires both linguistic understanding and mathematical…

Computation and Language · Computer Science 2025-03-20 Shuguang Chen, Guang Lin

A Survey on Large Language Models for Mathematical Reasoning

Mathematical reasoning has long represented one of the most fundamental and challenging frontiers in artificial intelligence research. In recent years, large language models (LLMs) have achieved significant advances in this area. This…

Artificial Intelligence · Computer Science 2025-06-11 Peng-Yuan Wang, Tian-Shuo Liu, Chenyang Wang, Yi-Di Wang, Shu Yan, Cheng-Xing Jia, Xu-Hui Liu, Xin-Wei Chen, Jia-Cheng Xu, Ziniu Li, Yang Yu

A Toolbox, Not a Hammer -- Multi-TAG: Scaling Math Reasoning with Multi-Tool Aggregation

Augmenting large language models (LLMs) with external tools is a promising avenue for developing high-performance mathematical reasoning systems. Prior tool-augmented approaches typically finetune an LLM to select and invoke a single tool…

Computation and Language · Computer Science 2025-08-25 Bohan Yao, Vikas Yadav

ART: Automatic multi-step reasoning and tool-use for large language models

Large language models (LLMs) can perform complex reasoning in few- and zero-shot settings by generating intermediate chain of thought (CoT) reasoning steps. Further, each reasoning step can rely on external tools to support computation…

Computation and Language · Computer Science 2023-03-17 Bhargavi Paranjape, Scott Lundberg, Sameer Singh, Hannaneh Hajishirzi, Luke Zettlemoyer, Marco Tulio Ribeiro

MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning

The tool-use Large Language Models (LLMs) that integrate with external Python interpreters have significantly enhanced mathematical reasoning capabilities for open-source LLMs, while tool-free methods chose another track: augmenting math…

Computation and Language · Computer Science 2024-05-14 Shuo Yin, Weihao You, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai

To Code or not to Code? Adaptive Tool Integration for Math Language Models via Expectation-Maximization

Recent advances in mathematical problem-solving with language models (LMs) integrate chain-of-thought (CoT) reasoning and code execution to harness their complementary strengths. However, existing hybrid frameworks exhibit a critical…

Artificial Intelligence · Computer Science 2025-07-21 Haozhe Wang, Long Li, Chao Qu, Fengming Zhu, Weidi Xu, Wei Chu, Fangzhen Lin

A Survey on Mathematical Reasoning and Optimization with Large Language Models

Mathematical reasoning and optimization are fundamental to artificial intelligence and computational problem-solving. Recent advancements in Large Language Models (LLMs) have significantly improved AI-driven mathematical reasoning, theorem…

Artificial Intelligence · Computer Science 2025-03-25 Ali Forootani

MathLearner: A Large Language Model Agent Framework for Learning to Solve Mathematical Problems

With the development of artificial intelligence (AI), large language models (LLM) are widely used in many fields. However, the reasoning ability of LLM is still very limited when it comes to mathematical reasoning. Mathematics plays an…

Computation and Language · Computer Science 2024-08-06 Wenbei Xie, Donglin Liu, Haoran Yan, Wenjie Wu, Zongyang Liu

mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models

Large language models (LLMs) with Chain-of-thought (CoT) have recently emerged as a powerful technique for eliciting reasoning to improve various downstream tasks. As most research mainly focuses on English, with few explorations in a…

Computation and Language · Computer Science 2024-07-11 Huiyuan Lai, Malvina Nissim

Benchmarking Large Language Models for Math Reasoning Tasks

The use of Large Language Models (LLMs) in mathematical reasoning has become a cornerstone of related research, demonstrating the intelligence of these models and enabling potential practical applications through their advanced performance,…

Computation and Language · Computer Science 2024-12-20 Kathrin Seßler, Yao Rong, Emek Gözlüklü, Enkelejda Kasneci

Dual Instruction Tuning with Large Language Models for Mathematical Reasoning

Recent advancements highlight the success of instruction tuning with large language models (LLMs) utilizing Chain-of-Thought (CoT) data for mathematical reasoning tasks. Despite the fine-tuned LLMs, challenges persist, such as incorrect,…

Computation and Language · Computer Science 2024-03-28 Yongwei Zhou, Tiejun Zhao

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

While reasoning models (e.g., DeepSeek R1) trained with reinforcement learning (RL) excel in textual reasoning, they struggle in scenarios requiring structured problem-solving, such as geometric reasoning, concise computation, or complex…

Computation and Language · Computer Science 2025-04-18 Jiazhan Feng, Shijue Huang, Xingwei Qu, Ge Zhang, Yujia Qin, Baoquan Zhong, Chengquan Jiang, Jinxin Chi, Wanjun Zhong

Learning Multi-Step Reasoning by Solving Arithmetic Tasks

Mathematical reasoning is regarded as a necessary ability for Language Models (LMs). Recent works demonstrate large LMs' impressive performance in solving math problems. The success is attributed to their Chain-of-Thought (CoT) reasoning…

Computation and Language · Computer Science 2023-06-08 Tianduo Wang, Wei Lu

Towards Effective Code-Integrated Reasoning

In this paper, we investigate code-integrated reasoning, where models generate code when necessary and integrate feedback by executing it through a code interpreter. To acquire this capability, models must learn when and how to use external…

Computation and Language · Computer Science 2025-06-02 Fei Bai, Yingqian Min, Beichen Zhang, Zhipeng Chen, Wayne Xin Zhao, Lei Fang, Zheng Liu, Zhongyuan Wang, Ji-Rong Wen

Alignment for Efficient Tool Calling of Large Language Models

Recent advancements in tool learning have enabled large language models (LLMs) to integrate external tools, enhancing their task performance by expanding their knowledge boundaries. However, relying on tools often introduces tradeoffs…

Computation and Language · Computer Science 2025-03-11 Hongshen Xu, Zihan Wang, Zichen Zhu, Lei Pan, Xingyu Chen, Lu Chen, Kai Yu

Teaching LLMs to Think Mathematically: A Critical Study of Decision-Making via Optimization

This paper investigates the capabilities of large language models (LLMs) in formulating and solving decision-making problems using mathematical programming. We first conduct a systematic review and meta-analysis of recent literature to…

Artificial Intelligence · Computer Science 2025-08-26 Mohammad J. Abdel-Rahman, Yasmeen Alslman, Dania Refai, Amro Saleh, Malik A. Abu Loha, Mohammad Yahya Hamed

Improving Large Language Model Fine-tuning for Solving Math Problems

Despite their success in many natural language tasks, solving math problems remains a significant challenge for large language models (LLMs). A large gap exists between LLMs' pass-at-one and pass-at-N performance in solving math problems,…

Computation and Language · Computer Science 2023-10-17 Yixin Liu, Avi Singh, C. Daniel Freeman, John D. Co-Reyes, Peter J. Liu