Related papers: Distilling Algorithmic Reasoning from LLMs via Exp…

Enhancing Code Generation Performance of Smaller Models by Distilling the Reasoning Ability of LLMs

Large Language Models (LLMs) have recently made significant advances in code generation through the 'Chain-of-Thought' prompting technique. This technique empowers the model to autonomously devise "solution plans" to tackle intricate…

Software Engineering · Computer Science 2024-03-21 Zhihong Sun , Chen Lyu , Bolun Li , Yao Wan , Hongyu Zhang , Ge Li , Zhi Jin

Explanations from Large Language Models Make Small Reasoners Better

Integrating free-text explanations to in-context learning of large language models (LLM) is shown to elicit strong reasoning capabilities along with reasonable explanations. In this paper, we consider the problem of leveraging the…

Computation and Language · Computer Science 2022-10-14 Shiyang Li , Jianshu Chen , Yelong Shen , Zhiyu Chen , Xinlu Zhang , Zekun Li , Hong Wang , Jing Qian , Baolin Peng , Yi Mao , Wenhu Chen , Xifeng Yan

LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning

Large Language Models (LLMs) have shown remarkable performance in various natural language processing tasks but face challenges in mathematical reasoning, where complex problem-solving requires both linguistic understanding and mathematical…

Computation and Language · Computer Science 2025-03-20 Shuguang Chen , Guang Lin

Small Models Struggle to Learn from Strong Reasoners

Large language models (LLMs) excel in complex reasoning tasks, and distilling their reasoning capabilities into smaller models has shown promise. However, we uncover an interesting phenomenon, which we term the Small Model Learnability Gap:…

Artificial Intelligence · Computer Science 2025-11-14 Yuetai Li , Xiang Yue , Zhangchen Xu , Fengqing Jiang , Luyao Niu , Bill Yuchen Lin , Bhaskar Ramasubramanian , Radha Poovendran

Reasoning Scaffolding: Distilling the Flow of Thought from LLMs

The prevailing approach to distilling reasoning from Large Language Models (LLMs)-behavioral cloning from textual rationales-is fundamentally limited. It teaches Small Language Models (SLMs) to mimic surface-level patterns rather than the…

Artificial Intelligence · Computer Science 2025-10-02 Xiangyu Wen , Junhua Huang , Zeju Li , Min Li , Jianyuan Zhong , Zhijian Xu , Mingxuan Yuan , Yongxiang Huang , Qiang Xu

Chain of Thoughtlessness? An Analysis of CoT in Planning

Large language model (LLM) performance on reasoning problems typically does not generalize out of distribution. Previous work has claimed that this can be mitigated with chain of thought prompting-a method of demonstrating solution…

Artificial Intelligence · Computer Science 2025-03-13 Kaya Stechly , Karthik Valmeekam , Subbarao Kambhampati

Efficient Long CoT Reasoning in Small Language Models

Recent large reasoning models such as DeepSeek-R1 exhibit strong complex problems solving abilities by generating long chain-of-thought (CoT) reasoning steps. It is challenging to directly train small language models (SLMs) to emerge long…

Computation and Language · Computer Science 2025-06-19 Zhaoyang Wang , Jinqi Jiang , Tian Qiu , Hui Liu , Xianfeng Tang , Huaxiu Yao

Teaching LLMs to Think Mathematically: A Critical Study of Decision-Making via Optimization

This paper investigates the capabilities of large language models (LLMs) in formulating and solving decision-making problems using mathematical programming. We first conduct a systematic review and meta-analysis of recent literature to…

Artificial Intelligence · Computer Science 2025-08-26 Mohammad J. Abdel-Rahman , Yasmeen Alslman , Dania Refai , Amro Saleh , Malik A. Abu Loha , Mohammad Yahya Hamed

Reasoning Distillation and Structural Alignment for Improved Code Generation

Effective code generation with language models hinges on two critical factors: accurately understanding the intent of the prompt and generating code that applies algorithmic reasoning to produce correct solutions capable of passing diverse…

Artificial Intelligence · Computer Science 2025-10-21 Amir Jalilifard , Anderson de Rezende Rocha , Marcos Medeiros Raimundo

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Large language models (LLMs) excel at complex reasoning tasks but remain computationally expensive, limiting their practical deployment. To address this, recent works have focused on distilling reasoning capabilities into smaller language…

Computation and Language · Computer Science 2025-11-06 Minki Kang , Jongwon Jeong , Seanie Lee , Jaewoong Cho , Sung Ju Hwang

Decoupling Knowledge and Reasoning in LLMs: An Exploration Using Cognitive Dual-System Theory

While large language models (LLMs) leverage both knowledge and reasoning during inference, the capacity to distinguish between them plays a pivotal role in model analysis, interpretability, and development. Inspired by dual-system cognitive…

Artificial Intelligence · Computer Science 2025-07-25 Mutian Yang , Jiandong Gao , Ji Wu

The Valley of Code Reasoning: Scaling Knowledge Distillation of Large Language Models

Distilling the thinking traces of a Large Language Model (LLM) with reasoning capabilities into a smaller model has been proven effective. Yet, there is a scarcity of work done on how model performances scale with the quantity of…

Computation and Language · Computer Science 2025-10-08 Muyu He , Muhammad Ali Shafique , Anand Kumar , Tsach Mackey , Nazneen Rajani

Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning

Large Language Models (LLMs) prompted to generate chain-of-thought (CoT) exhibit impressive reasoning capabilities. Recent attempts at prompt decomposition toward solving complex, multi-step reasoning problems depend on the ability of the…

Computation and Language · Computer Science 2024-02-28 Gurusha Juneja , Subhabrata Dutta , Soumen Chakrabarti , Sunny Manchanda , Tanmoy Chakraborty

Divide-or-Conquer? Which Part Should You Distill Your LLM?

Recent methods have demonstrated that Large Language Models (LLMs) can solve reasoning tasks better when they are encouraged to solve subtasks of the main task first. In this paper we devise a similar strategy that breaks down reasoning…

Computation and Language · Computer Science 2024-11-20 Zhuofeng Wu , He Bai , Aonan Zhang , Jiatao Gu , VG Vinod Vydiswaran , Navdeep Jaitly , Yizhe Zhang

Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought

Large language models (LLMs) have shown remarkable reasoning capabilities given chain-of-thought prompts (examples with intermediate reasoning steps). Existing benchmarks measure reasoning ability indirectly, by evaluating accuracy on…

Computation and Language · Computer Science 2023-03-03 Abulhair Saparov , He He

Computational Thinking Reasoning in Large Language Models

While large language models (LLMs) have demonstrated remarkable reasoning capabilities, they often struggle with complex tasks that require specific thinking paradigms, such as divide-and-conquer and procedural deduction, \etc Previous…

Software Engineering · Computer Science 2025-06-05 Kechi Zhang , Ge Li , Jia Li , Huangzhao Zhang , Jingjing Xu , Hao Zhu , Lecheng Wang , Jia Li , Yihong Dong , Jing Mai , Bin Gu , Zhi Jin

Unleashing the Creative Mind: Language Model As Hierarchical Policy For Improved Exploration on Challenging Problem Solving

Large Language Models (LLMs) have achieved tremendous progress, yet they still often struggle with challenging reasoning problems. Current approaches address this challenge by sampling or searching detailed and low-level reasoning chains.…

Artificial Intelligence · Computer Science 2023-12-07 Zhan Ling , Yunhao Fang , Xuanlin Li , Tongzhou Mu , Mingu Lee , Reza Pourreza , Roland Memisevic , Hao Su

PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning

While large language models (LLMs) excel in various natural language processing tasks, their huge size and the inaccessibility of parameters present challenges for practical deployment. Previous studies try to distill task-specific ability…

Computation and Language · Computer Science 2024-03-21 Xuekai Zhu , Biqing Qi , Kaiyan Zhang , Xinwei Long , Zhouhan Lin , Bowen Zhou

PAL: Program-aided Language Models

Large language models (LLMs) have recently demonstrated an impressive ability to perform arithmetic and symbolic reasoning tasks, when provided with a few examples at test time ("few-shot prompting"). Much of this success can be attributed…

Computation and Language · Computer Science 2023-01-30 Luyu Gao , Aman Madaan , Shuyan Zhou , Uri Alon , Pengfei Liu , Yiming Yang , Jamie Callan , Graham Neubig

Teaching Algorithmic Reasoning via In-context Learning

Large language models (LLMs) have shown increasing in-context learning capabilities through scaling up model and data size. Despite this progress, LLMs are still unable to solve algorithmic reasoning problems. While providing a rationale…

Machine Learning · Computer Science 2022-11-17 Hattie Zhou , Azade Nova , Hugo Larochelle , Aaron Courville , Behnam Neyshabur , Hanie Sedghi