Related papers: CRISP: Complex Reasoning with Interpretable Step-b…

iCLP: Large Language Model Reasoning with Implicit Cognition Latent Planning

Large language models (LLMs), when guided by explicit textual plans, can perform reliable step-by-step reasoning during problem-solving. However, generating accurate and effective textual plans remains challenging due to LLM hallucinations…

Computation and Language · Computer Science 2026-01-01 Sijia Chen , Di Niu

Guiding Language Model Reasoning with Planning Tokens

Large language models (LLMs) have recently attracted considerable interest for their ability to perform complex reasoning tasks, such as chain-of-thought (CoT) reasoning. However, most of the existing approaches to enhance this ability rely…

Computation and Language · Computer Science 2024-08-08 Xinyi Wang , Lucas Caccia , Oleksiy Ostapenko , Xingdi Yuan , William Yang Wang , Alessandro Sordoni

Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models

Large language models (LLMs) have recently been shown to deliver impressive performance in various NLP tasks. To tackle multi-step reasoning tasks, few-shot chain-of-thought (CoT) prompting includes a few manually crafted step-by-step…

Computation and Language · Computer Science 2023-05-29 Lei Wang , Wanyu Xu , Yihuai Lan , Zhiqiang Hu , Yunshi Lan , Roy Ka-Wei Lee , Ee-Peng Lim

Step-TP: A Grounded, Step-Level Dataset with Chain-of-Thought Reasoning for LLM-Guided Tensor Program Optimization

Despite the strong reasoning capabilities of large language models (LLMs), optimizing the execution efficiency of tensor programs remains challenging due to the need for precise, composable transformation decisions. Recent LLM-guided…

Machine Learning · Computer Science 2026-05-26 Mengfan Liu , Da Zheng , Junwei Su , Chuan Wu

Fixing the Broken Compass: Diagnosing and Improving Inference-Time Reward Modeling

Inference-time scaling techniques have shown promise in enhancing the reasoning capabilities of large language models (LLMs). While recent research has primarily focused on training-time optimization, our work highlights inference-time…

Computation and Language · Computer Science 2026-02-12 Jiachun Li , Pengfei Cao , Zhuoran Jin , Yubo Chen , Jiexin Xu , Huaijun Li , Xiaojian Jiang , Kang Liu , Jun Zhao

Learning Multi-Step Reasoning by Solving Arithmetic Tasks

Mathematical reasoning is regarded as a necessary ability for Language Models (LMs). Recent works demonstrate large LMs' impressive performance in solving math problems. The success is attributed to their Chain-of-Thought (CoT) reasoning…

Computation and Language · Computer Science 2023-06-08 Tianduo Wang , Wei Lu

Cross-lingual Prompting: Improving Zero-shot Chain-of-Thought Reasoning across Languages

Chain-of-thought (CoT) is capable of eliciting models to explicitly generate reasoning paths, thus promoting reasoning accuracy and attracting increasing attention. Specifically, zero-shot CoT achieves remarkable improvements in a wide…

Computation and Language · Computer Science 2023-10-24 Libo Qin , Qiguang Chen , Fuxuan Wei , Shijue Huang , Wanxiang Che

Tree-of-Mixed-Thought: Combining Fast and Slow Thinking for Multi-hop Visual Reasoning

There emerges a promising trend of using large language models (LLMs) to generate code-like plans for complex inference tasks such as visual reasoning. This paradigm, known as LLM-based planning, provides flexibility in problem solving and…

Computation and Language · Computer Science 2023-08-22 Pengbo Hu , Ji Qi , Xingyu Li , Hong Li , Xinqi Wang , Bing Quan , Ruiyu Wang , Yi Zhou

Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

Large Language Models (LLMs) have demonstrated impressive capability in many natural language tasks. However, the auto-regressive generation process makes LLMs prone to produce errors, hallucinations and inconsistent statements when…

Artificial Intelligence · Computer Science 2024-07-23 Chaojie Wang , Yanchen Deng , Zhiyi Lyu , Liang Zeng , Jujie He , Shuicheng Yan , Bo An

Better Zero-Shot Reasoning with Self-Adaptive Prompting

Modern large language models (LLMs) have demonstrated impressive capabilities at sophisticated tasks, often through step-by-step reasoning similar to humans. This is made possible by their strong few and zero-shot abilities -- they can…

Computation and Language · Computer Science 2023-05-24 Xingchen Wan , Ruoxi Sun , Hanjun Dai , Sercan O. Arik , Tomas Pfister

ThinkSum: Probabilistic reasoning over sets using large language models

Large language models (LLMs) have a substantial capacity for high-level analogical reasoning: reproducing patterns in linear text that occur in their training data (zero-shot evaluation) or in the provided context (few-shot in-context…

Computation and Language · Computer Science 2023-06-05 Batu Ozturkler , Nikolay Malkin , Zhen Wang , Nebojsa Jojic

Multi-Step Reasoning with Large Language Models, a Survey

Large language models (LLMs) with billions of parameters exhibit in-context learning abilities, enabling few-shot learning on tasks that the model was not specifically trained for. Traditional models achieve breakthrough performance on…

Artificial Intelligence · Computer Science 2025-11-04 Aske Plaat , Annie Wong , Suzan Verberne , Joost Broekens , Niki van Stein , Thomas Back

CRISP: Compressed Reasoning via Iterative Self-Policy Distillation

Reasoning models think out loud, but much of what they say is noise. We introduce CRISP (Compressed Reasoning via Iterative Self-Policy Distillation), a method that teaches models to reason more concisely by distilling their own concise…

Machine Learning · Computer Science 2026-04-14 Hejian Sang , Yuanda Xu , Zhengze Zhou , Ran He , Zhipeng Wang , Jiachen Sun

CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks

Post-training, particularly reinforcement learning (RL) using self-play-generated data, has become a new learning paradigm for large language models (LLMs). However, scaling RL to develop a general reasoner remains a research challenge, as…

Artificial Intelligence · Computer Science 2024-10-02 Tianlong Wang , Junzhe Chen , Xueting Han , Jing Bai

Complexity-Based Prompting for Multi-Step Reasoning

We study the task of prompting large-scale language models to perform multi-step reasoning. Existing work shows that when prompted with a chain of thoughts (CoT), sequences of short sentences describing intermediate reasoning steps towards…

Computation and Language · Computer Science 2023-01-31 Yao Fu , Hao Peng , Ashish Sabharwal , Peter Clark , Tushar Khot

Reasoning with Language Model is Planning with World Model

Large language models (LLMs) have shown remarkable reasoning capabilities, especially when prompted to generate intermediate reasoning steps (e.g., Chain-of-Thought, CoT). However, LLMs can still struggle with problems that are easy for…

Computation and Language · Computer Science 2023-10-24 Shibo Hao , Yi Gu , Haodi Ma , Joshua Jiahua Hong , Zhen Wang , Daisy Zhe Wang , Zhiting Hu

Step Guided Reasoning: Improving Mathematical Reasoning using Guidance Generation and Step Reasoning

Mathematical reasoning has been challenging for large language models (LLMs), and the introduction of step-by-step Chain-of-Thought (CoT) inference has significantly advanced the mathematical capabilities of LLMs. However, current…

Artificial Intelligence · Computer Science 2025-09-23 Lang Cao , Yingtian Zou , Chao Peng , Renhong Chen , Wu Ning , Yitong Li

Once Upon an Input: Reasoning via Per-Instance Program Synthesis

Large language models (LLMs) excel at zero-shot inference but continue to struggle with complex, multi-step reasoning. Recent methods that augment LLMs with intermediate reasoning steps such as Chain of Thought (CoT) and Program of Thought…

Computation and Language · Computer Science 2025-10-28 Adam Stein , Neelay Velingker , Mayur Naik , Eric Wong

Unlocking Structured Thinking in Language Models with Cognitive Prompting

We propose cognitive prompting as a novel approach to guide problem-solving in large language models (LLMs) through structured, human-like cognitive operations, such as goal clarification, decomposition, filtering, abstraction, and pattern…

Computation and Language · Computer Science 2024-12-03 Oliver Kramer , Jill Baumann

Increasing LLM Coding Capabilities through Diverse Synthetic Coding Tasks

Large language models (LLMs) have shown impressive promise in code generation, yet their progress remains limited by the shortage of large-scale datasets that are both diverse and well-aligned with human reasoning. Most existing resources…

Machine Learning · Computer Science 2025-10-28 Amal Abed , Ivan Lukic , Jörg K. H. Franke , Frank Hutter