Related papers: Large Language Models Cannot Self-Correct Reasonin…

Large Language Models have Intrinsic Self-Correction Ability

Large language models (LLMs) have attracted significant attention for their exceptional abilities in various natural language processing tasks, but they suffer from hallucinations that will cause performance degradation. One promising…

Computation and Language · Computer Science 2024-12-24 Dancheng Liu , Amir Nassereldine , Ziming Yang , Chenhui Xu , Yuting Hu , Jiajie Li , Utkarsh Kumar , Changjae Lee , Ruiyang Qin , Yiyu Shi , Jinjun Xiong

When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs

Self-correction is an approach to improving responses from large language models (LLMs) by refining the responses using LLMs during inference. Prior work has proposed various self-correction frameworks using different sources of feedback,…

Computation and Language · Computer Science 2024-12-05 Ryo Kamoi , Yusen Zhang , Nan Zhang , Jiawei Han , Rui Zhang

Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

Large language models (LLMs) have demonstrated remarkable performance across a wide array of NLP tasks. However, their efficacy is undermined by undesired and inconsistent behaviors, including hallucination, unfaithful reasoning, and toxic…

Computation and Language · Computer Science 2023-08-31 Liangming Pan , Michael Saxon , Wenda Xu , Deepak Nathani , Xinyi Wang , William Yang Wang

Language Models can perform Single-Utterance Self-Correction of Perturbed Reasoning

Large Language Models (LLMs) have demonstrated impressive mathematical reasoning capabilities, yet their performance remains brittle to minor variations in problem description and prompting strategy. Furthermore, reasoning is vulnerable to…

Computation and Language · Computer Science 2025-06-23 Sam Silver , Jimin Sun , Ivan Zhang , Sara Hooker , Eddie Kim

On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and Latent Concept

Large Language Models (LLMs) are able to improve their responses when instructed to do so, a capability known as self-correction. When instructions provide only the task's goal without specific details about potential issues in the…

Computation and Language · Computer Science 2024-11-11 Guangliang Liu , Haitao Mao , Bochuan Cao , Zhiyu Xue , Xitong Zhang , Rongrong Wang , Jiliang Tang , Kristen Johnson

On the Convergence of Moral Self-Correction in Large Language Models

Large Language Models (LLMs) are able to improve their responses when instructed to do so, a capability known as self-correction. When instructions provide only a general and abstract goal without specific details about potential issues in…

Computation and Language · Computer Science 2025-10-28 Guangliang Liu , Haitao Mao , Bochuan Cao , Zhiyu Xue , Xitong Zhang , Rongrong Wang , Kristen Marie Johnson

Self-Correction Makes LLMs Better Parsers

Large language models (LLMs) have achieved remarkable success across various natural language processing (NLP) tasks. However, recent studies suggest that they still face challenges in performing fundamental NLP tasks essential for deep…

Computation and Language · Computer Science 2025-04-22 Ziyan Zhang , Yang Hou , Chen Gong , Zhenghua Li

Small Language Models Need Strong Verifiers to Self-Correct Reasoning

Self-correction has emerged as a promising solution to boost the reasoning performance of large language models (LLMs), where LLMs refine their solutions using self-generated critiques that pinpoint the errors. This work explores whether…

Computation and Language · Computer Science 2024-06-07 Yunxiang Zhang , Muhammad Khalifa , Lajanugen Logeswaran , Jaekyeom Kim , Moontae Lee , Honglak Lee , Lu Wang

Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs

Self-correction of large language models (LLMs) emerges as a critical component for enhancing their reasoning performance. Although various self-correction methods have been proposed, a comprehensive evaluation of these methods remains…

Computation and Language · Computer Science 2025-10-23 Guiyao Tie , Zenghui Yuan , Zeli Zhao , Chaoran Hu , Tianhe Gu , Ruihang Zhang , Sizhe Zhang , Junran Wu , Xiaoyue Tu , Ming Jin , Qingsong Wen , Lixing Chen , Pan Zhou , Lichao Sun

Self-Correcting Large Language Models: Generation vs. Multiple Choice

Large language models have recently demonstrated remarkable abilities to self-correct their responses through iterative refinement, often referred to as self-consistency or self-reflection. However, the dynamics of this self-correction…

Computation and Language · Computer Science 2025-11-13 Hossein A. Rahmani , Satyapriya Krishna , Xi Wang , Mohammadmehdi Naghiaei , Emine Yilmaz

Smaller Large Language Models Can Do Moral Self-Correction

Self-correction is one of the most amazing emerging capabilities of Large Language Models (LLMs), enabling LLMs to self-modify an inappropriate output given a natural language feedback which describes the problems of that output. Moral…

Computation and Language · Computer Science 2025-03-04 Guangliang Liu , Zhiyu Xue , Xitong Zhang , Rongrong Wang , Kristen Marie Johnson

Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of Large Language Models

The recent success of Large Language Models (LLMs) has catalyzed an increasing interest in their self-correction capabilities. This paper presents a comprehensive investigation into the intrinsic self-correction of LLMs, attempting to…

Computation and Language · Computer Science 2024-05-14 Loka Li , Zhenhao Chen , Guangyi Chen , Yixuan Zhang , Yusheng Su , Eric Xing , Kun Zhang

A Theoretical Understanding of Self-Correction through In-context Alignment

Going beyond mimicking limited human experiences, recent studies show initial evidence that, like humans, large language models (LLMs) are capable of improving their abilities purely by self-correction, i.e., correcting previous responses…

Machine Learning · Computer Science 2024-11-19 Yifei Wang , Yuyang Wu , Zeming Wei , Stefanie Jegelka , Yisen Wang

Large Language Models Can Self-Correct with Key Condition Verification

Intrinsic self-correct was a method that instructed large language models (LLMs) to verify and correct their responses without external feedback. Unfortunately, the study concluded that the LLMs could not self-correct reasoning yet. We find…

Computation and Language · Computer Science 2024-10-04 Zhenyu Wu , Qingkai Zeng , Zhihan Zhang , Zhaoxuan Tan , Chao Shen , Meng Jiang

Self-Correction Bench: Uncovering and Addressing the Self-Correction Blind Spot in Large Language Models

Although large language models (LLMs) have transformed AI, they still make mistakes and can explore unproductive reasoning paths. Self-correction capability is essential for deploying LLMs in safety-critical applications. We uncover a…

Computation and Language · Computer Science 2025-10-07 Ken Tsui

On the Intersection of Self-Correction and Trust in Language Models

Large Language Models (LLMs) have demonstrated remarkable capabilities in performing complex cognitive tasks. However, their complexity and lack of transparency have raised several trustworthiness concerns, including the propagation of…

Machine Learning · Computer Science 2023-11-07 Satyapriya Krishna

Self-correction is Not An Innate Capability in Language Models

Although there has been growing interest in the self-correction capability of Large Language Models (LLMs), there are varying conclusions about its effectiveness. Prior research has largely concentrated on intrinsic self-correction,…

Computation and Language · Computer Science 2026-01-23 Guangliang Liu , Zimo Qi , Xitong Zhang , Lu Cheng , Kristen Marie Johnson

Large Language Models Can Self-Improve in Long-context Reasoning

Large language models (LLMs) have achieved substantial progress in processing long contexts but still struggle with long-context reasoning. Existing approaches typically involve fine-tuning LLMs with synthetic data, which depends on…

Computation and Language · Computer Science 2024-11-14 Siheng Li , Cheng Yang , Zesen Cheng , Lemao Liu , Mo Yu , Yujiu Yang , Wai Lam

Self-rewarding correction for mathematical reasoning

We study self-rewarding reasoning large language models (LLMs), which can simultaneously generate step-by-step reasoning and evaluate the correctness of their outputs during the inference time-without external feedback. This integrated…

Artificial Intelligence · Computer Science 2025-02-28 Wei Xiong , Hanning Zhang , Chenlu Ye , Lichang Chen , Nan Jiang , Tong Zhang

Calibrating Reasoning in Language Models with Internal Consistency

Large language models (LLMs) have demonstrated impressive capabilities in various reasoning tasks, aided by techniques like chain-of-thought prompting that elicits verbalized reasoning. However, LLMs often generate text with obvious…

Artificial Intelligence · Computer Science 2024-12-06 Zhihui Xie , Jizhou Guo , Tong Yu , Shuai Li