Related papers: Self-Refine: Iterative Refinement with Self-Feedba…

LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback

Recent large language models (LLM) are leveraging human feedback to improve their generation quality. However, human feedback is costly to obtain, especially during inference. In this work, we propose LLMRefine, an inference time…

Computation and Language · Computer Science 2024-10-28 Wenda Xu , Daniel Deutsch , Mara Finkelstein , Juraj Juraska , Biao Zhang , Zhongtao Liu , William Yang Wang , Lei Li , Markus Freitag

Refining the Responses of LLMs by Themselves

In this paper, we propose a simple yet efficient approach based on prompt engineering that leverages the large language model itself to optimize its answers without relying on auxiliary models. We introduce an iterative self-evaluating…

Computation and Language · Computer Science 2023-05-09 Tianqiang Yan , Tiansheng Xu

SELF: Self-Evolution with Language Feedback

Large Language Models (LLMs) have demonstrated remarkable versatility across various domains. To further advance LLMs, we propose 'SELF' (Self-Evolution with Language Feedback), a novel approach that enables LLMs to self-improve through…

Computation and Language · Computer Science 2024-02-02 Jianqiao Lu , Wanjun Zhong , Wenyong Huang , Yufei Wang , Qi Zhu , Fei Mi , Baojun Wang , Weichao Wang , Xingshan Zeng , Lifeng Shang , Xin Jiang , Qun Liu

What Does LLM Refinement Actually Improve? A Systematic Study on Document-Level Literary Translation

Iterative self-refinement is a simple inference-time strategy for machine translation: an LLM revises its own translation over multiple inference-time passes. Yet document-scale refinement remains poorly understood: 1) which pipelines work…

Computation and Language · Computer Science 2026-05-14 Shaomu Tan , Dawei Zhu , Ke Tran , Michael Denkowski , Sony Trenous , Bill Byrne , Leonardo Ribeiro , Felix Hieber

Self Iterative Label Refinement via Robust Unlabeled Learning

Recent advances in large language models (LLMs) have yielded impressive performance on various tasks, yet they often depend on high-quality feedback that can be costly. Self-refinement methods attempt to leverage LLMs' internal evaluation…

Computation and Language · Computer Science 2025-12-01 Hikaru Asano , Tadashi Kozuno , Yukino Baba

Linear Feedback Control Systems for Iterative Prompt Optimization in Large Language Models

Large Language Models (LLMs) have revolutionized various applications by generating outputs based on given prompts. However, achieving the desired output requires iterative prompt refinement. This paper presents a novel approach that draws…

Machine Learning · Computer Science 2025-01-22 Rupesh Raj Karn

Evolving LLMs' Self-Refinement Capability via Synergistic Training-Inference Optimization

Self-Refinement refers to a model's ability to revise its own responses to produce improved outputs. This capability can also serve as a fundamental mechanism for Self-Improvement, for example, by reconstructing datasets with refined…

Computation and Language · Computer Science 2025-10-28 Yongcheng Zeng , Xinyu Cui , Xuanfa Jin , Qirui Mi , Guoqing Liu , Zexu Sun , Mengyue Yang , Dong Li , Weiyu Ma , Ning Yang , Jian Zhao , Jianye Hao , Haifeng Zhang , Jun Wang

Cross-Refine: Improving Natural Language Explanation Generation by Learning in Tandem

Natural language explanations (NLEs) are vital for elucidating the reasoning behind large language model (LLM) decisions. Many techniques have been developed to generate NLEs using LLMs. However, like humans, LLMs might not always produce…

Computation and Language · Computer Science 2024-12-03 Qianli Wang , Tatiana Anikina , Nils Feldhus , Simon Ostermann , Sebastian Möller , Vera Schmitt

REFINER: Reasoning Feedback on Intermediate Representations

Language models (LMs) have recently shown remarkable performance on reasoning tasks by explicitly generating intermediate inferences, e.g., chain-of-thought prompting. However, these intermediate inference steps may be inappropriate…

Computation and Language · Computer Science 2024-02-06 Debjit Paul , Mete Ismayilzada , Maxime Peyrard , Beatriz Borges , Antoine Bosselut , Robert West , Boi Faltings

Pride and Prejudice: LLM Amplifies Self-Bias in Self-Refinement

Recent studies show that large language models (LLMs) improve their performance through self-feedback on certain tasks while degrade on others. We discovered that such a contrary is due to LLM's bias in evaluating their own output. In this…

Computation and Language · Computer Science 2024-06-19 Wenda Xu , Guanglei Zhu , Xuandong Zhao , Liangming Pan , Lei Li , William Yang Wang

Iterative Critique-Refine Framework for Enhancing LLM Personalization

Personalized text generation requires models not only to produce coherent text but also to align with a target user's style, tone, and topical focus. Existing retrieval-augmented approaches such as LaMP and PGraphRAG enrich profiles with…

Computation and Language · Computer Science 2025-10-29 Durga Prasad Maram , Dhruvin Gandhi , Zonghai Yao , Gayathri Akkinapalli , Franck Dernoncourt , Yu Wang , Ryan A. Rossi , Nesreen K. Ahmed

LLMLOOP: Improving LLM-Generated Code and Tests through Automated Iterative Feedback Loops

Large Language Models (LLMs) are showing remarkable performance in generating source code, yet the generated code often has issues like compilation errors or incorrect code. Researchers and developers often face wasted effort in…

Software Engineering · Computer Science 2026-03-26 Ravin Ravi , Dylan Bradshaw , Stefano Ruberto , Gunel Jahangirova , Valerio Terragni

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

We explore a method for improving the performance of large language models through self-reflection and reinforcement learning. By incentivizing the model to generate better self-reflections when it answers incorrectly, we demonstrate that a…

Computation and Language · Computer Science 2025-06-02 Shelly Bensal , Umar Jamil , Christopher Bryant , Melisa Russak , Kiran Kamble , Dmytro Mozolevskyi , Muayad Ali , Waseem AlShikh

The ART of LLM Refinement: Ask, Refine, and Trust

In recent years, Large Language Models (LLMs) have demonstrated remarkable generative abilities, but can they judge the quality of their own generations? A popular concept, referred to as self-refinement, postulates that LLMs can detect and…

Computation and Language · Computer Science 2023-11-15 Kumar Shridhar , Koustuv Sinha , Andrew Cohen , Tianlu Wang , Ping Yu , Ram Pasunuru , Mrinmaya Sachan , Jason Weston , Asli Celikyilmaz

Self-rewarding correction for mathematical reasoning

We study self-rewarding reasoning large language models (LLMs), which can simultaneously generate step-by-step reasoning and evaluate the correctness of their outputs during the inference time-without external feedback. This integrated…

Artificial Intelligence · Computer Science 2025-02-28 Wei Xiong , Hanning Zhang , Chenlu Ye , Lichang Chen , Nan Jiang , Tong Zhang

Self-Improvement of Large Language Models: A Technical Overview and Future Outlook

As large language models (LLMs) continue to advance, improving them solely through human supervision is becoming increasingly costly and limited in scalability. As models approach human-level capabilities in certain domains, human feedback…

Computation and Language · Computer Science 2026-03-27 Haoyan Yang , Mario Xerri , Solha Park , Huajian Zhang , Yiyang Feng , Sai Akhil Kogilathota , Jiawei Zhou

Training Language Models with Language Feedback

Pretrained language models often do not perform tasks in ways that are in line with our preferences, e.g., generating offensive text or factually incorrect summaries. Recent work approaches the above issue by learning from a simple form of…

Computation and Language · Computer Science 2022-11-18 Jérémy Scheurer , Jon Ander Campos , Jun Shern Chan , Angelica Chen , Kyunghyun Cho , Ethan Perez

Improving the Validity of Automatically Generated Feedback via Reinforcement Learning

Automatically generating feedback via large language models (LLMs) in intelligent tutoring systems and online learning platforms has the potential to improve the learning outcomes of many students. However, both feedback generation and…

Computation and Language · Computer Science 2024-12-13 Alexander Scarlatos , Digory Smith , Simon Woodhead , Andrew Lan

RefineBench: Evaluating Refinement Capability of Language Models via Checklists

Can language models (LMs) self-refine their own responses? This question is increasingly relevant as a wide range of real-world user interactions involve refinement requests. However, prior studies have largely tested LMs' refinement…

Computation and Language · Computer Science 2025-12-01 Young-Jun Lee , Seungone Kim , Byung-Kwan Lee , Minkyeong Moon , Yechan Hwang , Jong Myoung Kim , Graham Neubig , Sean Welleck , Ho-Jin Choi

Refine-n-Judge: Curating High-Quality Preference Chains for LLM-Fine-Tuning

Large Language Models (LLMs) have demonstrated remarkable progress through preference-based fine-tuning, which critically depends on the quality of the underlying training data. While human feedback is essential for improving data quality,…

Artificial Intelligence · Computer Science 2025-10-31 Derin Cayir , Renjie Tao , Rashi Rungta , Kai Sun , Sean Chen , Haidar Khan , Minseok Kim , Julia Reinspach , Yue Liu