Related papers: Self-Taught Optimizer (STOP): Recursively Self-Imp…

Can Large Language Models Invent Algorithms to Improve Themselves?: Algorithm Discovery for Recursive Self-Improvement through Reinforcement Learning

Large Language Models (LLMs) have achieved remarkable capabilities, yet their improvement methods remain fundamentally constrained by human design. We present Self-Developing, a framework that enables LLMs to autonomously discover,…

Computation and Language · Computer Science 2025-06-11 Yoichi Ishibashi , Taro Yano , Masafumi Oyamada

Language Models Can Teach Themselves to Program Better

Recent Language Models (LMs) achieve breakthrough performance in code generation when trained on human-authored problems, even solving some competitive-programming problems. Self-play has proven useful in games such as Go, and thus it is…

Machine Learning · Computer Science 2023-04-13 Patrick Haluptzok , Matthew Bowers , Adam Tauman Kalai

Self-Programming Artificial Intelligence Using Code-Generating Language Models

Recent progress in large-scale language models has enabled breakthroughs in previously intractable computer programming tasks. Prior work in meta-learning and neural architecture search has led to substantial successes across various task…

Artificial Intelligence · Computer Science 2023-02-06 Alex Sheng , Shankar Padmanabhan

Self-Improvement in Language Models: The Sharpening Mechanism

Recent work in language modeling has raised the possibility of self-improvement, where a language models evaluates and refines its own generations to achieve higher performance without external feedback. It is impossible for this…

Artificial Intelligence · Computer Science 2024-12-05 Audrey Huang , Adam Block , Dylan J. Foster , Dhruv Rohatgi , Cyril Zhang , Max Simchowitz , Jordan T. Ash , Akshay Krishnamurthy

Iterative Self-Training for Code Generation via Reinforced Re-Ranking

Generating high-quality code that solves complex programming tasks is challenging, especially with current decoder-based models that produce highly stochastic outputs. In code generation, even minor errors can easily break the entire…

Computation and Language · Computer Science 2025-04-15 Nikita Sorokin , Ivan Sedykh , Valentin Malykh

Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges

Large language models often struggle with length generalization and solving complex problem instances beyond their training distribution. We present a self-improvement approach where models iteratively generate and learn from their own…

Machine Learning · Computer Science 2025-02-14 Nayoung Lee , Ziyang Cai , Avi Schwarzschild , Kangwook Lee , Dimitris Papailiopoulos

Self-Refine: Iterative Refinement with Self-Feedback

Like humans, large language models (LLMs) do not always generate the best output on their first try. Motivated by how humans refine their written text, we introduce Self-Refine, an approach for improving initial outputs from LLMs through…

Computation and Language · Computer Science 2023-05-29 Aman Madaan , Niket Tandon , Prakhar Gupta , Skyler Hallinan , Luyu Gao , Sarah Wiegreffe , Uri Alon , Nouha Dziri , Shrimai Prabhumoye , Yiming Yang , Shashank Gupta , Bodhisattwa Prasad Majumder , Katherine Hermann , Sean Welleck , Amir Yazdanbakhsh , Peter Clark

Self-Improvement of Large Language Models: A Technical Overview and Future Outlook

As large language models (LLMs) continue to advance, improving them solely through human supervision is becoming increasingly costly and limited in scalability. As models approach human-level capabilities in certain domains, human feedback…

Computation and Language · Computer Science 2026-03-27 Haoyan Yang , Mario Xerri , Solha Park , Huajian Zhang , Yiyang Feng , Sai Akhil Kogilathota , Jiawei Zhou

The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?

Self-improving large language models (LLMs) -- i.e., to improve the performance of an LLM by fine-tuning it with synthetic data generated by itself -- is a promising way to advance the capabilities of LLMs while avoiding extensive…

Computation and Language · Computer Science 2025-02-20 Yutao Sun , Mingshuai Chen , Tiancheng Zhao , Ruochen Xu , Zilun Zhang , Jianwei Yin

AutoCodeRover: Autonomous Program Improvement

Researchers have made significant progress in automating the software development process in the past decades. Recent progress in Large Language Models (LLMs) has significantly impacted the development process, where developers can use…

Software Engineering · Computer Science 2024-07-26 Yuntong Zhang , Haifeng Ruan , Zhiyu Fan , Abhik Roychoudhury

Scaffolding Language Learning via Multi-modal Tutoring Systems with Pedagogical Instructions

Intelligent tutoring systems (ITSs) that imitate human tutors and aim to provide immediate and customized instructions or feedback to learners have shown their effectiveness in education. With the emergence of generative artificial…

Computation and Language · Computer Science 2024-04-05 Zhengyuan Liu , Stella Xin Yin , Carolyn Lee , Nancy F. Chen

From Seed AI to Technological Singularity via Recursively Self-Improving Software

Software capable of improving itself has been a dream of computer scientists since the inception of the field. In this work we provide definitions for Recursively Self-Improving software, survey different types of self-improving software,…

Artificial Intelligence · Computer Science 2015-02-24 Roman V. Yampolskiy

Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models

Self-improvement is a mechanism in Large Language Model (LLM) pre-training, post-training and test-time inference. We explore a framework where the model verifies its own outputs, filters or reweights data based on this verification, and…

Computation and Language · Computer Science 2025-02-26 Yuda Song , Hanlin Zhang , Carson Eisenach , Sham Kakade , Dean Foster , Udaya Ghai

Bounded Recursive Self-Improvement

We have designed a machine that becomes increasingly better at behaving in underspecified circumstances, in a goal-directed way, on the job, by modeling itself and its environment as experience accumulates. Based on principles of…

Artificial Intelligence · Computer Science 2013-12-25 E. Nivel , K. R. Thórisson , B. R. Steunebrink , H. Dindo , G. Pezzulo , M. Rodriguez , C. Hernandez , D. Ognibene , J. Schmidhuber , R. Sanz , H. P. Helgason , A. Chella , G. K. Jonsson

Self-Infilling Code Generation

This work introduces self-infilling code generation, a general framework that incorporates infilling operations into auto-regressive decoding. Our approach capitalizes on the observation that recent infilling-capable code language models…

Programming Languages · Computer Science 2024-05-28 Lin Zheng , Jianbo Yuan , Zhi Zhang , Hongxia Yang , Lingpeng Kong

Test-Oriented Programming: rethinking coding for the GenAI era

Large language models (LLMs) have shown astonishing capability of generating software code, leading to its use to support developers in programming. Proposed tools have relied either on assistants for improved auto-complete or multi-agents,…

Software Engineering · Computer Science 2026-04-10 Jorge Melegati

CodeExemplar: Example-Based Scaffolding for Introductory Programming in the GenAI Era

Generative AI (GenAI) can generate working code with minimal effort, creating a tension in introductory programming: students need timely help, yet direct solutions invite copying and can short-circuit reasoning. To address this, we propose…

Human-Computer Interaction · Computer Science 2026-03-26 Boxuan Ma , Shinichi Konomi

On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks

There has been considerable divergence of opinion on the reasoning abilities of Large Language Models (LLMs). While the initial optimism that reasoning might emerge automatically with scale has been tempered thanks to a slew of…

Artificial Intelligence · Computer Science 2024-08-06 Kaya Stechly , Karthik Valmeekam , Subbarao Kambhampati

Self-Improving Language Models for Evolutionary Program Synthesis: A Case Study on ARC-AGI

Many program synthesis tasks prove too challenging for even state-of-the-art language models to solve in single attempts. Search-based evolutionary methods offer a promising alternative by exploring solution spaces iteratively, but their…

Machine Learning · Computer Science 2026-03-17 Julien Pourcel , Cédric Colas , Pierre-Yves Oudeyer

Generating Sequences by Learning to Self-Correct

Sequence generation applications require satisfying semantic constraints, such as ensuring that programs are correct, using certain keywords, or avoiding undesirable content. Language models, whether fine-tuned or prompted with few-shot…

Computation and Language · Computer Science 2022-11-02 Sean Welleck , Ximing Lu , Peter West , Faeze Brahman , Tianxiao Shen , Daniel Khashabi , Yejin Choi