English
Related papers

Related papers: Learning to Self-Evolve

200 papers

Large Language Models (LLMs) have demonstrated remarkable versatility across various domains. To further advance LLMs, we propose 'SELF' (Self-Evolution with Language Feedback), a novel approach that enables LLMs to self-improve through…

Computation and Language · Computer Science 2024-02-02 Jianqiao Lu , Wanjun Zhong , Wenyong Huang , Yufei Wang , Qi Zhu , Fei Mi , Baojun Wang , Weichao Wang , Xingshan Zeng , Lifeng Shang , Xin Jiang , Qun Liu

Large language models (LLMs) have significantly advanced in various fields and intelligent agent applications. However, current LLMs that learn from human or external model supervision are costly and may face performance ceilings as task…

Computation and Language · Computer Science 2024-06-04 Zhengwei Tao , Ting-En Lin , Xiancai Chen , Hangyu Li , Yuchuan Wu , Yongbin Li , Zhi Jin , Fei Huang , Dacheng Tao , Jingren Zhou

Reinforcement learning (RL) has proven effective for fine-tuning large language models (LLMs), significantly enhancing their reasoning abilities in domains such as mathematics and code generation. A crucial factor influencing RL fine-tuning…

Artificial Intelligence · Computer Science 2025-10-31 Xiaoyin Chen , Jiarui Lu , Minsu Kim , Dinghuai Zhang , Jian Tang , Alexandre Piché , Nicolas Gontier , Yoshua Bengio , Ehsan Kamalloo

Recent advancements in prompt engineering strategies, such as Chain-of-Thought (CoT) and Self-Discover, have demonstrated significant potential in improving the reasoning abilities of Large Language Models (LLMs). However, these…

Computation and Language · Computer Science 2024-10-15 Krishna Aswani , Huilin Lu , Pranav Patankar , Priya Dhalwani , Iris Tan , Jayant Ganeshmohan , Simon Lacasse

Self-evolving large language models (LLMs) learn by generating their own training tasks and solutions, reducing reliance on human-curated supervision. However, in many reasoning domains, the model must also validate generated tasks and…

Artificial Intelligence · Computer Science 2026-05-28 Bowen Wei , Nan Wang , Yuqing Zhou , Jinhao Pan , Ziwei Zhu

Self-Refinement refers to a model's ability to revise its own responses to produce improved outputs. This capability can also serve as a fundamental mechanism for Self-Improvement, for example, by reconstructing datasets with refined…

Computation and Language · Computer Science 2025-10-28 Yongcheng Zeng , Xinyu Cui , Xuanfa Jin , Qirui Mi , Guoqing Liu , Zexu Sun , Mengyue Yang , Dong Li , Weiyu Ma , Ning Yang , Jian Zhao , Jianye Hao , Haifeng Zhang , Jun Wang

Large language models (LLMs) are powerful but static; they lack mechanisms to adapt their weights in response to new tasks, knowledge, or examples. We introduce Self-Adapting LLMs (SEAL), a framework that enables LLMs to self-adapt by…

Machine Learning · Computer Science 2025-09-19 Adam Zweiger , Jyothish Pari , Han Guo , Ekin Akyürek , Yoon Kim , Pulkit Agrawal

The capabilities of Large Language Models (LLMs) are limited to some extent by pre-training, so some researchers optimize LLMs through post-training. Existing post-training strategies, such as memory-based retrieval or preference…

Computation and Language · Computer Science 2025-07-22 Haoran Sun , Zekun Zhang , Shaoning Zeng

Reinforcement learning (RL) has demonstrated potential in enhancing the reasoning capabilities of large language models (LLMs), but such training typically demands substantial efforts in creating and annotating data. In this work, we…

Computation and Language · Computer Science 2025-10-06 Hangfan Zhang , Siyuan Xu , Zhimeng Guo , Huaisheng Zhu , Shicheng Liu , Xinrun Wang , Qiaosheng Zhang , Yang Chen , Peng Ye , Lei Bai , Shuyue Hu

Large Language Models (LLMs) have achieved remarkable capabilities, yet their improvement methods remain fundamentally constrained by human design. We present Self-Developing, a framework that enables LLMs to autonomously discover,…

Computation and Language · Computer Science 2025-06-11 Yoichi Ishibashi , Taro Yano , Masafumi Oyamada

Large Language Models (LLMs) have demonstrated remarkable capabilities on various tasks, while the further evolvement is limited to the lack of high-quality training data. In addition, traditional training approaches rely too much on…

Computation and Language · Computer Science 2025-02-14 Peidong Wang , Ming Wang , Zhiming Ma , Xiaocui Yang , Shi Feng , Daling Wang , Yifei Zhang , Kaisong Song

Building agentic systems that can autonomously self-improve from experience is a longstanding goal of AI. Large language models (LLMs) today primarily self-improve via two mechanisms: self-reflection for context updates, and reinforcement…

Artificial Intelligence · Computer Science 2026-02-26 Lunjun Zhang , Ryan Chen , Bradly C. Stadie

To improve the performance of large language models (LLMs), researchers have explored providing LLMs with textual task-solving experience via prompts. However, they rely on manual efforts to acquire and apply such experience for each task,…

Computation and Language · Computer Science 2024-07-15 Jinglong Gao , Xiao Ding , Yiming Cui , Jianbai Zhao , Hepeng Wang , Ting Liu , Bing Qin

This paper presents a benchmark self-evolving framework to dynamically evaluate rapidly advancing Large Language Models (LLMs), aiming for a more accurate assessment of their capabilities and limitations. We utilize a multi-agent system to…

Computation and Language · Computer Science 2024-02-20 Siyuan Wang , Zhuohan Long , Zhihao Fan , Zhongyu Wei , Xuanjing Huang

Large Language Models (LLMs) have recently advanced many applications on software engineering tasks, particularly the potential for code generation. Among contemporary challenges, code generated by LLMs often suffers from inaccuracies and…

Software Engineering · Computer Science 2024-08-29 Thai Tang Quoc , Duc Ha Minh , Tho Quan Thanh , Anh Nguyen-Duc

Large language models (LLMs) have achieved substantial progress in processing long contexts but still struggle with long-context reasoning. Existing approaches typically involve fine-tuning LLMs with synthetic data, which depends on…

Computation and Language · Computer Science 2024-11-14 Siheng Li , Cheng Yang , Zesen Cheng , Lemao Liu , Mo Yu , Yujiu Yang , Wai Lam

Large Language Models (LLMs) have demonstrated remarkable progress in complex reasoning tasks through both post-training and test-time scaling laws. While prevalent test-time scaling approaches are often realized by using external reward…

Machine Learning · Computer Science 2025-10-31 Fuxiang Zhang , Jiacheng Xu , Chaojie Wang , Ce Cui , Yang Liu , Bo An

Reinforcement learning with verifiable rewards improves reasoning in large language models (LLMs), but many methods still rely on large human-labeled datasets. While self-play reduces this dependency, it often lacks explicit planning and…

Artificial Intelligence · Computer Science 2026-03-18 Yulin Peng , Xinxin Zhu , Chenxing Wei , Nianbo Zeng , Leilei Wang , Ying Tiffany He , F. Richard Yu

This technical report briefly describes our JDExplore d-team's Vega v2 submission on the SuperGLUE leaderboard. SuperGLUE is more challenging than the widely used general language understanding evaluation (GLUE) benchmark, containing eight…

Computation and Language · Computer Science 2022-12-06 Qihuang Zhong , Liang Ding , Yibing Zhan , Yu Qiao , Yonggang Wen , Li Shen , Juhua Liu , Baosheng Yu , Bo Du , Yixin Chen , Xinbo Gao , Chunyan Miao , Xiaoou Tang , Dacheng Tao

Large Language Model (LLM)-based agents have demonstrated remarkable capabilities in complex reasoning and multi-turn interactions but struggle to continuously improve and adapt when deployed in new environments. One promising approach is…

Artificial Intelligence · Computer Science 2026-03-11 Jiongxiao Wang , Qiaojing Yan , Yawei Wang , Yijun Tian , Soumya Smruti Mishra , Zhichao Xu , Megha Gandhi , Panpan Xu , Lin Lee Cheong
‹ Prev 1 2 3 10 Next ›