Related papers: Language Models as Continuous Self-Evolving Data E…

A Survey on Self-Evolution of Large Language Models

Large language models (LLMs) have significantly advanced in various fields and intelligent agent applications. However, current LLMs that learn from human or external model supervision are costly and may face performance ceilings as task…

Computation and Language · Computer Science 2024-06-04 Zhengwei Tao, Ting-En Lin, Xiancai Chen, Hangyu Li, Yuchuan Wu, Yongbin Li, Zhi Jin, Fei Huang, Dacheng Tao, Jingren Zhou

Language Models can Self-Lengthen to Generate Long Texts

Recent advancements in Large Language Models (LLMs) have significantly enhanced their ability to process long contexts, yet a notable gap remains in generating long, aligned outputs. This limitation stems from a training gap where…

Computation and Language · Computer Science 2024-11-01 Shanghaoran Quan, Tianyi Tang, Bowen Yu, An Yang, Dayiheng Liu, Bofei Gao, Jianhong Tu, Yichang Zhang, Jingren Zhou, Junyang Lin

Evolution without Large Models: Training Language Model with Task Principles

A common training approach for language models involves using a large-scale language model to expand a human-provided dataset, which is subsequently used for model training. This method significantly reduces training costs by eliminating the…

Computation and Language · Computer Science 2025-07-09 Minghang Zhu, Shen Gao, Zhengliang Shi, Jiabao Fang, Pengjie Ren, Zhaochun Ren, Zhumin Chen, Shuo Shang

Towards Efficient and Effective Alignment of Large Language Models

Large language models (LLMs) exhibit remarkable capabilities across diverse tasks, yet aligning them efficiently and effectively with human expectations remains a critical challenge. This thesis advances LLM alignment by introducing novel…

Computation and Language · Computer Science 2025-06-12 Yuxin Jiang

Leveraging Large Language Model for Automatic Evolving of Industrial Data-Centric R&D Cycle

In the wake of relentless digital transformation, data-driven solutions are emerging as powerful tools to address multifarious industrial tasks such as forecasting, anomaly detection, planning, and even complex decision-making. Although…

Artificial Intelligence · Computer Science 2023-10-18 Xu Yang, Xiao Yang, Weiqing Liu, Jinhui Li, Peng Yu, Zeqi Ye, Jiang Bian

Learning to Self-Evolve

We introduce Learning to Self-Evolve (LSE), a reinforcement learning framework that trains large language models (LLMs) to improve their own contexts at test time. We situate LSE in the setting of test-time self-evolution, where a model…

Computation and Language · Computer Science 2026-03-20 Xiaoyin Chen, Canwen Xu, Yite Wang, Boyi Liu, Zhewei Yao, Yuxiong He

Can Large Language Models Invent Algorithms to Improve Themselves?: Algorithm Discovery for Recursive Self-Improvement through Reinforcement Learning

Large Language Models (LLMs) have achieved remarkable capabilities, yet their improvement methods remain fundamentally constrained by human design. We present Self-Developing, a framework that enables LLMs to autonomously discover,…

Computation and Language · Computer Science 2025-06-11 Yoichi Ishibashi, Taro Yano, Masafumi Oyamada

Towards Next-Generation LLM Training: From the Data-Centric Perspective

Large language models (LLMs) have demonstrated remarkable performance across a wide range of tasks and domains, with data playing a central role in enabling these advances. Despite this success, the preparation and effective utilization of…

Computation and Language · Computer Science 2026-03-17 Hao Liang, Zhengyang Zhao, Zhaoyang Han, Meiyi Qiang, Xiaochen Ma, Bohan Zeng, Qifeng Cai, Zhiyu Li, Linpeng Tang, Weinan E, Wentao Zhang

Efficient Strategy for Improving Large Language Model (LLM) Capabilities

Large Language Models (LLMs) have become a milestone in the field of artificial intelligence and natural language processing. However, their large-scale deployment remains constrained by the need for significant computational resources.…

Computation and Language · Computer Science 2025-08-07 Julián Camilo Velandia Gutiérrez

The Path of Self-Evolving Large Language Models: Achieving Data-Efficient Learning via Intrinsic Feedback

Reinforcement learning (RL) has demonstrated potential in enhancing the reasoning capabilities of large language models (LLMs), but such training typically demands substantial efforts in creating and annotating data. In this work, we…

Computation and Language · Computer Science 2025-10-06 Hangfan Zhang, Siyuan Xu, Zhimeng Guo, Huaisheng Zhu, Shicheng Liu, Xinrun Wang, Qiaosheng Zhang, Yang Chen, Peng Ye, Lei Bai, Shuyue Hu

Large Language Model Agent as a Mechanical Designer

Conventional mechanical design follows an iterative process in which initial concepts are refined through cycles of expert assessment and resource-intensive Finite Element Method (FEM) analysis to meet performance goals. While machine…

Machine Learning · Computer Science 2025-05-02 Yayati Jadhav, Amir Barati Farimani

When Life gives you LLMs, make LLM-ADE: Large Language Models with Adaptive Data Engineering

This paper presents the LLM-ADE framework, a novel methodology for continued pre-training of large language models (LLMs) that addresses the challenges of catastrophic forgetting and double descent. LLM-ADE employs dynamic architectural…

Computational Engineering, Finance, and Science · Computer Science 2024-04-22 Stephen Choi, William Gazeley

Assessing and Advancing Benchmarks for Evaluating Large Language Models in Software Engineering Tasks

Large language models (LLMs) are gaining increasing popularity in software engineering (SE) due to their unprecedented performance across various applications. These models are increasingly being utilized for a range of SE tasks, including…

Software Engineering · Computer Science 2025-11-05 Xing Hu, Feifei Niu, Junkai Chen, Xin Zhou, Junwei Zhang, Junda He, Xin Xia, David Lo

Evaluating Large Language Models on Business Process Modeling: Framework, Benchmark, and Self-Improvement Analysis

Large Language Models (LLMs) are rapidly transforming various fields, and their potential in Business Process Management (BPM) is substantial. This paper assesses the capabilities of LLMs on business process modeling using a framework for…

Databases · Computer Science 2024-12-03 Humam Kourani, Alessandro Berti, Daniel Schuster, Wil M. P. van der Aalst

Self-Improvement of Large Language Models: A Technical Overview and Future Outlook

As large language models (LLMs) continue to advance, improving them solely through human supervision is becoming increasingly costly and limited in scalability. As models approach human-level capabilities in certain domains, human feedback…

Computation and Language · Computer Science 2026-03-27 Haoyan Yang, Mario Xerri, Solha Park, Huajian Zhang, Yiyang Feng, Sai Akhil Kogilathota, Jiawei Zhou

Apprentices to Research Assistants: Advancing Research with Large Language Models

Large Language Models (LLMs) have emerged as powerful tools in various research domains. This article examines their potential through a literature review and firsthand experimentation. While LLMs offer benefits like cost-effectiveness and…

Human-Computer Interaction · Computer Science 2024-04-10 M. Namvarpour, A. Razi

Large Language Models Make Sample-Efficient Recommender Systems

Large language models (LLMs) have achieved remarkable progress in the field of natural language processing (NLP), demonstrating remarkable abilities in producing text that resembles human language for various tasks. This opens up new…

Information Retrieval · Computer Science 2024-06-05 Jianghao Lin, Xinyi Dai, Rong Shan, Bo Chen, Ruiming Tang, Yong Yu, Weinan Zhang

Empowering Large Language Models for Textual Data Augmentation

With the capabilities of understanding and executing natural language instructions, large language models (LLMs) can potentially act as a powerful tool for textual data augmentation. However, the quality of augmented data depends heavily on…

Computation and Language · Computer Science 2024-04-30 Yichuan Li, Kaize Ding, Jianling Wang, Kyumin Lee

Qwen-BIM: developing large language model for BIM-based design with domain-specific benchmark and dataset

As the construction industry advances toward digital transformation, BIM (Building Information Modeling)-based design has become a key driver supporting intelligent construction. Although Large Language Models (LLMs) have shown potential in…

Artificial Intelligence · Computer Science 2026-02-25 Jia-Rui Lin, Yun-Hong Cai, Xiang-Rui Ni, Shaojie Zhou, Peng Pan

A Survey on Large Language Model-based Agents for Statistics and Data Science

In recent years, data science agents powered by Large Language Models (LLMs), known as "data agents," have shown significant potential to transform the traditional data analysis paradigm. This survey provides an overview of the evolution,…

Artificial Intelligence · Computer Science 2025-12-01 Maojun Sun, Ruijian Han, Binyan Jiang, Houduo Qi, Defeng Sun, Yancheng Yuan, Jian Huang