Related papers: CODESKILL: Learning Self-Evolving Skills for Codin…

CoEvoSkills: Self-Evolving Agent Skills via Co-Evolutionary Verification

Anthropic proposes the concept of skills for LLM agents to tackle multi-step professional tasks that simple tool invocations cannot address. A tool is a single, self-contained function, whereas a skill is a structured bundle of…

Artificial Intelligence · Computer Science 2026-04-14 Hanrong Zhang , Shicheng Fan , Henry Peng Zou , Yankai Chen , Zhenting Wang , Jiayu Zhou , Chengze Li , Wei-Chieh Huang , Yifei Yao , Kening Zheng , Xue Liu , Xiaoxiao Li , Philip S. Yu

EvoSkill: Automated Skill Discovery for Multi-Agent Systems

Coding agents are increasingly used as general-purpose problem solvers, but their flexibility does not by itself confer the domain expertise needed for specialized tasks. Recent work addresses this through \textit{agent skills}: reusable…

Artificial Intelligence · Computer Science 2026-03-04 Salaheddin Alzubi , Noah Provenzano , Jaydon Bingham , Weiyuan Chen , Tu Vu

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Most Large Language Model (LLM) agent memory systems rely on a small set of static, hand-designed operations for extracting memory. These fixed procedures hard-code human priors about what to store and how to revise memory, making them…

Computation and Language · Computer Science 2026-05-26 Haozhen Zhang , Quanyu Long , Jianzhu Bao , Tao Feng , Weizhi Zhang , Haodong Yue , Wenya Wang

EffiSkill: Agent Skill Based Automated Code Efficiency Optimization

Code efficiency is a fundamental aspect of software quality, yet how to harness large language models (LLMs) to optimize programs remains challenging. Prior approaches have sought for one-shot rewriting, retrieved exemplars, or prompt-based…

Software Engineering · Computer Science 2026-03-31 Zimu Wang , Yuling Shi , Mengfan Li , Zijun Liu , Jie M. Zhang , Chengcheng Wan , Xiaodong Gu

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Large Language Model (LLM) agents have shown stunning results in complex tasks, yet they often operate in isolation, failing to learn from past experiences. Existing memory-based methods primarily store raw trajectories, which are often…

Machine Learning · Computer Science 2026-02-10 Peng Xia , Jianwen Chen , Hanyang Wang , Jiaqi Liu , Kaide Zeng , Yu Wang , Siwei Han , Yiyang Zhou , Xujiang Zhao , Haifeng Chen , Zeyu Zheng , Cihang Xie , Huaxiu Yao

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Equipping Large Language Model (LLM) agents with domain-specific skills is critical for tackling complex tasks. Yet, manual authoring creates a severe scalability bottleneck. Conversely, automated skill generation often yields fragile or…

Artificial Intelligence · Computer Science 2026-04-28 Jingwei Ni , Yihao Liu , Xinpeng Liu , Yutao Sun , Mengyu Zhou , Pengyu Cheng , Dexin Wang , Erchao Zhao , Xiaoxi Jiang , Guanjun Jiang

SkillGrad: Optimizing Agent Skills Like Gradient Descent

Agent skills provide a lightweight way to adapt LLM agents to specialized domains by storing reusable procedural knowledge in structured files. However, whether downloaded from third parties or self-generated, these skills are often…

Artificial Intelligence · Computer Science 2026-05-28 Hanyu Wang , Yifan Lan , Bochuan Cao , Lu Lin , Jinghui Chen

MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation

Large language model (LLM) agents rely on reusable skills to solve complex tasks. However, existing skill creation approaches treat skills as isolated and static artifacts, limiting their reusability, reliability, and long-term improvement.…

Artificial Intelligence · Computer Science 2026-05-27 Huawei Lin , Peng Li , Jie Song , Fuxin Jiang , Tieying Zhang

MIND-Skill: Quality-Guaranteed Skill Generation via Multi-Agent Induction and Deduction

Large language model (LLM) powered AI agents have emerged as a promising paradigm for autonomous problem-solving, yet they continue to struggle with complex, multi-step real-world tasks that demand domain-specific procedural knowledge.…

Artificial Intelligence · Computer Science 2026-05-12 Yixuan Li , Mingshu Cai , Ziyang Xiao , Wanyuan Wang , Yanchen Deng , Bo An

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

A persistent skill library allows language model agents to reuse successful strategies across tasks. Maintaining such a library requires three coupled capabilities. The agent selects a relevant skill, utilizes it during execution, and…

Artificial Intelligence · Computer Science 2026-05-13 Yaorui Shi , Yuxin Chen , Zhengxi Lu , Yuchun Miao , Shugui Liu , Qi GU , Xunliang Cai , Xiang Wang , An Zhang

A Self-Improving Coding Agent

Recent advancements in Large Language Models (LLMs) have spurred interest in deploying LLM agents to undertake tasks in the world. LLMs are often deployed in agent systems: code that orchestrates LLM calls and provides them with tools. We…

Artificial Intelligence · Computer Science 2025-05-20 Maxime Robeyns , Martin Szummer , Laurence Aitchison

AutoSkill: Experience-Driven Lifelong Learning via Skill Self-Evolution

In practical LLM applications, users repeatedly express stable preferences and requirements, such as reducing hallucinations, following institutional writing conventions, or avoiding overly technical wording, yet such interaction experience…

Artificial Intelligence · Computer Science 2026-03-06 Yutao Yang , Junsong Li , Qianjun Pan , Bihao Zhan , Yuxuan Cai , Lin Du , Jie Zhou , Kai Chen , Qin Chen , Xin Li , Bo Zhang , Liang He

EmbodiSkill: Skill-Aware Reflection for Self-Evolving Embodied Agents

Embodied agents can benefit from skills that guide object search, action execution, and state changes across diverse environments. Since embodied environments vary across layouts, object states, and other execution factors, these skills…

Artificial Intelligence · Computer Science 2026-05-12 Ruofei Ju , Xinrui Wang , Xin Ding , Yifan Yang , Hao Wu , Shiqi Jiang , Qianxi Zhang , Hao Wen , Xiangyu Li , Weijun Wang , Kun Li , Yunxin Liu , Haipeng Dai , Wei Wang , Ting Cao

WebXSkill: Skill Learning for Autonomous Web Agents

Autonomous web agents powered by large language models (LLMs) have shown promise in completing complex browser tasks, yet they still struggle with long-horizon workflows. A key bottleneck is the grounding gap in existing skill formulations:…

Artificial Intelligence · Computer Science 2026-04-16 Zhaoyang Wang , Qianhui Wu , Xuchao Zhang , Chaoyun Zhang , Wenlin Yao , Fazle Elahi Faisal , Baolin Peng , Si Qin , Suman Nath , Qingwei Lin , Chetan Bansal , Dongmei Zhang , Saravan Rajmohan , Jianfeng Gao , Huaxiu Yao

SkillLearnBench: Benchmarking Continual Learning Methods for Agent Skill Generation on Real-World Tasks

Skills have become the de facto way to enable LLM agents to perform complex real-world tasks with customized instructions, workflows, and tools, but how to learn them automatically and effectively remains unclear. We introduce…

Computation and Language · Computer Science 2026-04-23 Shanshan Zhong , Yi Lu , Jingjie Ning , Yibing Wan , Lihan Feng , Yuyi Ao , Leonardo F. R. Ribeiro , Markus Dreyer , Sean Ammirati , Chenyan Xiong

Scaling Coding Agents via Atomic Skills

Current LLM coding agents are predominantly trained on composite benchmarks (e.g., bug fixing), which often leads to task-specific overfitting and limited generalization. To address this, we propose a novel scaling paradigm that shifts the…

Software Engineering · Computer Science 2026-04-28 Yingwei Ma , Yue Liu , Xinlong Yang , Yanhao Li , Kelin Fu , Yibo Miao , Yuchong Xie , Zhexu Wang , Shing-Chi Cheung

OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents

Skills, i.e., structured workflow instructions distilled for large language models (LLMs), are becoming an increasingly important mechanism for improving agent performance on real-world downstream tasks. However, as the open-source skill…

Computation and Language · Computer Science 2026-05-29 Jiahao Ying , Boxian Ai , Wei Tang , Siyuan Liu , Yixin Cao

CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation

Code generation aims to produce code that fulfills requirements written in natural languages automatically. Large language Models (LLMs) like ChatGPT have demonstrated promising effectiveness in this area. Nonetheless, these LLMs often fail…

Software Engineering · Computer Science 2025-01-15 Ruwei Pan , Hongyu Zhang , Chao Liu

Skill-R1: Agent Skill Evolution via Reinforcement Learning

Agentic large language models often rely on skills, reusable natural language procedures that guide planning, action, and tool use. In practice, skills are typically improved through prompt engineering or by aligning the task LLM itself,…

Machine Learning · Computer Science 2026-05-12 Yash Vishe , Rohan Surana , Xunyi Jiang , Zihan Huang , Xintong Li , Nikki Lijing Kuang , Tong Yu , Ryan A. Rossi , Jingbo Shang , Julian McAuley , Junda Wu

Experiential Co-Learning of Software-Developing Agents

Recent advancements in large language models (LLMs) have brought significant changes to various domains, especially through LLM-driven autonomous agents. A representative scenario is in software development, where LLM agents demonstrate…

Computation and Language · Computer Science 2024-06-06 Chen Qian , Yufan Dang , Jiahao Li , Wei Liu , Zihao Xie , Yifei Wang , Weize Chen , Cheng Yang , Xin Cong , Xiaoyin Che , Zhiyuan Liu , Maosong Sun