English
Related papers

Related papers: CODESKILL: Learning Self-Evolving Skills for Codin…

200 papers

Anthropic proposes the concept of skills for LLM agents to tackle multi-step professional tasks that simple tool invocations cannot address. A tool is a single, self-contained function, whereas a skill is a structured bundle of…

Coding agents are increasingly used as general-purpose problem solvers, but their flexibility does not by itself confer the domain expertise needed for specialized tasks. Recent work addresses this through \textit{agent skills}: reusable…

Artificial Intelligence · Computer Science 2026-03-04 Salaheddin Alzubi , Noah Provenzano , Jaydon Bingham , Weiyuan Chen , Tu Vu

Most Large Language Model (LLM) agent memory systems rely on a small set of static, hand-designed operations for extracting memory. These fixed procedures hard-code human priors about what to store and how to revise memory, making them…

Computation and Language · Computer Science 2026-05-26 Haozhen Zhang , Quanyu Long , Jianzhu Bao , Tao Feng , Weizhi Zhang , Haodong Yue , Wenya Wang

Code efficiency is a fundamental aspect of software quality, yet how to harness large language models (LLMs) to optimize programs remains challenging. Prior approaches have sought for one-shot rewriting, retrieved exemplars, or prompt-based…

Software Engineering · Computer Science 2026-03-31 Zimu Wang , Yuling Shi , Mengfan Li , Zijun Liu , Jie M. Zhang , Chengcheng Wan , Xiaodong Gu

Large Language Model (LLM) agents have shown stunning results in complex tasks, yet they often operate in isolation, failing to learn from past experiences. Existing memory-based methods primarily store raw trajectories, which are often…

Equipping Large Language Model (LLM) agents with domain-specific skills is critical for tackling complex tasks. Yet, manual authoring creates a severe scalability bottleneck. Conversely, automated skill generation often yields fragile or…

Artificial Intelligence · Computer Science 2026-04-28 Jingwei Ni , Yihao Liu , Xinpeng Liu , Yutao Sun , Mengyu Zhou , Pengyu Cheng , Dexin Wang , Erchao Zhao , Xiaoxi Jiang , Guanjun Jiang

Agent skills provide a lightweight way to adapt LLM agents to specialized domains by storing reusable procedural knowledge in structured files. However, whether downloaded from third parties or self-generated, these skills are often…

Artificial Intelligence · Computer Science 2026-05-28 Hanyu Wang , Yifan Lan , Bochuan Cao , Lu Lin , Jinghui Chen

Large language model (LLM) agents rely on reusable skills to solve complex tasks. However, existing skill creation approaches treat skills as isolated and static artifacts, limiting their reusability, reliability, and long-term improvement.…

Artificial Intelligence · Computer Science 2026-05-27 Huawei Lin , Peng Li , Jie Song , Fuxin Jiang , Tieying Zhang

Large language model (LLM) powered AI agents have emerged as a promising paradigm for autonomous problem-solving, yet they continue to struggle with complex, multi-step real-world tasks that demand domain-specific procedural knowledge.…

Artificial Intelligence · Computer Science 2026-05-12 Yixuan Li , Mingshu Cai , Ziyang Xiao , Wanyuan Wang , Yanchen Deng , Bo An

A persistent skill library allows language model agents to reuse successful strategies across tasks. Maintaining such a library requires three coupled capabilities. The agent selects a relevant skill, utilizes it during execution, and…

Artificial Intelligence · Computer Science 2026-05-13 Yaorui Shi , Yuxin Chen , Zhengxi Lu , Yuchun Miao , Shugui Liu , Qi GU , Xunliang Cai , Xiang Wang , An Zhang

Recent advancements in Large Language Models (LLMs) have spurred interest in deploying LLM agents to undertake tasks in the world. LLMs are often deployed in agent systems: code that orchestrates LLM calls and provides them with tools. We…

Artificial Intelligence · Computer Science 2025-05-20 Maxime Robeyns , Martin Szummer , Laurence Aitchison

In practical LLM applications, users repeatedly express stable preferences and requirements, such as reducing hallucinations, following institutional writing conventions, or avoiding overly technical wording, yet such interaction experience…

Artificial Intelligence · Computer Science 2026-03-06 Yutao Yang , Junsong Li , Qianjun Pan , Bihao Zhan , Yuxuan Cai , Lin Du , Jie Zhou , Kai Chen , Qin Chen , Xin Li , Bo Zhang , Liang He

Embodied agents can benefit from skills that guide object search, action execution, and state changes across diverse environments. Since embodied environments vary across layouts, object states, and other execution factors, these skills…

Artificial Intelligence · Computer Science 2026-05-12 Ruofei Ju , Xinrui Wang , Xin Ding , Yifan Yang , Hao Wu , Shiqi Jiang , Qianxi Zhang , Hao Wen , Xiangyu Li , Weijun Wang , Kun Li , Yunxin Liu , Haipeng Dai , Wei Wang , Ting Cao

Autonomous web agents powered by large language models (LLMs) have shown promise in completing complex browser tasks, yet they still struggle with long-horizon workflows. A key bottleneck is the grounding gap in existing skill formulations:…

Skills have become the de facto way to enable LLM agents to perform complex real-world tasks with customized instructions, workflows, and tools, but how to learn them automatically and effectively remains unclear. We introduce…

Computation and Language · Computer Science 2026-04-23 Shanshan Zhong , Yi Lu , Jingjie Ning , Yibing Wan , Lihan Feng , Yuyi Ao , Leonardo F. R. Ribeiro , Markus Dreyer , Sean Ammirati , Chenyan Xiong

Current LLM coding agents are predominantly trained on composite benchmarks (e.g., bug fixing), which often leads to task-specific overfitting and limited generalization. To address this, we propose a novel scaling paradigm that shifts the…

Software Engineering · Computer Science 2026-04-28 Yingwei Ma , Yue Liu , Xinlong Yang , Yanhao Li , Kelin Fu , Yibo Miao , Yuchong Xie , Zhexu Wang , Shing-Chi Cheung

Skills, i.e., structured workflow instructions distilled for large language models (LLMs), are becoming an increasingly important mechanism for improving agent performance on real-world downstream tasks. However, as the open-source skill…

Computation and Language · Computer Science 2026-05-29 Jiahao Ying , Boxian Ai , Wei Tang , Siyuan Liu , Yixin Cao

Code generation aims to produce code that fulfills requirements written in natural languages automatically. Large language Models (LLMs) like ChatGPT have demonstrated promising effectiveness in this area. Nonetheless, these LLMs often fail…

Software Engineering · Computer Science 2025-01-15 Ruwei Pan , Hongyu Zhang , Chao Liu

Agentic large language models often rely on skills, reusable natural language procedures that guide planning, action, and tool use. In practice, skills are typically improved through prompt engineering or by aligning the task LLM itself,…

Recent advancements in large language models (LLMs) have brought significant changes to various domains, especially through LLM-driven autonomous agents. A representative scenario is in software development, where LLM agents demonstrate…

Computation and Language · Computer Science 2024-06-06 Chen Qian , Yufan Dang , Jiahao Li , Wei Liu , Zihao Xie , Yifei Wang , Weize Chen , Cheng Yang , Xin Cong , Xiaoyin Che , Zhiyuan Liu , Maosong Sun
‹ Prev 1 2 3 10 Next ›