Related papers: Klear-AgentForge: Forging Agentic Intelligence thr…

Scaling Agents via Continual Pre-training

Large language models (LLMs) have evolved into agentic systems capable of autonomous tool use and multi-step reasoning for complex problem-solving. However, post-training approaches building upon general-purpose foundation models…

Computation and Language · Computer Science 2025-09-17 Liangcai Su , Zhen Zhang , Guangyu Li , Zhuo Chen , Chenxi Wang , Maojia Song , Xinyu Wang , Kuan Li , Jialong Wu , Xuanzhong Chen , Zile Qiao , Zhongwang Zhang , Huifeng Yin , Shihao Cai , Runnan Fang , Zhengwei Tao , Wenbiao Yin , Chenxiong Qian , Yong Jiang , Pengjun Xie , Fei Huang , Jingren Zhou

Towards General Agentic Intelligence via Environment Scaling

Advanced agentic intelligence is a prerequisite for deploying Large Language Models in practical, real-world applications. Diverse real-world APIs demand precise, robust function-calling intelligence, which needs agents to develop these…

Computation and Language · Computer Science 2025-09-17 Runnan Fang , Shihao Cai , Baixuan Li , Jialong Wu , Guangyu Li , Wenbiao Yin , Xinyu Wang , Xiaobin Wang , Liangcai Su , Zhen Zhang , Shibin Wu , Zhengwei Tao , Yong Jiang , Pengjun Xie , Fei Huang , Jingren Zhou

AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Recent advances in large language models (LLMs) have sparked growing interest in building generalist agents that can learn through online interactions. However, applying reinforcement learning (RL) to train LLM agents in multi-turn,…

Artificial Intelligence · Computer Science 2025-10-07 Hanchen Zhang , Xiao Liu , Bowen Lv , Xueqiao Sun , Bohao Jing , Iat Long Iong , Zhenyu Hou , Zehan Qi , Hanyu Lai , Yifan Xu , Rui Lu , Hongning Wang , Jie Tang , Yuxiao Dong

AgentFly: Extensible and Scalable Reinforcement Learning for LM Agents

Language model (LM) agents have gained significant attention for their ability to autonomously complete tasks through interactions with environments, tools, and APIs. LM agents are primarily built with prompt engineering or supervised…

Artificial Intelligence · Computer Science 2025-07-22 Renxi Wang , Rifo Ahmad Genadi , Bilal El Bouardi , Yongxin Wang , Fajri Koto , Zhengzhong Liu , Timothy Baldwin , Haonan Li

Adaptation of Agentic AI: A Survey of Post-Training, Memory, and Skills

Large language model (LLM) agents are moving beyond prompting alone. ChatGPT marked the rise of general-purpose LLM assistants, DeepSeek showed that on-policy reinforcement learning with verifiable rewards can improve reasoning and tool…

Artificial Intelligence · Computer Science 2026-03-10 Pengcheng Jiang , Jiacheng Lin , Zhiyi Shi , Zifeng Wang , Luxi He , Yichen Wu , Ming Zhong , Peiyang Song , Qizheng Zhang , Heng Wang , Xueqiang Xu , Hanwen Xu , Pengrui Han , Dylan Zhang , Jiashuo Sun , Chaoqi Yang , Kun Qian , Tian Wang , Changran Hu , Manling Li , Quanzheng Li , Hao Peng , Sheng Wang , Jingbo Shang , Chao Zhang , Jiaxuan You , Liyuan Liu , Pan Lu , Yu Zhang , Heng Ji , Yejin Choi , Dawn Song , Jimeng Sun , Jiawei Han

LiteResearcher: A Scalable Agentic RL Training Framework for Deep Research Agent

Reinforcement Learning (RL) has emerged as a powerful training paradigm for LLM-based agents. However, scaling agentic RL for deep research remains constrained by two coupled challenges: hand-crafted synthetic data fails to elicit genuine…

Artificial Intelligence · Computer Science 2026-04-23 Wanli Li , Bince Qu , Bo Pan , Jianyu Zhang , Zheng Liu , Pan Zhang , Wei Chen , Bo Zhang

KAT-Coder Technical Report

Recent advances in large language models (LLMs) have enabled progress in agentic coding, where models autonomously reason, plan, and act within interactive software development workflows. However, bridging the gap between static text-based…

Computation and Language · Computer Science 2025-11-03 Zizheng Zhan , Ken Deng , Jinghui Wang , Xiaojiang Zhang , Huaixi Tang , Minglei Zhang , Zhiyi Lai , Haoyang Huang , Wen Xiang , Kun Wu , Wenhao Zhuang , Shaojie Wang , Shangpeng Yan , Kepeng Lei , Zongxian Feng , Huiming Wang , Zheng Lin , Mengtong Li , Mengfei Xie , Yinghan Cui , Xuxing Chen , Chao Wang , Weihao Li , Wenqiang Zhu , Jiarong Zhang , Jingxuan Xu , Songwei Yu , Yifan Yao , Xinping Lei , C. Zhang , Han Li , Junqi Xiong , Zuchen Gao , Dailin Li , Haimo Li , Jiaheng Liu , Yuqun Zhang , Junyi Peng , Haotian Zhang , Bin Chen

ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering

The emergence of large language model (LLM)-based agents has significantly advanced the development of autonomous machine learning (ML) engineering. However, the dominant prompt-based paradigm exhibits limitations: smaller models lack the…

Computation and Language · Computer Science 2026-05-04 Zexi Liu , Jingyi Chai , Xinyu Zhu , Shuo Tang , Rui Ye , Bo Zhang , Lei Bai , Siheng Chen

AgentInstruct: Toward Generative Teaching with Agentic Flows

Synthetic data is becoming increasingly important for accelerating the development of language models, both large and small. Despite several successful use cases, researchers also raised concerns around model collapse and drawbacks of…

Artificial Intelligence · Computer Science 2024-07-08 Arindam Mitra , Luciano Del Corro , Guoqing Zheng , Shweti Mahajan , Dany Rouhana , Andres Codas , Yadong Lu , Wei-ge Chen , Olga Vrousgos , Corby Rosset , Fillipe Silva , Hamed Khanpour , Yash Lara , Ahmed Awadallah

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Recent advances in large language model (LLM) have empowered autonomous agents to perform multi-turn interactions with tools and environments. However, scaling such agent training is limited by the lack of diverse and reliable environments.…

Artificial Intelligence · Computer Science 2026-05-26 Zhaoyang Wang , Canwen Xu , Boyi Liu , Yite Wang , Siwei Han , Zhewei Yao , Huaxiu Yao , Yuxiong He

How to Train Your LLM Web Agent: A Statistical Diagnosis

LLM-based web agents have recently made significant progress, but much of it has occurred in closed-source systems, widening the gap with open-source alternatives. Progress has been held back by two key challenges: first, a narrow focus on…

Artificial Intelligence · Computer Science 2026-02-16 Dheeraj Vattikonda , Santhoshi Ravichandran , Emiliano Penaloza , Hadi Nekoei , Megh Thakkar , Thibault Le Sellier de Chezelles , Nicolas Gontier , Miguel Muñoz-Mármol , Sahar Omidi Shayegan , Stefania Raimondo , Xue Liu , Alexandre Drouin , Laurent Charlin , Alexandre Piché , Alexandre Lacoste , Massimo Caccia

A Lightweight Modular Framework for Constructing Autonomous Agents Driven by Large Language Models: Design, Implementation, and Applications in AgentForge

The emergence of LLMs has catalyzed a paradigm shift in autonomous agent development, enabling systems capable of reasoning, planning, and executing complex multi-step tasks. However, existing agent frameworks often suffer from…

Artificial Intelligence · Computer Science 2026-01-21 Akbar Anbar Jafari , Cagri Ozcinar , Gholamreza Anbarjafari

Scaling Agentic Capabilities, Not Context: Efficient Reinforcement Finetuning for Large Toolspaces

Agentic systems operating over large tool ecosystems must plan and execute long-horizon workflows under weak or non-verifiable supervision. While frontier models mitigate these challenges through scale and large context budgets, small…

Machine Learning · Computer Science 2026-03-10 Karan Gupta , Pranav Vajreshwari , Yash Pandya , Raghav Magazine , Akshay Nambi , Ahmed Awadallah

Agentic Reinforcement Learning for Real-World Code Repair

We tackle the challenge of training reliable code-fixing agents in real repositories, where complex builds and shifting dependencies make evaluation unstable. We developed a verifiable pipeline with success defined as post-fix build…

Machine Learning · Computer Science 2025-10-28 Siyu Zhu , Anastasiya Karpovich , Albert Chen , Jessica Koscheka , Shailesh Jannu , Di Wen , Yuqing Zhu , Rohit Jain , Alborz Geramifard

AgentFlux: Decoupled Fine-Tuning & Inference for On-Device Agentic Systems

The deployment of Large Language Models (LLMs) as agentic orchestrators has revolutionized task automation, but the need for privacy-preserving, cost-effective solutions demands on-device inference capabilities. However, local LLMs…

Artificial Intelligence · Computer Science 2025-11-13 Rohan Kadekodi , Zhan Jin , Keisuke Kamahori , Yile Gu , Sean Khatiri , Noah H. Bayindirli , Sergey Gorbunov , Baris Kasikci

SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding

Agentic repository-level code understanding is essential for automating complex software engineering tasks, yet the field lacks reliable benchmarks. Existing evaluations often overlook the long tail topics and rely on popular repositories…

Software Engineering · Computer Science 2026-03-18 Songcheng Cai , Zhiheng Lyu , Yuansheng Ni , Xiangchao Chen , Baichuan Zhou , Shenzhe Zhu , Yi Lu , Haozhe Wang , Chi Ruan , Benjamin Schneider , Weixu Zhang , Xiang Li , Andy Zheng , Yuyu Zhang , Ping Nie , Wenhu Chen

Training One Model to Master Cross-Level Agentic Actions via Reinforcement Learning

The paradigm of agentic AI is shifting from engineered complex workflows to post-training native models. However, existing agents are typically confined to static, predefined action spaces--such as exclusively using APIs, GUI events, or…

Machine Learning · Computer Science 2025-12-11 Kaichen He , Zihao Wang , Muyao Li , Anji Liu , Yitao Liang

Don't Just Fine-tune the Agent, Tune the Environment

Large Language Model (LLM) agents show great promise for complex, multi-turn tool-use tasks, but their development is often hampered by the extreme scarcity of high-quality training data. Supervised fine-tuning (SFT) on synthetic data leads…

Artificial Intelligence · Computer Science 2026-02-02 Siyuan Lu , Zechuan Wang , Hongxuan Zhang , Qintong Wu , Leilei Gan , Chenyi Zhuang , Jinjie Gu , Tao Lin

daVinci-Dev: Agent-native Mid-training for Software Engineering

Recently, the frontier of Large Language Model (LLM) capabilities has shifted from single-turn code generation to agentic software engineering-a paradigm where models autonomously navigate, edit, and test complex repositories. While…

Software Engineering · Computer Science 2026-01-28 Ji Zeng , Dayuan Fu , Tiantian Mi , Yumin Zhuang , Yaxing Huang , Xuefeng Li , Lyumanshan Ye , Muhang Xie , Qishuo Hua , Zhen Huang , Mohan Jiang , Hanning Wang , Jifan Lin , Yang Xiao , Jie Sun , Yunze Wu , Pengfei Liu

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Autonomous data science, from raw data sources to analyst-grade deep research reports, has been a long-standing challenge, and is now becoming feasible with the emergence of powerful large language models (LLMs). Recent workflow-based data…

Artificial Intelligence · Computer Science 2025-10-21 Shaolei Zhang , Ju Fan , Meihao Fan , Guoliang Li , Xiaoyong Du