Related papers: SeaView: Software Engineering Agent Visual Interfa…

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

Language model (LM) agents are increasingly being used to automate complicated tasks in digital environments. Just as humans benefit from powerful software applications, such as integrated development environments, for complex tasks like…

Software Engineering · Computer Science 2024-11-13 John Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu Yao, Karthik Narasimhan, Ofir Press

SWE-Adept: An LLM-Based Agentic Framework for Deep Codebase Analysis and Structured Issue Resolution

Large language models (LLMs) exhibit strong performance on self-contained programming tasks. However, they still struggle with repository-level software engineering (SWE), which demands (1) deep codebase navigation with effective context…

Software Engineering · Computer Science 2026-05-27 Kang He, Kaushik Roy

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

LLM-based agents have shown promising capabilities in a growing range of software engineering (SWE) tasks. However, advancing this field faces two critical challenges. First, high-quality training data is scarce, especially data that…

Software Engineering · Computer Science 2025-11-05 Ibragim Badertdinov, Alexander Golubev, Maksim Nekrashevich, Anton Shevtsov, Simon Karasik, Andrei Andriushchenko, Maria Trofimova, Daria Litvintseva, Boris Yangel

SE-Agent: Self-Evolution Trajectory Optimization in Multi-Step Reasoning with LLM-Based Agents

Large Language Model (LLM)-based agents have recently shown impressive capabilities in complex reasoning and tool use via multi-step interactions with their environments. While these agents have the potential to tackle complicated tasks,…

Artificial Intelligence · Computer Science 2025-11-04 Jiaye Lin, Yifu Guo, Yuzhen Han, Sen Hu, Ziyi Ni, Licheng Wang, Mingguang Chen, Hongzhang Liu, Ronghao Chen, Yangfan He, Daxin Jiang, Binxing Jiao, Chen Hu, Huacan Wang

SEW: Self-Evolving Agentic Workflows for Automated Code Generation

Large Language Models (LLMs) have demonstrated effectiveness in code generation tasks. To enable LLMs to address more complex coding challenges, existing research has focused on crafting multi-agent systems with agentic workflows, where…

Software Engineering · Computer Science 2026-04-15 Siwei Liu, Jinyuan Fang, Han Zhou, Yingxu Wang, Zaiqiao Meng

Can Agents Fix Agent Issues?

LLM-based agent systems are emerging as a new software paradigm and have been widely adopted across diverse domains such as medicine, robotics, and programming. However, maintaining these systems requires substantial effort, as they are…

Artificial Intelligence · Computer Science 2025-10-27 Alfin Wijaya Rahardja, Junwei Liu, Weitong Chen, Zhenpeng Chen, Yiling Lou

SWE-Dev: Building Software Engineering Agents with Training and Inference Scaling

Large language models (LLMs) have advanced rapidly from conversational problem solving to addressing real-world tasks involving tool use, such as software engineering (SWE). Recent LLM-powered toolkits, such as OpenAI Codex and Cursor, have…

Artificial Intelligence · Computer Science 2025-06-24 Haoran Wang, Zhenyu Hou, Yao Wei, Jie Tang, Yuxiao Dong

Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?

Large Language Models (LLMs) are reshaping almost all industries, including software engineering. In recent years, a number of LLM agents have been proposed to solve real-world software problems. Such software agents are typically equipped…

Software Engineering · Computer Science 2025-11-25 Chunqiu Steven Xia, Zhe Wang, Yan Yang, Yuxiang Wei, Lingming Zhang

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Repurposing large vision-language models (LVLMs) as computer use agents (CUAs) has led to substantial breakthroughs, primarily driven by human-labeled data. However, these models often struggle with novel and specialized software,…

Artificial Intelligence · Computer Science 2025-08-13 Zeyi Sun, Ziyu Liu, Yuhang Zang, Yuhang Cao, Xiaoyi Dong, Tong Wu, Dahua Lin, Jiaqi Wang

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Issue resolution, a complex Software Engineering (SWE) task integral to real-world development, has emerged as a compelling challenge for artificial intelligence. The establishment of benchmarks like SWE-bench revealed this task as…

Software Engineering · Computer Science 2026-01-21 Caihua Li, Lianghong Guo, Yanlin Wang, Daya Guo, Wei Tao, Zhenyu Shan, Mingwei Liu, Jiachi Chen, Haoyu Song, Duyu Tang, Hongyu Zhang, Zibin Zheng

Immersion in the GitHub Universe: Scaling Coding Agents to Mastery

Achieving mastery in real world software engineering tasks is fundamentally bottlenecked by the scarcity of large scale, high quality training data. Scaling such data has been limited by the complexity of environment setup, unit test…

Software Engineering · Computer Science 2026-03-25 Jiale Zhao, Guoxin Chen, Fanzhe Meng, Minghao Li, Jie Chen, Hui Xu, Yongshuai Sun, Wayne Xin Zhao, Ruihua Song, Yuan Zhang, Peng Wang, Cheng Chen, Jirong Wen, Kai Jia

SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents

Test-time scaling has been widely adopted to enhance the capabilities of Large Language Model (LLM) agents in software engineering (SWE) tasks. However, the standard approach of repeatedly sampling trajectories from scratch is…

Software Engineering · Computer Science 2026-02-06 Yifeng Ding, Lingming Zhang

SWE-TRACE: Optimizing Long-Horizon SWE Agents Through Rubric Process Reward Models and Heuristic Test-Time Scaling

Resolving real-world software engineering (SWE) issues with autonomous agents requires complex, long-horizon reasoning. Current pipelines are bottlenecked by unoptimized demonstration data, sparse execution rewards, and computationally…

Software Engineering · Computer Science 2026-04-17 Hao Han, Jin Xie, Xuehao Ma, Weiquan Zhu, Ziyao Zhang, ZhiLiang Long, Hongkai Chen, Qingwen Ye

SWE-Exp: Experience-Driven Software Issue Resolution

Recent advances in large language model (LLM) agents have shown remarkable progress in software issue resolution, leveraging advanced techniques such as multi-agent collaboration and Monte Carlo Tree Search (MCTS). However, current agents…

Software Engineering · Computer Science 2026-02-03 Silin Chen, Shaoxin Lin, Yuling Shi, Heng Lian, Xiaodong Gu, Longfei Yun, Dong Chen, Lin Cao, Jiyang Liu, Nu Xia, Qianxiang Wang

Process-Centric Analysis of Agentic Software Systems

Agentic systems are modern software systems: they consist of orchestrated modules, expose interfaces, and are deployed in software pipelines. Unlike conventional programs, their execution, i.e., trajectories, is inherently stochastic and…

Software Engineering · Computer Science 2026-04-14 Shuyang Liu, Yang Chen, Rahul Krishna, Saurabh Sinha, Jatin Ganhotra, Reyhan Jabbarvand

Agentless: Demystifying LLM-based Software Engineering Agents

Recent advancements in large language models (LLMs) have significantly advanced the automation of software development tasks, including code synthesis, program repair, and test generation. More recently, researchers and industry…

Software Engineering · Computer Science 2024-10-30 Chunqiu Steven Xia, Yinlin Deng, Soren Dunn, Lingming Zhang

SEAlign: Alignment Training for Software Engineering Agent

Recent advances in code generation models have demonstrated impressive capabilities in automating software development tasks, yet these models still struggle in real-world software engineering scenarios. Although current training methods,…

Software Engineering · Computer Science 2025-03-25 Kechi Zhang, Huangzhao Zhang, Ge Li, Jinliang You, Jia Li, Yunfei Zhao, Zhi Jin

SWE-Effi: Re-Evaluating Software AI Agent System Effectiveness Under Resource Constraints

The advancement of large language models (LLMs) and code agents has demonstrated significant potential to assist software engineering (SWE) tasks, such as autonomous issue resolution and feature addition. Existing AI for software…

Software Engineering · Computer Science 2025-09-22 Zhiyu Fan, Kirill Vasilevski, Dayi Lin, Boyuan Chen, Yihao Chen, Zhiqing Zhong, Jie M. Zhang, Pinjia He, Ahmed E. Hassan

Why AI Agents Still Need You: Findings from Developer-Agent Collaborations in the Wild

Software Engineering Agents (SWE agents) can autonomously perform development tasks on benchmarks like SWE Bench, but still face challenges when tackling complex and ambiguous real-world tasks. Consequently, SWE agents are often designed to…

Software Engineering · Computer Science 2025-10-13 Aayush Kumar, Yasharth Bajpai, Sumit Gulwani, Gustavo Soares, Emerson Murphy-Hill

Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs

Software engineering (SWE) has recently emerged as a crucial testbed for next-generation LLM agents, demanding inherent capabilities in two critical dimensions: sustained iterative problem-solving (e.g., >50 interaction rounds) and…

Artificial Intelligence · Computer Science 2025-06-25 Liang Zeng, Yongcong Li, Yuzhen Xiao, Changshi Li, Chris Yuhao Liu, Rui Yan, Tianwen Wei, Jujie He, Xuchen Song, Yang Liu, Yahui Zhou