
Related papers: SeaView: Software Engineering Agent Visual Interfa…

200 papers

Language model (LM) agents are increasingly being used to automate complicated tasks in digital environments. Just as humans benefit from powerful software applications, such as integrated development environments, for complex tasks like…

Software Engineering · Computer Science 2024-11-13 John Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu Yao, Karthik Narasimhan, Ofir Press

Large language models (LLMs) exhibit strong performance on self-contained programming tasks. However, they still struggle with repository-level software engineering (SWE), which demands (1) deep codebase navigation with effective context…

Software Engineering · Computer Science 2026-05-27 Kang He, Kaushik Roy

LLM-based agents have shown promising capabilities in a growing range of software engineering (SWE) tasks. However, advancing this field faces two critical challenges. First, high-quality training data is scarce, especially data that…

Large Language Model (LLM)-based agents have recently shown impressive capabilities in complex reasoning and tool use via multi-step interactions with their environments. While these agents have the potential to tackle complicated tasks,…

Artificial Intelligence · Computer Science 2025-11-04 Jiaye Lin, Yifu Guo, Yuzhen Han, Sen Hu, Ziyi Ni, Licheng Wang, Mingguang Chen, Hongzhang Liu, Ronghao Chen, Yangfan He, Daxin Jiang, Binxing Jiao, Chen Hu, Huacan Wang

Large Language Models (LLMs) have demonstrated effectiveness in code generation tasks. To enable LLMs to address more complex coding challenges, existing research has focused on crafting multi-agent systems with agentic workflows, where…

Software Engineering · Computer Science 2026-04-15 Siwei Liu, Jinyuan Fang, Han Zhou, Yingxu Wang, Zaiqiao Meng

LLM-based agent systems are emerging as a new software paradigm and have been widely adopted across diverse domains such as medicine, robotics, and programming. However, maintaining these systems requires substantial effort, as they are…

Artificial Intelligence · Computer Science 2025-10-27 Alfin Wijaya Rahardja, Junwei Liu, Weitong Chen, Zhenpeng Chen, Yiling Lou

Large language models (LLMs) have advanced rapidly from conversational problem solving to addressing real-world tasks involving tool use, such as software engineering (SWE). Recent LLM-powered toolkits, such as OpenAI Codex and Cursor, have…

Artificial Intelligence · Computer Science 2025-06-24 Haoran Wang, Zhenyu Hou, Yao Wei, Jie Tang, Yuxiao Dong

Large Language Models (LLMs) are reshaping almost all industries, including software engineering. In recent years, a number of LLM agents have been proposed to solve real-world software problems. Such software agents are typically equipped…

Software Engineering · Computer Science 2025-11-25 Chunqiu Steven Xia, Zhe Wang, Yan Yang, Yuxiang Wei, Lingming Zhang

Repurposing large vision-language models (LVLMs) as computer use agents (CUAs) has led to substantial breakthroughs, primarily driven by human-labeled data. However, these models often struggle with novel and specialized software,…

Artificial Intelligence · Computer Science 2025-08-13 Zeyi Sun, Ziyu Liu, Yuhang Zang, Yuhang Cao, Xiaoyi Dong, Tong Wu, Dahua Lin, Jiaqi Wang

Issue resolution, a complex Software Engineering (SWE) task integral to real-world development, has emerged as a compelling challenge for artificial intelligence. The establishment of benchmarks like SWE-bench revealed this task as…

Software Engineering · Computer Science 2026-01-21 Caihua Li, Lianghong Guo, Yanlin Wang, Daya Guo, Wei Tao, Zhenyu Shan, Mingwei Liu, Jiachi Chen, Haoyu Song, Duyu Tang, Hongyu Zhang, Zibin Zheng

Achieving mastery in real world software engineering tasks is fundamentally bottlenecked by the scarcity of large scale, high quality training data. Scaling such data has been limited by the complexity of environment setup, unit test…

Test-time scaling has been widely adopted to enhance the capabilities of Large Language Model (LLM) agents in software engineering (SWE) tasks. However, the standard approach of repeatedly sampling trajectories from scratch is…

Software Engineering · Computer Science 2026-02-06 Yifeng Ding, Lingming Zhang

Resolving real-world software engineering (SWE) issues with autonomous agents requires complex, long-horizon reasoning. Current pipelines are bottlenecked by unoptimized demonstration data, sparse execution rewards, and computationally…

Software Engineering · Computer Science 2026-04-17 Hao Han, Jin Xie, Xuehao Ma, Weiquan Zhu, Ziyao Zhang, ZhiLiang Long, Hongkai Chen, Qingwen Ye

Recent advances in large language model (LLM) agents have shown remarkable progress in software issue resolution, leveraging advanced techniques such as multi-agent collaboration and Monte Carlo Tree Search (MCTS). However, current agents…

Software Engineering · Computer Science 2026-02-03 Silin Chen, Shaoxin Lin, Yuling Shi, Heng Lian, Xiaodong Gu, Longfei Yun, Dong Chen, Lin Cao, Jiyang Liu, Nu Xia, Qianxiang Wang

Agentic systems are modern software systems: they consist of orchestrated modules, expose interfaces, and are deployed in software pipelines. Unlike conventional programs, their execution, i.e., trajectories, is inherently stochastic and…

Software Engineering · Computer Science 2026-04-14 Shuyang Liu, Yang Chen, Rahul Krishna, Saurabh Sinha, Jatin Ganhotra, Reyhan Jabbarvand

Recent advancements in large language models (LLMs) have significantly advanced the automation of software development tasks, including code synthesis, program repair, and test generation. More recently, researchers and industry…

Software Engineering · Computer Science 2024-10-30 Chunqiu Steven Xia, Yinlin Deng, Soren Dunn, Lingming Zhang

Recent advances in code generation models have demonstrated impressive capabilities in automating software development tasks, yet these models still struggle in real-world software engineering scenarios. Although current training methods,…

Software Engineering · Computer Science 2025-03-25 Kechi Zhang, Huangzhao Zhang, Ge Li, Jinliang You, Jia Li, Yunfei Zhao, Zhi Jin

The advancement of large language models (LLMs) and code agents has demonstrated significant potential to assist software engineering (SWE) tasks, such as autonomous issue resolution and feature addition. Existing AI for software…

Software Engineering · Computer Science 2025-09-22 Zhiyu Fan, Kirill Vasilevski, Dayi Lin, Boyuan Chen, Yihao Chen, Zhiqing Zhong, Jie M. Zhang, Pinjia He, Ahmed E. Hassan

Software Engineering Agents (SWE agents) can autonomously perform development tasks on benchmarks like SWE Bench, but still face challenges when tackling complex and ambiguous real-world tasks. Consequently, SWE agents are often designed to…

Software Engineering · Computer Science 2025-10-13 Aayush Kumar, Yasharth Bajpai, Sumit Gulwani, Gustavo Soares, Emerson Murphy-Hill

Software engineering (SWE) has recently emerged as a crucial testbed for next-generation LLM agents, demanding inherent capabilities in two critical dimensions: sustained iterative problem-solving (e.g., >50 interaction rounds) and…

Artificial Intelligence · Computer Science 2025-06-25 Liang Zeng, Yongcong Li, Yuzhen Xiao, Changshi Li, Chris Yuhao Liu, Rui Yan, Tianwen Wei, Jujie He, Xuchen Song, Yang Liu, Yahui Zhou