English

HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale

Software Engineering 2025-09-08 v3 Artificial Intelligence

Abstract

Large Language Models (LLMs) have revolutionized software engineering (SE), showcasing remarkable proficiency in various coding tasks. Despite recent advancements that have enabled the creation of autonomous software agents utilizing LLMs for end-to-end development tasks, these systems are typically designed for specific SE functions. We introduce HyperAgent, an innovative generalist multi-agent system designed to tackle a wide range of SE tasks across different programming languages by mimicking the workflows of human developers. HyperAgent features four specialized agents-Planner, Navigator, Code Editor, and Executor-capable of handling the entire lifecycle of SE tasks, from initial planning to final verification. HyperAgent sets new benchmarks in diverse SE tasks, including GitHub issue resolution on the renowned SWE-Bench benchmark, outperforming robust baselines. Furthermore, HyperAgent demonstrates exceptional performance in repository-level code generation (RepoExec) and fault localization and program repair (Defects4J), often surpassing state-of-the-art baselines.

Keywords

Cite

@article{arxiv.2409.16299,
  title  = {HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale},
  author = {Huy Nhat Phan and Tien N. Nguyen and Phong X. Nguyen and Nghi D. Q. Bui},
  journal= {arXiv preprint arXiv:2409.16299},
  year   = {2025}
}

Comments

49 pages

R2 v1 2026-06-28T18:55:37.316Z