Related papers: SpecRover: Code Intent Extraction via LLMs

AutoCodeRover: Autonomous Program Improvement

Researchers have made significant progress in automating the software development process in the past decades. Recent progress in Large Language Models (LLMs) has significantly impacted the development process, where developers can use…

Software Engineering · Computer Science 2024-07-26 Yuntong Zhang, Haifeng Ruan, Zhiyu Fan, Abhik Roychoudhury

SpecAgent: A Speculative Retrieval and Forecasting Agent for Code Completion

Large Language Models (LLMs) excel at code-related tasks but often struggle in realistic software repositories, where project-specific APIs and cross-file dependencies are crucial. Retrieval-augmented methods mitigate this by injecting…

Software Engineering · Computer Science 2026-04-22 George Ma, Anurag Koul, Qi Chen, Yawen Wu, Sachit Kuhar, Yu Yu, Aritra Sengupta, Varun Kumar, Murali Krishna Ramanathan

InspectCoder: Dynamic Analysis-Enabled Self Repair through interactive LLM-Debugger Collaboration

Large Language Models (LLMs) frequently generate buggy code with complex logic errors that are challenging to diagnose. While existing LLM-based self-repair approaches conduct intensive static semantic analysis or rely on superficial…

Software Engineering · Computer Science 2025-10-22 Yunkun Wang, Yue Zhang, Guochang Li, Chen Zhi, Binhua Li, Fei Huang, Yongbin Li, Shuiguang Deng

REFINE: Enhancing Program Repair Agents through Context-Aware Patch Refinement

Large Language Models (LLMs) have recently shown strong potential in automatic program repair (APR), especially in repository-level settings where the goal is to generate patches based on natural language issue descriptions, large…

Software Engineering · Computer Science 2025-10-07 Anvith Pabba, Simin Chen, Alex Mathai, Anindya Chakraborty, Baishakhi Ray

Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories

Large Language Model (LLM)-based agents are increasingly employed to automate complex software engineering tasks, such as program repair and issue resolution. These agents operate by autonomously generating natural language thoughts,…

Software Engineering · Computer Science 2025-10-09 Islem Bouzenia, Michael Pradel

Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents

In this paper, we introduce Auto-Intent, a method to adapt a pre-trained large language model (LLM) as an agent for a target domain without direct fine-tuning, where we empirically focus on web navigation tasks. Our approach first discovers…

Computation and Language · Computer Science 2024-10-31 Jaekyeom Kim, Dong-Ki Kim, Lajanugen Logeswaran, Sungryull Sohn, Honglak Lee

Agent-Driven Automatic Software Improvement

With software maintenance accounting for 50% of the cost of developing software, enhancing code quality and reliability has become more critical than ever. In response to this challenge, this doctoral research proposal aims to explore…

Software Engineering · Computer Science 2024-06-25 Fernando Vallecillos Ruiz

SpecBench: Evaluating Specification-Level Reasoning for Software Engineering LLM Agents

Software engineering (SWE) agents are transitioning from code generation to full software development lifecycle automation. A critical phase in this lifecycle is specification design: transforming initial proposals into carefully considered…

Multiagent Systems · Computer Science 2026-05-29 Grant Hamblin, Kevin Song, Zhanda Zhu, Anand Jayarajan, Sihang Liu, Nandita Vijaykumar, Gennady Pekhimenko

RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing

Code auditing is the process of reviewing code with the aim of identifying bugs. Large Language Models (LLMs) have demonstrated promising capabilities for this task without requiring compilation, while also supporting user-friendly…

Software Engineering · Computer Science 2025-06-02 Jinyao Guo, Chengpeng Wang, Xiangzhe Xu, Zian Su, Xiangyu Zhang

MORepair: Teaching LLMs to Repair Code via Multi-Objective Fine-tuning

Within the realm of software engineering, specialized tasks on code, such as program repair, present unique challenges, necessitating fine-tuning Large language models (LLMs) to unlock state-of-the-art performance. Fine-tuning approaches…

Software Engineering · Computer Science 2025-09-23 Boyang Yang, Haoye Tian, Jiadong Ren, Hongyu Zhang, Jacques Klein, Tegawendé F. Bissyandé, Claire Le Goues, Shunfu Jin

A Self-Improving Coding Agent

Recent advancements in Large Language Models (LLMs) have spurred interest in deploying LLM agents to undertake tasks in the world. LLMs are often deployed in agent systems: code that orchestrates LLM calls and provides them with tools. We…

Artificial Intelligence · Computer Science 2025-05-20 Maxime Robeyns, Martin Szummer, Laurence Aitchison

SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning

Recent advances in inference-time compute have significantly improved performance on complex tasks by generating long chains of thought (CoTs) using Large Reasoning Models (LRMs). However, this improved accuracy comes at the cost of high…

Machine Learning · Computer Science 2025-05-20 Rui Pan, Yinwei Dai, Zhihao Zhang, Gabriele Oliaro, Zhihao Jia, Ravi Netravali

Assured Automatic Programming via Large Language Models

With the advent of AI-based coding engines, it is possible to convert natural language requirements to executable code in standard programming languages. However, AI-generated code can be unreliable, and the natural language requirements…

Software Engineering · Computer Science 2024-11-06 Martin Mirchev, Andreea Costea, Abhishek Kr Singh, Abhik Roychoudhury

ConceptCoder: Improve Code Reasoning via Concept Learning

Large language models (LLMs) have shown promising results for software engineering applications, but still struggle with code reasoning tasks such as vulnerability detection (VD). We introduce ConceptCoder, a fine-tuning method that…

Software Engineering · Computer Science 2026-03-25 Md Mahbubur Rahman, Hengbo Tong, Wei Le

SpecTra: Enhancing the Code Translation Ability of Language Models by Generating Multi-Modal Specifications

Large language models (LLMs) are increasingly being used for the task of automated code translation, which has important real-world applications. However, most existing approaches use only the source code of a program as an input to an LLM,…

Software Engineering · Computer Science 2025-12-08 Vikram Nitin, Rahul Krishna, Baishakhi Ray

Intention is All You Need: Refining Your Code from Your Intention

Code refinement aims to enhance existing code by addressing issues, refactoring, and optimizing to improve quality and meet specific requirements. As software projects scale in size and complexity, the traditional iterative exchange between…

Software Engineering · Computer Science 2025-02-13 Qi Guo, Xiaofei Xie, Shangqing Liu, Ming Hu, Xiaohong Li, Lei Bu

SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution

Large Language Models (LLMs) have demonstrated remarkable proficiency across a variety of complex tasks. One significant application of LLMs is in tackling software engineering challenges, particularly in resolving real-world tasks on…

Computation and Language · Computer Science 2025-05-08 Chengxing Xie, Bowen Li, Chang Gao, He Du, Wai Lam, Difan Zou, Kai Chen

Spec2RTL-Agent: Automated Hardware Code Generation from Complex Specifications Using LLM Agent Systems

Despite recent progress in generating hardware RTL code with LLMs, existing solutions still suffer from a substantial gap between practical application scenarios and the requirements of real-world RTL code development. Prior approaches…

Hardware Architecture · Computer Science 2025-09-10 Zhongzhi Yu, Mingjie Liu, Michael Zimmer, Yingyan Celine Lin, Yong Liu, Haoxing Ren

MarsCode Agent: AI-native Automated Bug Fixing

Recent advances in large language models (LLMs) have shown significant potential to automate various software development tasks, including code completion, test generation, and bug fixing. However, the application of LLMs for automated bug…

Software Engineering · Computer Science 2024-09-05 Yizhou Liu, Pengfei Gao, Xinchen Wang, Jie Liu, Yexuan Shi, Zhao Zhang, Chao Peng

AI-powered Code Review with LLMs: Early Results

In this paper, we present a novel approach to improving software quality and efficiency through a Large Language Model (LLM)-based model designed to review code and identify potential issues. Our proposed LLM-based AI agent model is trained…

Software Engineering · Computer Science 2025-12-11 Zeeshan Rasheed, Malik Abdul Sami, Muhammad Waseem, Kai-Kristian Kemell, Xiaofeng Wang, Anh Nguyen, Kari Systä, Pekka Abrahamsson