Related papers: Code Review Agent Benchmark

Rethinking Code Review in the Age of AI: A Vision for Agentic Code Review

Code review has evolved for decades, from informal peer checking to today's pull request (PR) workflows, yet it remains a largely manual, uneven, and cognitively demanding process. The rise of Artificial Intelligence (AI) coding assistants…

Software Engineering · Computer Science 2026-05-19 Hüseyin Özgür Kamalı , Erdem Tuna , Vahid Haratian , Eray Tüzün

Human-AI Synergy in Agentic Code Review

Code review is a critical software engineering practice where developers review code changes before integration to ensure code quality, detect defects, and improve maintainability. In recent years, AI agents that can understand code…

Software Engineering · Computer Science 2026-03-18 Suzhen Zhong , Shayan Noei , Ying Zou , Bram Adams

CodeAgent: Autonomous Communicative Agents for Code Review

Code review, which aims at ensuring the overall quality and reliability of software, is a cornerstone of software development. Unfortunately, while crucial, Code review is a labor-intensive process that the research community is looking to…

Software Engineering · Computer Science 2024-09-26 Xunzhu Tang , Kisub Kim , Yewei Song , Cedric Lothritz , Bei Li , Saad Ezzini , Haoye Tian , Jacques Klein , Tegawende F. Bissyande

CR-Bench: Evaluating the Real-World Utility of AI Code Review Agents

Recent advances in frontier large language models have enabled code review agents that operate in open-ended, reasoning-intensive settings. However, the lack of standardized benchmarks and granular evaluation protocols makes it difficult to…

Software Engineering · Computer Science 2026-03-13 Kristen Pereira , Neelabh Sinha , Rajat Ghosh , Debojyoti Dutta

How AI Coding Agents Communicate: A Study of Pull Request Description Characteristics and Human Review Responses

The rapid adoption of large language models has led to the emergence of AI coding agents that autonomously create pull requests on GitHub. However, how these agents differ in their pull request description characteristics, and how human…

Artificial Intelligence · Computer Science 2026-02-20 Kan Watanabe , Rikuto Tsuchida , Takahiro Monno , Bin Huang , Kazuma Yamasaki , Youmei Fan , Kazumasa Shimari , Kenichi Matsumoto

A Roadmap on Modern Code Review: Challenges and Opportunities

Over the past decade, modern code review (MCR) has been established as a cornerstone of software quality assurance and a vital channel for knowledge transfer within development teams. However, the manual inspection of increasingly complex…

Software Engineering · Computer Science 2026-03-03 Zezhou Yang , Cuiyun Gao , Zhaoqiang Guo , Zhenhao Li , Kui Liu , Xin Xia , Yuming Zhou

Code Review as Decision-Making -- Building a Cognitive Model from the Questions Asked During Code Review

Code review is a well-established and valued practice in the software engineering community contributing to both code quality and interpersonal benefits. However, there are challenges in both tools and processes that give rise to…

Software Engineering · Computer Science 2025-07-15 Lo Gullstrand Heander , Emma Söderberg , Christofer Rydenfält

Automating Code Review: A Systematic Literature Review

Code Review consists in assessing the code written by teammates with the goal of increasing code quality. Empirical studies documented the benefits brought by such a practice that, however, has its cost to pay in terms of developers' time.…

Software Engineering · Computer Science 2025-03-13 Rosalia Tufano , Gabriele Bavota

From Industry Claims to Empirical Reality: An Empirical Study of Code Review Agents in Pull Requests

Autonomous coding agents are generating code at an unprecedented scale, with OpenAI Codex alone creating over 400,000 pull requests (PRs) in two months. As agentic PR volumes increase, code review agents (CRAs) have become routine…

Software Engineering · Computer Science 2026-04-06 Kowshik Chowdhury , Dipayan Banik , K M Ferdous , Shazibul Islam Shamim

Automating Code Review Activities by Large-Scale Pre-training

Code review is an essential part to software development lifecycle since it aims at guaranteeing the quality of codes. Modern code review activities necessitate developers viewing, understanding and even running the programs to assess…

Software Engineering · Computer Science 2022-10-12 Zhiyu Li , Shuai Lu , Daya Guo , Nan Duan , Shailesh Jannu , Grant Jenks , Deep Majumder , Jared Green , Alexey Svyatkovskiy , Shengyu Fu , Neel Sundaresan

Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time

The rise of large language models for code has reshaped software development. Autonomous coding agents, able to create branches, open pull requests, and perform code reviews, now actively contribute to real-world projects. Their growing…

Software Engineering · Computer Science 2026-04-02 Razvan Mihai Popescu , David Gros , Andrei Botocan , Rahul Pandita , Prem Devanbu , Maliheh Izadi

ReviewRanker: A Semi-Supervised Learning Based Approach for Code Review Quality Estimation

Code review is considered a key process in the software industry for minimizing bugs and improving code quality. Inspection of review process effectiveness and continuous improvement can boost development productivity. Such inspection is a…

Software Engineering · Computer Science 2023-07-11 Saifullah Mahbub , Md. Easin Arafat , Chowdhury Rafeed Rahman , Zannatul Ferdows , Masum Hasan

When AI Teammates Meet Code Review: Collaboration Signals Shaping the Integration of Agent-Authored Pull Requests

Autonomous coding agents increasingly contribute to software development by submitting pull requests on GitHub; yet, little is known about how these contributions integrate into human-driven review workflows. We present a large empirical…

Software Engineering · Computer Science 2026-02-24 Costain Nachuma , Minhaz Zibran

Learning to Boost the Efficiency of Modern Code Review

Modern Code Review (MCR) is a standard in all kinds of organizations that develop software. MCR pays for itself through perceived and proven benefits in quality assurance and knowledge transfer. However, the time invest in MCR is generally…

Software Engineering · Computer Science 2021-04-20 Robert Heumüller

Towards Automating Code Review Activities

Code reviews are popular in both industrial and open source projects. The benefits of code reviews are widely recognized and include better code quality and lower likelihood of introducing bugs. However, since code review is a manual…

Software Engineering · Computer Science 2021-05-20 Rosalia Tufano , Luca Pascarella , Michele Tufano , Denys Poshyvanyk , Gabriele Bavota

Generation-based Code Review Automation: How Far Are We?

Code review is an effective software quality assurance activity; however, it is labor-intensive and time-consuming. Thus, a number of generation-based automatic code review (ACR) approaches have been proposed recently, which leverage deep…

Software Engineering · Computer Science 2023-03-14 Xin Zhou , Kisub Kim , Bowen Xu , DongGyun Han , Junda He , David Lo

Measuring AI agent autonomy: Towards a scalable approach with code inspection

AI agents are AI systems that can achieve complex goals autonomously. Assessing the level of agent autonomy is crucial for understanding both their potential benefits and risks. Current assessments of autonomy often focus on specific risks…

Artificial Intelligence · Computer Science 2025-02-24 Peter Cihon , Merlin Stein , Gagan Bansal , Sam Manning , Kevin Xu

Code Review Automation: Strengths and Weaknesses of the State of the Art

The automation of code review has been tackled by several researchers with the goal of reducing its cost. The adoption of deep learning in software engineering pushed the automation to new boundaries, with techniques imitating developers in…

Software Engineering · Computer Science 2024-01-11 Rosalia Tufano , Ozren Dabić , Antonio Mastropaolo , Matteo Ciniselli , Gabriele Bavota

AI builds, We Analyze: An Empirical Study of AI-Generated Build Code Quality

The rapid adoption of AI coding agents for software development has raised important questions about the quality and maintainability of the code they produce. While prior studies have examined AI-generated source code, the impact of AI…

Software Engineering · Computer Science 2026-01-26 Anwar Ghammam , Mohamed Almukhtar

Understanding Dominant Themes in Reviewing Agentic AI-authored Code

While prior work has examined the generation capabilities of Agentic AI systems, little is known about how reviewers respond to AI-authored code in practice. In this paper, we present a large-scale empirical study of code review dynamics in…

Software Engineering · Computer Science 2026-01-28 Md. Asif Haider , Thomas Zimmermann