Related papers: DocChecker: Bootstrapping Code Large Language Mode…

Deep Just-In-Time Inconsistency Detection Between Comments and Source Code

Natural language comments convey key aspects of source code such as implementation, usage, and pre- and post-conditions. Failure to update comments accordingly when the corresponding code is modified introduces inconsistencies, which is…

Software Engineering · Computer Science 2020-12-29 Sheena Panthaplackel , Junyi Jessy Li , Milos Gligoric , Raymond J. Mooney

Larger Is Not Always Better: Leveraging Structured Code Diffs for Comment Inconsistency Detection

Ensuring semantic consistency between source code and its accompanying comments is crucial for program comprehension, effective debugging, and long-term maintainability. Comment inconsistency arises when developers modify code but neglect…

Software Engineering · Computer Science 2025-12-25 Phong Nguyen , Anh M. T. Bui , Phuong T. Nguyen

CCISolver: End-to-End Detection and Repair of Method-Level Code-Comment Inconsistency

Comments within code serve as a crucial foundation for software documentation, facilitating developers to communicate and understand the code effectively. However, code-comment inconsistency (CCI) can negatively affect software development,…

Software Engineering · Computer Science 2025-06-26 Renyi Zhong , Yintong Huo , Wenwei Gu , Jinxi Kuang , Zhihan Jiang , Guangba Yu , Yichen Li , David Lo , Michael R. Lyu

DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation

Code review is a vital but demanding aspect of software development, generating significant interest in automating review comments. Traditional evaluation methods for these comments, primarily based on text similarity, face two major…

Software Engineering · Computer Science 2025-01-28 Junyi Lu , Xiaojia Li , Zihan Hua , Lei Yu , Shiqi Cheng , Li Yang , Fengjun Zhang , Chun Zuo

Are your comments outdated? Towards automatically detecting code-comment consistency

In software development and maintenance, code comments can help developers understand source code, and improve communication among developers. However, developers sometimes neglect to update the corresponding comment when changing the code,…

Software Engineering · Computer Science 2024-03-04 Yuan Huang , Yinan Chen , Xiangping Chen , Xiaocong Zhou

Code Comment Inconsistency Detection with BERT and Longformer

Comments, or natural language descriptions of source code, are standard practice among software developers. By communicating important aspects of the code such as functionality and usage, comments help with software project maintenance.…

Computation and Language · Computer Science 2022-08-01 Theo Steiner , Rui Zhang

A Qualitative Investigation into LLM-Generated Multilingual Code Comments and Automatic Evaluation Metrics

Large Language Models are essential coding assistants, yet their training is predominantly English-centric. In this study, we evaluate the performance of code language models in non-English contexts, identifying challenges in their adoption…

Software Engineering · Computer Science 2025-05-22 Jonathan Katzy , Yongcheng Huang , Gopal-Raj Panchu , Maksym Ziemlewski , Paris Loizides , Sander Vermeulen , Arie van Deursen , Maliheh Izadi

DocPrism: Local Categorization and External Filtering to Identify Relevant Code-Documentation Inconsistencies

Code-documentation inconsistencies are common and undesirable: they can lead to developer misunderstandings and software defects. This paper introduces DocPrism, a multi-language, code-documentation inconsistency detection tool. DocPrism…

Software Engineering · Computer Science 2025-11-04 Xiaomeng Xu , Zahin Wahab , Reid Holmes , Caroline Lemieux

SmartPatchLinker: An Open-Source Tool to Linked Changes Detection for Code Review

In large software ecosystems, semantically related code changes, such as alternative solutions or overlapping modifications are often discovered only days after submission, leading to duplicated effort and delayed reviews. We present…

Software Engineering · Computer Science 2026-04-07 Islem Khemissi , Moataz Chouchen , Dong Wang , Raula Gaikovina Kula

Harnessing Large Language Models for Curated Code Reviews

In code review, generating structured and relevant comments is crucial for identifying code issues and facilitating accurate code changes that ensure an efficient code review process. Well-crafted comments not only streamline the code…

Software Engineering · Computer Science 2025-02-06 Oussama Ben Sghaier , Martin Weyssow , Houari Sahraoui

Comments on Comments: Where Code Review and Documentation Meet

A central function of code review is to increase understanding; helping reviewers understand a code change aids in knowledge transfer and finding bugs. Comments in code largely serve a similar purpose, helping future readers understand the…

Software Engineering · Computer Science 2022-04-04 Nikitha Rao , Jason Tsay , Martin Hirzel , Vincent J. Hellendoorn

Identifying Inaccurate Descriptions in LLM-generated Code Comments via Test Execution

Software comments are critical for human understanding of software, and as such many comment generation techniques have been proposed. However, we find that a systematic evaluation of the factual accuracy of generated comments is rare; only…

Software Engineering · Computer Science 2024-06-24 Sungmin Kang , Louis Milliken , Shin Yoo

ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases

Reasoning over Commonsense Knowledge Bases (CSKB), i.e. CSKB reasoning, has been explored as a way to acquire new commonsense knowledge based on reference knowledge in the original CSKBs and external prior knowledge. Despite the advancement…

Computation and Language · Computer Science 2024-01-26 Quyet V. Do , Tianqing Fang , Shizhe Diao , Zhaowei Wang , Yangqiu Song

CCTEST: Testing and Repairing Code Completion Systems

Code completion, a highly valuable topic in the software development domain, has been increasingly promoted for use by recent advances in large language models (LLMs). To date, visible LLM-based code completion frameworks such as GitHub…

Software Engineering · Computer Science 2023-05-09 Zongjie Li , Chaozheng Wang , Zhibo Liu , Haoxuan Wang , Dong Chen , Shuai Wang , Cuiyun Gao

DocTer: Documentation Guided Fuzzing for Testing Deep Learning API Functions

Input constraints are useful for many software development tasks. For example, input constraints of a function enable the generation of valid inputs, i.e., inputs that follow these constraints, to test the function deeper. API functions of…

Software Engineering · Computer Science 2024-03-07 Danning Xie , Yitong Li , Mijung Kim , Hung Viet Pham , Lin Tan , Xiangyu Zhang , Michael W. Godfrey

Predicting Usefulness of Code Review Comments using Textual Features and Developer Experience

Although peer code review is widely adopted in both commercial and open source development, existing studies suggest that such code reviews often contain a significant amount of non-useful review comments. Unfortunately, to date, no tools…

Software Engineering · Computer Science 2018-07-13 Mohammad Masudur Rahman , Chanchal K. Roy , Raula G. Kula

DiffSearch: A Scalable and Precise Search Engine for Code Changes

The source code of successful projects is evolving all the time, resulting in hundreds of thousands of code changes stored in source code repositories. This wealth of data can be useful, e.g., to find changes similar to a planned code…

Software Engineering · Computer Science 2022-11-01 Luca Di Grazia , Paul Bredl , Michael Pradel

Security Defect Detection via Code Review: A Study of the OpenStack and Qt Communities

Background: Despite the widespread use of automated security defect detection tools, software projects still contain many security defects that could result in serious damage. Such tools are largely context-insensitive and may not cover all…

Software Engineering · Computer Science 2023-07-06 Jiaxin Yu , Liming Fu , Peng Liang , Amjed Tahir , Mojtaba Shahin

Co-FactChecker: A Framework for Human-AI Collaborative Claim Verification Using Large Reasoning Models

Professional fact-checkers rely on domain knowledge and deep contextual understanding to verify claims. Large language models (LLMs) and large reasoning models (LRMs) lack such grounding and primarily reason from available evidence alone,…

Computation and Language · Computer Science 2026-04-16 Dhruv Sahnan , Subhabrata Dutta , Tanmoy Chakraborty , Preslav Nakov , Iryna Gurevych

SecureReviewer: Enhancing Large Language Models for Secure Code Review through Secure-aware Fine-tuning

Identifying and addressing security issues during the early phase of the development lifecycle is critical for mitigating the long-term negative impacts on software systems. Code review serves as an effective practice that enables…

Software Engineering · Computer Science 2025-10-31 Fang Liu , Simiao Liu , Yinghao Zhu , Xiaoli Lian , Li Zhang