Related papers: Assertion-Aware Test Code Summarization with Large…

Assertion Messages with Large Language Models (LLMs) for Code

Assertion messages significantly enhance unit tests by clearly explaining the reasons behind test failures, yet they are frequently omitted by developers and automated test-generation tools. Despite recent advancements, Large Language…

Software Engineering · Computer Science 2025-09-25 Ahmed Aljohani , Anamul Haque Mollah , Hyunsook Do

On the Evaluation of Large Language Models in Unit Test Generation

Unit testing is an essential activity in software development for verifying the correctness of software components. However, manually writing unit tests is challenging and time-consuming. The emergence of Large Language Models (LLMs) offers…

Software Engineering · Computer Science 2024-09-26 Lin Yang , Chen Yang , Shutao Gao , Weijing Wang , Bo Wang , Qihao Zhu , Xiao Chu , Jianyi Zhou , Guangtai Liang , Qianxiang Wang , Junjie Chen

Beyond Strict Rules: Assessing the Effectiveness of Large Language Models for Code Smell Detection

Code smells are symptoms of potential code quality problems that may affect software maintainability, thus increasing development costs and impacting software reliability. Large language models (LLMs) have shown remarkable capabilities for…

Software Engineering · Computer Science 2026-01-16 Saymon Souza , Amanda Santana , Eduardo Figueiredo , Igor Muzetti , João Eduardo Montandon , Lionel Briand

Analysis on LLMs Performance for Code Summarization

Code summarization aims to generate concise natural language descriptions for source code. Deep learning has been used more and more recently in software engineering, particularly for tasks like code creation and summarization.…

Software Engineering · Computer Science 2025-01-27 Md. Ahnaf Akib , Md. Muktadir Mazumder , Salman Ahsan

Calibration of Large Language Models on Code Summarization

A brief, fluent, and relevant summary can be helpful during program comprehension; however, such a summary does require significant human effort to produce. Often, good summaries are unavailable in software projects, which makes maintenance…

Software Engineering · Computer Science 2025-06-03 Yuvraj Virk , Premkumar Devanbu , Toufique Ahmed

Source Code Summarization in the Era of Large Language Models

To support software developers in understanding and maintaining programs, various automatic (source) code summarization techniques have been proposed to generate a concise natural language summary (i.e., comment) for a given code snippet.…

Software Engineering · Computer Science 2025-08-26 Weisong Sun , Yun Miao , Yuekang Li , Hongyu Zhang , Chunrong Fang , Yi Liu , Gelei Deng , Yang Liu , Zhenyu Chen

Are Large Language Models Truly Smarter Than Humans?

Public leaderboards increasingly suggest that large language models (LLMs) surpass human experts on benchmarks spanning academic knowledge, law, and programming. Yet most benchmarks are fully public, their questions widely mirrored across…

Artificial Intelligence · Computer Science 2026-03-18 Eshwar Reddy M , Sourav Karmakar

Automatic Semantic Augmentation of Language Model Prompts (for Code Summarization)

Large Language Models (LLM) are a new class of computation engines, "programmed" via prompt engineering. We are still learning how to best "program" these LLMs to help developers. We start with the intuition that developers tend to…

Software Engineering · Computer Science 2024-01-15 Toufique Ahmed , Kunal Suresh Pai , Premkumar Devanbu , Earl T. Barr

LLM-ReSum: A Framework for LLM Reflective Summarization through Self-Evaluation

Reliable evaluation of large language model (LLM)-generated summaries remains an open challenge, particularly across heterogeneous domains and document lengths. We conduct a comprehensive meta-evaluation of 14 automatic summarization…

Computation and Language · Computer Science 2026-04-29 Huyen Nguyen , Haoxuan Zhang , Yang Zhang , Junhua Ding , Haihua Chen

Prompt-Driven Code Summarization: A Systematic Literature Review

Software documentation is essential for program comprehension, developer onboarding, code review, and long-term maintenance. Yet producing quality documentation manually is time-consuming and frequently yields incomplete or inconsistent…

Software Engineering · Computer Science 2026-04-20 Afia Farjana , Zaiyu Cheng , Antonio Mastropaolo

Exploring Prompting Large Language Models as Explainable Metrics

This paper describes the IUST NLP Lab submission to the Prompting Large Language Models as Explainable Metrics Shared Task at the Eval4NLP 2023 Workshop on Evaluation & Comparison of NLP Systems. We have proposed a zero-shot prompt-based…

Computation and Language · Computer Science 2023-11-21 Ghazaleh Mahmoudi

A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing

Unit testing plays a pivotal role in software development, improving software quality and reliability. However, generating effective test cases manually is time-consuming, prompting interest in unit testing research. Recently, Large…

Software Engineering · Computer Science 2024-12-24 Ye Shang , Quanjun Zhang , Chunrong Fang , Siqi Gu , Jianyi Zhou , Zhenyu Chen

Can Large Language Models Serve as Evaluators for Code Summarization?

Code summarization facilitates program comprehension and software maintenance by converting code snippets into natural-language descriptions. Over the years, numerous methods have been developed for this task, but a key challenge remains:…

Software Engineering · Computer Science 2024-12-03 Yang Wu , Yao Wan , Zhaoyang Chu , Wenting Zhao , Ye Liu , Hongyu Zhang , Xuanhua Shi , Philip S. Yu

Large Language Models Versus Static Code Analysis Tools: A Systematic Benchmark for Vulnerability Detection

Modern software relies on a multitude of automated testing and quality assurance tools to prevent errors, bugs and potential vulnerabilities. This study sets out to provide a head-to-head, quantitative and qualitative evaluation of six…

Software Engineering · Computer Science 2025-08-07 Damian Gnieciak , Tomasz Szandala

On the Effectiveness of LLM-as-a-judge for Code Generation and Summarization

Large Language Models have been recently exploited as judges for complex natural language processing tasks, such as Q&A. The basic idea is to delegate to an LLM the assessment of the "quality" of the output provided by an automated…

Software Engineering · Computer Science 2025-07-23 Giuseppe Crupi , Rosalia Tufano , Alejandro Velasco , Antonio Mastropaolo , Denys Poshyvanyk , Gabriele Bavota

ASSERTIFY: Utilizing Large Language Models to Generate Assertions for Production Code

Production assertions are statements embedded in the code to help developers validate their assumptions about the code. They assist developers in debugging, provide valuable documentation, and enhance code comprehension. Current research in…

Software Engineering · Computer Science 2024-11-27 Mohammad Jalili Torkamani , Abhinav Sharma , Nikita Mehrotra , Rahul Purandare

Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation

Generating tests automatically is a key and ongoing area of focus in software engineering research. The emergence of Large Language Models (LLMs) has opened up new opportunities, given their ability to perform a wide spectrum of tasks.…

Software Engineering · Computer Science 2025-01-20 Azat Abdullin , Pouria Derakhshanfar , Annibale Panichella

Measuring Determinism in Large Language Models for Software Code Review

Large Language Models (LLMs) promise to streamline software code reviews, but their ability to produce consistent assessments remains an open question. In this study, we tested four leading LLMs -- GPT-4o mini, GPT-4o, Claude 3.5 Sonnet,…

Software Engineering · Computer Science 2025-03-03 Eugene Klishevich , Yegor Denisov-Blanch , Simon Obstbaum , Igor Ciobanu , Michal Kosinski

Evaluating Large Language Models for the Generation of Unit Tests with Equivalence Partitions and Boundary Values

The design and implementation of unit tests is a complex task many programmers neglect. This research evaluates the potential of Large Language Models (LLMs) in automatically generating test cases, comparing them with manual tests. An…

Software Engineering · Computer Science 2025-05-16 Martín Rodríguez , Gustavo Rossi , Alejandro Fernandez

Exploring Large Language Models for Code Explanation

Automating code documentation through explanatory text can prove highly beneficial in code understanding. Large Language Models (LLMs) have made remarkable strides in Natural Language Processing, especially within software engineering tasks…

Software Engineering · Computer Science 2023-10-26 Paheli Bhattacharya , Manojit Chakraborty , Kartheek N S N Palepu , Vikas Pandey , Ishan Dindorkar , Rakesh Rajpurohit , Rishabh Gupta