Related papers: Progressive Code Integration for Abstractive Bug R…

LLMs as Evaluators: A Novel Approach to Evaluate Bug Report Summarization

Summarizing software artifacts is an important task that has been thoroughly researched. For evaluating software summarization approaches, human judgment is still the most trusted evaluation. However, it is time-consuming and fatiguing for…

Software Engineering · Computer Science 2024-09-04 Abhishek Kumar , Sonia Haiduc , Partha Pratim Das , Partha Pratim Chakrabarti

Summary-Mediated Repair: Can LLMs use code summarisation as a tool for program repair?

Large Language Models (LLMs) often produce code with subtle implementation-level bugs despite strong benchmark performance. These errors are hard for LLMs to spot and can have large behavioural effects; yet when asked to summarise code,…

Software Engineering · Computer Science 2025-11-25 Lukas Twist

Empirical Analysis and Detection of Hallucinations in LLM-Generated Bug Report Summaries

Large Language Models (LLMs) are increasingly used to generate summaries of software bug reports, including sections such as Steps-to-Reproduce (S2R), Actual Behavior (AB), and Expected Behavior (EB). However, these models frequently…

Software Engineering · Computer Science 2026-05-26 Hinduja Nirujan , Shreyas Patil , Abdallah Ayoub , Ahmad Abdel Latif , Gouri Ginde

Prompt-Driven Code Summarization: A Systematic Literature Review

Software documentation is essential for program comprehension, developer onboarding, code review, and long-term maintenance. Yet producing quality documentation manually is time-consuming and frequently yields incomplete or inconsistent…

Software Engineering · Computer Science 2026-04-20 Afia Farjana , Zaiyu Cheng , Antonio Mastropaolo

StructSum: Summarization via Structured Representations

Abstractive text summarization aims at compressing the information of a long source document into a rephrased, condensed summary. Despite advances in modeling techniques, abstractive summarization models still suffer from several key…

Computation and Language · Computer Science 2021-02-17 Vidhisha Balachandran , Artidoro Pagnoni , Jay Yoon Lee , Dheeraj Rajagopal , Jaime Carbonell , Yulia Tsvetkov

Source Code Summarization in the Era of Large Language Models

To support software developers in understanding and maintaining programs, various automatic (source) code summarization techniques have been proposed to generate a concise natural language summary (i.e., comment) for a given code snippet.…

Software Engineering · Computer Science 2025-08-26 Weisong Sun , Yun Miao , Yuekang Li , Hongyu Zhang , Chunrong Fang , Yi Liu , Gelei Deng , Yang Liu , Zhenyu Chen

Code Summarization Beyond Function Level

Code summarization is a critical task in natural language processing and software engineering, which aims to generate concise descriptions of source code. Recent advancements have improved the quality of these summaries, enhancing code…

Computation and Language · Computer Science 2025-02-25 Vladimir Makharev , Vladimir Ivanov

Simple and Effective Baselines for Code Summarisation Evaluation

Code documentation is useful, but writing it is time-consuming. Different techniques for generating code summaries have emerged, but comparing them is difficult because human evaluation is expensive and automatic metrics are unreliable. In…

Computation and Language · Computer Science 2025-05-27 Jade Robinson , Jonathan K. Kummerfeld

A Novel LLM-based Two-stage Summarization Approach for Long Dialogues

Long document summarization poses a significant challenge in natural language processing due to input lengths that exceed the capacity of most state-of-the-art pre-trained language models. This study proposes a hierarchical framework that…

Computation and Language · Computer Science 2024-10-10 Yuan-Jhe Yin , Bo-Yu Chen , Berlin Chen

Natural Language Summarization Enables Multi-Repository Bug Localization by LLMs in Microservice Architectures

Bug localization in multi-repository microservice architectures is challenging due to the semantic gap between natural language bug reports and code, LLM context limitations, and the need to first identify the correct repository. We propose…

Software Engineering · Computer Science 2025-12-08 Amirkia Rafiei Oskooei , S. Selcan Yukcu , Mehmet Cevheri Bozoglan , Mehmet S. Aktas

Understanding LLM Reasoning for Abstractive Summarization

While the reasoning capabilities of Large Language Models (LLMs) excel in analytical tasks such as mathematics and code generation, their utility for abstractive summarization remains widely assumed but largely unverified. To bridge this…

Computation and Language · Computer Science 2025-12-10 Haohan Yuan , Haopeng Zhang

Hierarchical Knowledge Injection for Improving LLM-based Program Repair

Prompting LLMs with bug-related context (e.g., error messages, stack traces) improves automated program repair, but many bugs still remain unresolved. In real-world projects, developers often rely on broader repository and project-level…

Software Engineering · Computer Science 2026-02-10 Ramtin Ehsani , Esteban Parra , Sonia Haiduc , Preetha Chatterjee

Meta-RAG on Large Codebases Using Code Summarization

Large Language Model (LLM) systems have been at the forefront of applied Artificial Intelligence (AI) research in a multitude of domains. One such domain is software development, where researchers have pushed the automation of a number of…

Software Engineering · Computer Science 2025-08-08 Vali Tawosi , Salwa Alamir , Xiaomo Liu , Manuela Veloso

Towards Leveraging Large Language Model Summaries for Topic Modeling in Source Code

Understanding source code is a topic of great interest in the software engineering community, since it can help programmers in various tasks such as software maintenance and reuse. Recent advances in large language models (LLMs) have…

Software Engineering · Computer Science 2025-04-25 Michele Carissimi , Martina Saletta , Claudio Ferretti

Beyond Summaries: Structure-Aware Labeling of Code Changes with Large Language Models

Code review is a critical practice in software engineering, yet the growing scale and frequency of code patches in modern projects, together with the widespread adoption of AI code assistants, make manual review increasingly challenging.…

Software Engineering · Computer Science 2026-05-26 Bar Weiss , Antonio Abu-Nassar , Adi Sosnovich , Karen Yorav

LLM-ReSum: A Framework for LLM Reflective Summarization through Self-Evaluation

Reliable evaluation of large language model (LLM)-generated summaries remains an open challenge, particularly across heterogeneous domains and document lengths. We conduct a comprehensive meta-evaluation of 14 automatic summarization…

Computation and Language · Computer Science 2026-04-29 Huyen Nguyen , Haoxuan Zhang , Yang Zhang , Junhua Ding , Haihua Chen

Abstractive Text Summarization: State of the Art, Challenges, and Improvements

Specifically focusing on the landscape of abstractive text summarization, as opposed to extractive techniques, this survey presents a comprehensive overview, delving into state-of-the-art techniques, prevailing challenges, and prospective…

Computation and Language · Computer Science 2024-09-05 Hassan Shakil , Ahmad Farooq , Jugal Kalita

Automatic Semantic Augmentation of Language Model Prompts (for Code Summarization)

Large Language Models (LLM) are a new class of computation engines, "programmed" via prompt engineering. We are still learning how to best "program" these LLMs to help developers. We start with the intuition that developers tend to…

Software Engineering · Computer Science 2024-01-15 Toufique Ahmed , Kunal Suresh Pai , Premkumar Devanbu , Earl T. Barr

An Evaluation of Large Language Models on Text Summarization Tasks Using Prompt Engineering Techniques

Large Language Models (LLMs) continue to advance natural language processing with their ability to generate human-like text across a range of tasks. Despite the remarkable success of LLMs in Natural Language Processing (NLP), their…

Computation and Language · Computer Science 2025-07-08 Walid Mohamed Aly , Taysir Hassan A. Soliman , Amr Mohamed AbdelAziz

Leveraging Long-Context Large Language Models for Multi-Document Understanding and Summarization in Enterprise Applications

The rapid increase in unstructured data across various fields has made multi-document comprehension and summarization a critical task. Traditional approaches often fail to capture relevant context, maintain logical consistency, and extract…

Computation and Language · Computer Science 2024-09-30 Aditi Godbole , Jabin Geevarghese George , Smita Shandilya