Related papers: Improving Retrieval-Augmented Code Comment Generat…

When Retriever Meets Generator: A Joint Model for Code Comment Generation

Automatically generating concise, informative comments for source code can lighten documentation effort and accelerate program comprehension. Retrieval-augmented approaches first fetch code snippets with existing comments and then…

Software Engineering · Computer Science 2025-07-25 Tien P. T. Le , Anh M. T. Bui , Huy N. D. Pham , Alessio Bucaioni , Phuong T. Nguyen

Retrieval-Augmented Code Review Comment Generation

Automated code review comment generation (RCG) aims to assist developers by automatically producing natural language feedback for code changes. Existing approaches are primarily either generation-based, using pretrained language models, or…

Software Engineering · Computer Science 2025-06-16 Hyunsun Hong , Jongmoon Baik

Assessing the Answerability of Queries in Retrieval-Augmented Code Generation

Thanks to unprecedented language understanding and generation capabilities of large language model (LLM), Retrieval-augmented Code Generation (RaCG) has recently been widely utilized among software developers. While this has increased…

Computation and Language · Computer Science 2024-11-26 Geonmin Kim , Jaeyeon Kim , Hancheol Park , Wooksu Shin , Tae-Ho Kim

Retrieve and Refine: Exemplar-based Neural Comment Generation

Code comment generation which aims to automatically generate natural language descriptions for source code, is a crucial task in the field of automatic software development. Traditional comment generation methods use manually-crafted…

Software Engineering · Computer Science 2020-10-12 Bolin Wei , Yongmin Li , Ge Li , Xin Xia , Zhi Jin

Yet Another Combination of IR- and Neural-based Comment Generation

Code comment generation techniques aim to generate natural language descriptions for source code. There are two orthogonal approaches for this task, i.e., information retrieval (IR) based and neural-based methods. Recent studies have…

Software Engineering · Computer Science 2021-07-28 Huang Yuchao , Wei Moshi , Wang Song , Wang Junjie , Wang Qing

Rethinking Retrieval-Augmented Generation as a Cooperative Decision-Making Problem

Retrieval-Augmented Generation (RAG) has demonstrated strong effectiveness in knowledge-intensive tasks by grounding language generation in external evidence. Despite its success, many existing RAG systems are built based on a…

Computation and Language · Computer Science 2026-04-27 Lichang Song , Ting Long , Yi Chang

Retrieval-Augmented Code Generation: A Survey with Focus on Repository-Level Approaches

Recent advances in large language models (LLMs) have significantly improved automated code generation. While existing approaches have achieved strong performance at the function and file levels, real-world software engineering requires…

Software Engineering · Computer Science 2026-05-21 Yicheng Tao , Yuante Li , Yao Qin , Yepang Liu

When More Retrieval Hurts: Retrieval-Augmented Code Review Generation

Code review generation can reduce developer effort by producing concise, reviewer-style feedback for a given code snippet or code change. However, generation-only models often produce generic or off-point reviews, while retrieval-only…

Software Engineering · Computer Science 2026-03-26 Qianru Meng , Xiao Zhang , Zhaochen Ren , Joost Visser

Retrieve and Refine: Exemplar-based Neural Comment Generation

Code comment generation is a crucial task in the field of automatic software development. Most previous neural comment generation systems used an encoder-decoder neural network and encoded only information from source code as input.…

Software Engineering · Computer Science 2019-10-24 Bolin Wei

Learning Comment Generation by Leveraging User-Generated Data

Existing models on open-domain comment generation are difficult to train, and they produce repetitive and uninteresting responses. The problem is due to multiple and contradictory responses from a single article, and by the rigidity of…

Computation and Language · Computer Science 2019-02-28 Zhaojiang Lin , Genta Indra Winata , Pascale Fung

A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation

A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation Download PDF Neal Gregory Lawton, Alfy Samuel, Anoop Kumar, Daben Liu Published: 20 Aug 2025, Retrieval augmented generation (RAG) is a popular…

Computation and Language · Computer Science 2025-10-21 Neal Gregory Lawton , Alfy Samuel , Anoop Kumar , Daben Liu

An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities

Code generation aims to automatically generate code snippets of specific programming language according to natural language descriptions. The continuous advancements in deep learning, particularly pre-trained models, have empowered the code…

Software Engineering · Computer Science 2025-01-24 Zezhou Yang , Sirong Chen , Cuiyun Gao , Zhenhao Li , Xing Hu , Kui Liu , Xin Xia

CodeRAG-Bench: Can Retrieval Augment Code Generation?

While language models (LMs) have proven remarkably adept at generating code, many programs are challenging for LMs to generate using their parametric knowledge alone. Providing external contexts such as library documentation can facilitate…

Software Engineering · Computer Science 2025-02-28 Zora Zhiruo Wang , Akari Asai , Xinyan Velocity Yu , Frank F. Xu , Yiqing Xie , Graham Neubig , Daniel Fried

A Survey on Retrieval-Augmented Text Generation for Large Language Models

Retrieval-Augmented Generation (RAG) merges retrieval methods with deep learning advancements to address the static limitations of large language models (LLMs) by enabling the dynamic integration of up-to-date external information. This…

Information Retrieval · Computer Science 2026-05-19 Yizheng Huang , Jimmy Huang

Retrieval-Augmented Generation for AI-Generated Content: A Survey

Advancements in model algorithms, the growth of foundational models, and access to high-quality datasets have propelled the evolution of Artificial Intelligence Generated Content (AIGC). Despite its notable successes, AIGC still faces…

Computer Vision and Pattern Recognition · Computer Science 2024-06-24 Penghao Zhao , Hailin Zhang , Qinhan Yu , Zhengren Wang , Yunteng Geng , Fangcheng Fu , Ling Yang , Wentao Zhang , Jie Jiang , Bin Cui

Enhancing Code Annotation Reliability: Generative AI's Role in Comment Quality Assessment Models

This paper explores a novel method for enhancing binary classification models that assess code comment quality, leveraging Generative Artificial Intelligence to elevate model performance. By integrating 1,437 newly generated code-comment…

Software Engineering · Computer Science 2024-10-30 Seetharam Killivalavan , Durairaj Thenmozhi

RAG-Reflect: Agentic Retrieval-Augmented Generation with Reflections for Comment-Driven Code Maintenance on Stack Overflow

User comments on online programming platforms such as Stack Overflow play a vital role in maintaining the correctness and relevance of shared code examples. However, the majority of comments express gratitude or clarification, while only a…

Software Engineering · Computer Science 2026-04-27 Mehedi Hasan Shanto , Muhammad Asaduzzaman , Alioune Ngom

Retrieval-Augmented Feature Generation for Domain-Specific Classification

Feature generation can significantly enhance learning outcomes, particularly for tasks with limited data. An effective way to improve feature generation is to expand the current feature space using existing features and enriching the…

Computation and Language · Computer Science 2025-11-11 Xinhao Zhang , Jinghan Zhang , Fengran Mo , Dakshak Keerthi Chandra , Yu-Zhong Chen , Fei Xie , Kunpeng Liu

ReCode: Improving LLM-based Code Repair with Fine-Grained Retrieval-Augmented Generation

Recent advances in large language models (LLMs) have demonstrated impressive capabilities in code-related tasks, such as code generation and automated program repair. Despite their promising performance, most existing approaches for code…

Software Engineering · Computer Science 2025-09-03 Yicong Zhao , Shisong Chen , Jiacheng Zhang , Zhixu Li

RAGTrace: Understanding and Refining Retrieval-Generation Dynamics in Retrieval-Augmented Generation

Retrieval-Augmented Generation (RAG) systems have emerged as a promising solution to enhance large language models (LLMs) by integrating external knowledge retrieval with generative capabilities. While significant advancements have been…

Human-Computer Interaction · Computer Science 2025-08-11 Sizhe Cheng , Jiaping Li , Huanchen Wang , Yuxin Ma