Related papers: KADEL: Knowledge-Aware Denoising Learning for Comm…

On the Evaluation of Commit Message Generation Models: An Experimental Study

Commit messages are natural language descriptions of code changes, which are important for program understanding and maintenance. However, writing commit messages manually is time-consuming and laborious, especially when the code is updated…

Software Engineering · Computer Science 2021-07-27 Wei Tao , Yanlin Wang , Ensheng Shi , Lun Du , Shi Han , Hongyu Zhang , Dongmei Zhang , Wenqiang Zhang

Automated Generation of Commit Messages in Software Repositories

Commit messages are crucial for documenting software changes, aiding in program comprehension and maintenance. However, creating effective commit messages is often overlooked by developers due to time constraints and varying levels of…

Software Engineering · Computer Science 2025-04-18 Varun Kumar Palakodeti , Abbas Heydarnoori

CommitBERT: Commit Message Generation Using Pre-Trained Programming Language Model

Commit message is a document that summarizes source code changes in natural language. A good commit message clearly shows the source code changes, so this enhances collaboration between developers. Therefore, our work is to develop a model…

Computation and Language · Computer Science 2021-06-01 Tae-Hwan Jung

Delving into Commit-Issue Correlation to Enhance Commit Message Generation Models

Commit message generation (CMG) is a challenging task in automated software engineering that aims to generate natural language descriptions of code changes for commits. Previous methods all start from the modified code snippets, outputting…

Software Engineering · Computer Science 2023-09-29 Liran Wang , Xunzhu Tang , Yichen He , Changyu Ren , Shuhua Shi , Chaoran Yan , Zhoujun Li

Generating Commit Messages from Git Diffs

Commit messages aid developers in their understanding of a continuously evolving codebase. However, developers not always document code changes properly. Automatically generating commit messages would relieve this burden on developers.…

Software Engineering · Computer Science 2019-11-27 S. R. P. van Hal , M. Post , K. Wendel

An Empirical Study on Commit Message Generation using LLMs via In-Context Learning

Commit messages concisely describe code changes in natural language and are important for software maintenance. Several approaches have been proposed to automatically generate commit messages, but they still suffer from critical…

Software Engineering · Computer Science 2025-02-27 Yifan Wu , Yunpeng Wang , Ying Li , Wei Tao , Siyu Yu , Haowen Yang , Wei Jiang , Jianguo Li

CoreGen: Contextualized Code Representation Learning for Commit Message Generation

Automatic generation of high-quality commit messages for code commits can substantially facilitate software developers' works and coordination. However, the semantic gap between source code and natural language poses a major challenge for…

Computation and Language · Computer Science 2021-06-22 Lun Yiu Nie , Cuiyun Gao , Zhicong Zhong , Wai Lam , Yang Liu , Zenglin Xu

Not All Knowledge Is Created Equal: Mutual Distillation of Confident Knowledge

Mutual knowledge distillation (MKD) improves a model by distilling knowledge from another model. However, \textit{not all knowledge is certain and correct}, especially under adverse conditions. For example, label noise usually leads to less…

Machine Learning · Computer Science 2022-11-17 Ziyun Li , Xinshao Wang , Di Hu , Neil M. Robertson , David A. Clifton , Christoph Meinel , Haojin Yang

Knowledge Distillation for Speech Denoising by Latent Representation Alignment with Cosine Distance

Speech denoising is a generally adopted and impactful task, appearing in many common and everyday-life use cases. Although there are very powerful methods published, most of those are too complex for deployment in everyday and low-resources…

Sound · Computer Science 2025-05-07 Diep Luong , Mikko Heikkinen , Konstantinos Drossos , Tuomas Virtanen

KA2L: A Knowledge-Aware Active Learning Framework for LLMs

Fine-tuning large language models (LLMs) with high-quality knowledge has been shown to enhance their performance effectively. However, there is a paucity of research on the depth of domain-specific knowledge comprehension by LLMs and the…

Computation and Language · Computer Science 2026-03-19 Haoxuan Yin , Bojian Liu , Chen Tang , Yangfan Wang , Lian Yan , Jingchi Jiang

A Sketch-Based Neural Model for Generating Commit Messages from Diffs

Commit messages have an important impact in software development, especially when working in large teams. Multiple developers who have a different style of writing may often be involved in the same project. For this reason, it may be…

Computation and Language · Computer Science 2021-04-12 Nicolae-Teodor Pavel , Traian Rebedea

Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-efficiency, and Better Transferability

Large-scale pre-training has been proven to be crucial for various computer vision tasks. However, with the increase of pre-training data amount, model architecture amount, and the private/inaccessible data, it is not very efficient or…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Ruifei He , Shuyang Sun , Jihan Yang , Song Bai , Xiaojuan Qi

Knowledge Distillation and Training Balance for Heterogeneous Decentralized Multi-Modal Learning over Wireless Networks

Decentralized learning is widely employed for collaboratively training models using distributed data over wireless networks. Existing decentralized learning methods primarily focus on training single-modal networks. For the decentralized…

Information Theory · Computer Science 2023-11-14 Benshun Yin , Zhiyong Chen , Meixia Tao

Automatically Generating Commit Messages from Diffs using Neural Machine Translation

Commit messages are a valuable resource in comprehension of software evolution, since they provide a record of changes such as feature additions and bug repairs. Unfortunately, programmers often neglect to write good commit messages.…

Software Engineering · Computer Science 2017-09-01 Siyuan Jiang , Ameer Armaly , Collin McMillan

Talking Models: Distill Pre-trained Knowledge to Downstream Models via Interactive Communication

Many recent breakthroughs in machine learning have been enabled by the pre-trained foundation models. By scaling up model parameters, training data, and computation resources, foundation models have significantly advanced the…

Artificial Intelligence · Computer Science 2023-10-06 Zhe Zhao , Qingyun Liu , Huan Gui , Bang An , Lichan Hong , Ed H. Chi

Brevity is the Soul of Wit: Condensing Code Changes to Improve Commit Message Generation

Commit messages are valuable resources for describing why code changes are committed to repositories in version control systems (e.g., Git). They effectively help developers understand code changes and better perform software maintenance…

Software Engineering · Computer Science 2025-09-22 Hongyu Kuang , Ning Zhang , Hui Gao , Xin Zhou , Wesley K. G. Assunção , Xiaoxing Ma , Dong Shao , Guoping Rong , He Zhang

Denoising-based Contractive Imitation Learning

A fundamental challenge in imitation learning is the \emph{covariate shift} problem. Existing methods to mitigate covariate shift often require additional expert interactions, access to environment dynamics, or complex adversarial training,…

Machine Learning · Computer Science 2025-03-21 Macheng Shen , Jishen Peng , Zefang Huang

Denoising Neural Machine Translation Training with Trusted Data and Online Data Selection

Measuring domain relevance of data and identifying or selecting well-fit domain data for machine translation (MT) is a well-studied topic, but denoising is not yet. Denoising is concerned with a different type of data quality and tries to…

Computation and Language · Computer Science 2018-09-05 Wei Wang , Taro Watanabe , Macduff Hughes , Tetsuji Nakagawa , Ciprian Chelba

Knowledge-Decoupled Synergetic Learning: An MLLM based Collaborative Approach to Few-shot Multimodal Dialogue Intention Recognition

Few-shot multimodal dialogue intention recognition is a critical challenge in the e-commerce domainn. Previous methods have primarily enhanced model classification capabilities through post-training techniques. However, our analysis reveals…

Computation and Language · Computer Science 2025-03-07 Bin Chen , Yu Zhang , Hongfei Ye , Ziyi Huang , Hongyang Chen

Knowledge-augmented Deep Learning and Its Applications: A Survey

Deep learning models, though having achieved great success in many different fields over the past years, are usually data hungry, fail to perform well on unseen samples, and lack of interpretability. Various prior knowledge often exists in…

Machine Learning · Computer Science 2022-12-02 Zijun Cui , Tian Gao , Kartik Talamadupula , Qiang Ji