Related papers: KADEL: Knowledge-Aware Denoising Learning for Comm…

200 papers

Commit messages are natural language descriptions of code changes, which are important for program understanding and maintenance. However, writing commit messages manually is time-consuming and laborious, especially when the code is updated…

Software Engineering · Computer Science 2021-07-27 Wei Tao, Yanlin Wang, Ensheng Shi, Lun Du, Shi Han, Hongyu Zhang, Dongmei Zhang, Wenqiang Zhang

Commit messages are crucial for documenting software changes, aiding in program comprehension and maintenance. However, creating effective commit messages is often overlooked by developers due to time constraints and varying levels of…

Software Engineering · Computer Science 2025-04-18 Varun Kumar Palakodeti, Abbas Heydarnoori

Commit message is a document that summarizes source code changes in natural language. A good commit message clearly shows the source code changes, so this enhances collaboration between developers. Therefore, our work is to develop a model…

Computation and Language · Computer Science 2021-06-01 Tae-Hwan Jung

Commit message generation (CMG) is a challenging task in automated software engineering that aims to generate natural language descriptions of code changes for commits. Previous methods all start from the modified code snippets, outputting…

Software Engineering · Computer Science 2023-09-29 Liran Wang, Xunzhu Tang, Yichen He, Changyu Ren, Shuhua Shi, Chaoran Yan, Zhoujun Li

Commit messages aid developers in their understanding of a continuously evolving codebase. However, developers do not always document code changes properly. Automatically generating commit messages would relieve this burden on developers.…

Software Engineering · Computer Science 2019-11-27 S. R. P. van Hal, M. Post, K. Wendel

Commit messages concisely describe code changes in natural language and are important for software maintenance. Several approaches have been proposed to automatically generate commit messages, but they still suffer from critical…

Software Engineering · Computer Science 2025-02-27 Yifan Wu, Yunpeng Wang, Ying Li, Wei Tao, Siyu Yu, Haowen Yang, Wei Jiang, Jianguo Li

Automatic generation of high-quality commit messages for code commits can substantially facilitate software developers' work and coordination. However, the semantic gap between source code and natural language poses a major challenge for…

Computation and Language · Computer Science 2021-06-22 Lun Yiu Nie, Cuiyun Gao, Zhicong Zhong, Wai Lam, Yang Liu, Zenglin Xu

Mutual knowledge distillation (MKD) improves a model by distilling knowledge from another model. However, not all knowledge is certain and correct, especially under adverse conditions. For example, label noise usually leads to less…

Machine Learning · Computer Science 2022-11-17 Ziyun Li, Xinshao Wang, Di Hu, Neil M. Robertson, David A. Clifton, Christoph Meinel, Haojin Yang

Speech denoising is a generally adopted and impactful task, appearing in many common and everyday-life use cases. Although there are very powerful methods published, most of those are too complex for deployment in everyday and low-resources…

Sound · Computer Science 2025-05-07 Diep Luong, Mikko Heikkinen, Konstantinos Drossos, Tuomas Virtanen

Fine-tuning large language models (LLMs) with high-quality knowledge has been shown to enhance their performance effectively. However, there is a paucity of research on the depth of domain-specific knowledge comprehension by LLMs and the…

Computation and Language · Computer Science 2026-03-19 Haoxuan Yin, Bojian Liu, Chen Tang, Yangfan Wang, Lian Yan, Jingchi Jiang

Commit messages have an important impact in software development, especially when working in large teams. Multiple developers who have a different style of writing may often be involved in the same project. For this reason, it may be…

Computation and Language · Computer Science 2021-04-12 Nicolae-Teodor Pavel, Traian Rebedea

Large-scale pre-training has been proven to be crucial for various computer vision tasks. However, with the increase of pre-training data amount, model architecture amount, and the private/inaccessible data, it is not very efficient or…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Ruifei He, Shuyang Sun, Jihan Yang, Song Bai, Xiaojuan Qi

Decentralized learning is widely employed for collaboratively training models using distributed data over wireless networks. Existing decentralized learning methods primarily focus on training single-modal networks. For the decentralized…

Information Theory · Computer Science 2023-11-14 Benshun Yin, Zhiyong Chen, Meixia Tao

Commit messages are a valuable resource in comprehension of software evolution, since they provide a record of changes such as feature additions and bug repairs. Unfortunately, programmers often neglect to write good commit messages.…

Software Engineering · Computer Science 2017-09-01 Siyuan Jiang, Ameer Armaly, Collin McMillan

Many recent breakthroughs in machine learning have been enabled by the pre-trained foundation models. By scaling up model parameters, training data, and computation resources, foundation models have significantly advanced the…

Artificial Intelligence · Computer Science 2023-10-06 Zhe Zhao, Qingyun Liu, Huan Gui, Bang An, Lichan Hong, Ed H. Chi

Commit messages are valuable resources for describing why code changes are committed to repositories in version control systems (e.g., Git). They effectively help developers understand code changes and better perform software maintenance…

Software Engineering · Computer Science 2025-09-22 Hongyu Kuang, Ning Zhang, Hui Gao, Xin Zhou, Wesley K. G. Assunção, Xiaoxing Ma, Dong Shao, Guoping Rong, He Zhang

A fundamental challenge in imitation learning is the covariate shift problem. Existing methods to mitigate covariate shift often require additional expert interactions, access to environment dynamics, or complex adversarial training,…

Machine Learning · Computer Science 2025-03-21 Macheng Shen, Jishen Peng, Zefang Huang

Measuring domain relevance of data and identifying or selecting well-fit domain data for machine translation (MT) is a well-studied topic, but denoising is not yet. Denoising is concerned with a different type of data quality and tries to…

Computation and Language · Computer Science 2018-09-05 Wei Wang, Taro Watanabe, Macduff Hughes, Tetsuji Nakagawa, Ciprian Chelba

Few-shot multimodal dialogue intention recognition is a critical challenge in the e-commerce domain. Previous methods have primarily enhanced model classification capabilities through post-training techniques. However, our analysis reveals…

Computation and Language · Computer Science 2025-03-07 Bin Chen, Yu Zhang, Hongfei Ye, Ziyi Huang, Hongyang Chen

Deep learning models, though having achieved great success in many different fields over the past years, are usually data hungry, fail to perform well on unseen samples, and lack interpretability. Various prior knowledge often exists in…

Machine Learning · Computer Science 2022-12-02 Zijun Cui, Tian Gao, Kartik Talamadupula, Qiang Ji