Related papers: Optimizing Deep Learning Models to Address Class I…

Dopamin: Transformer-based Comment Classifiers through Domain Post-Training and Multi-level Layer Aggregation

Code comments provide important information for understanding the source code. They can help developers understand the overall purpose of a function or class, as well as identify bugs and technical debt. However, an overabundance of…

Computation and Language · Computer Science 2024-08-12 Nam Le Hai , Nghi D. Q. Bui

STACC: Code Comment Classification using SentenceTransformers

Code comments are a key resource for information about software artefacts. Depending on the use case, only some types of comments are useful. Thus, automatic approaches to classify these comments have been proposed. In this work, we address…

Software Engineering · Computer Science 2023-03-08 Ali Al-Kaswan , Maliheh Izadi , Arie van Deursen

Class-Wise Difficulty-Balanced Loss for Solving Class-Imbalance

Class-imbalance is one of the major challenges in real world datasets, where a few classes (called majority classes) constitute much more data samples than the rest (called minority classes). Learning deep neural networks using such…

Computer Vision and Pattern Recognition · Computer Science 2020-10-06 Saptarshi Sinha , Hiroki Ohashi , Katsuyuki Nakamura

CALF: A Conditionally Adaptive Loss Function to Mitigate Class-Imbalanced Segmentation

Imbalanced datasets pose a considerable challenge in training deep learning (DL) models for medical diagnostics, particularly for segmentation tasks. Imbalance may be associated with annotation quality limited annotated datasets, rare…

Image and Video Processing · Electrical Eng. & Systems 2025-04-08 Bashir Alam , Masa Cirkovic , Mete Harun Akcay , Md Kaf Shahrier , Sebastien Lafond , Hergys Rexha , Kurt Benke , Sepinoud Azimi , Janan Arslan

Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss

Deep learning algorithms can fare poorly when the training dataset suffers from heavy class-imbalance but the testing criterion requires good generalization on less frequent classes. We design two novel methods to improve performance in…

Machine Learning · Computer Science 2019-10-29 Kaidi Cao , Colin Wei , Adrien Gaidon , Nikos Arechiga , Tengyu Ma

Multitask Learning for Class-Imbalanced Discourse Classification

Small class-imbalanced datasets, common in many high-level semantic tasks like discourse analysis, present a particular challenge to current deep-learning architectures. In this work, we perform an extensive analysis on sentence-level…

Computation and Language · Computer Science 2021-01-05 Alexander Spangher , Jonathan May , Sz-rung Shiang , Lingjia Deng

Adjusting Decision Boundary for Class Imbalanced Learning

Training of deep neural networks heavily depends on the data distribution. In particular, the networks easily suffer from class imbalance. The trained networks would recognize the frequent classes better than the infrequent classes. To…

Computer Vision and Pattern Recognition · Computer Science 2020-03-12 Byungju Kim , Junmo Kim

An Empirical Study of the Imbalance Issue in Software Vulnerability Detection

Vulnerability detection is crucial to protect software security. Nowadays, deep learning (DL) is the most promising technique to automate this detection task, leveraging its superior ability to extract patterns and representations within…

Software Engineering · Computer Science 2026-02-13 Yuejun Guo , Qiang Hu , Qiang Tang , Yves Le Traon

Enhancement Encoding: A Novel Imbalanced Classification Approach via Encoding the Training Labels

Class imbalance, which is also called long-tailed distribution, is a common problem in classification tasks based on machine learning. If it happens, the minority data will be overwhelmed by the majority, which presents quite a challenge…

Machine Learning · Computer Science 2023-03-29 Jia-Chen Zhao

Influence-Balanced Loss for Imbalanced Visual Classification

In this paper, we propose a balancing training method to address problems in imbalanced data learning. To this end, we derive a new loss used in the balancing training phase that alleviates the influence of samples that cause an overfitted…

Computer Vision and Pattern Recognition · Computer Science 2021-10-14 Seulki Park , Jongin Lim , Younghan Jeon , Jin Young Choi

AutoBalance: Optimized Loss Functions for Imbalanced Data

Imbalanced datasets are commonplace in modern machine learning problems. The presence of under-represented classes or groups with sensitive attributes results in concerns about generalization and fairness. Such concerns are further…

Machine Learning · Computer Science 2022-01-05 Mingchen Li , Xuechen Zhang , Christos Thrampoulidis , Jiasi Chen , Samet Oymak

High-quality data augmentation for code comment classification

Code comments serve a crucial role in software development for documenting functionality, clarifying design choices, and assisting with issue tracking. They capture developers' insights about the surrounding source code, serving as an…

Software Engineering · Computer Science 2026-01-28 Thomas Borsani , Andrea Rosani , Giuseppe Di Fatta

Review of Methods for Handling Class-Imbalanced in Classification Problems

Learning classifiers using skewed or imbalanced datasets can occasionally lead to classification issues; this is a serious issue. In some cases, one class contains the majority of examples while the other, which is frequently the more…

Machine Learning · Computer Science 2022-11-11 Satyendra Singh Rawat , Amit Kumar Mishra

A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing

Many natural language processing (NLP) tasks are naturally imbalanced, as some target categories occur much more frequently than others in the real world. In such scenarios, current NLP models still tend to perform poorly on less frequent…

Computation and Language · Computer Science 2023-02-23 Sophie Henning , William Beluch , Alexander Fraser , Annemarie Friedrich

Improving the Robustness of Federated Learning for Severely Imbalanced Datasets

With the ever increasing data deluge and the success of deep neural networks, the research of distributed deep learning has become pronounced. Two common approaches to achieve this distributed learning is synchronous and asynchronous weight…

Machine Learning · Computer Science 2022-04-29 Debasrita Chakraborty , Ashish Ghosh

Improving the classification of multi-class imbalanced data is more difficult than its two-class counterpart. In this paper, we use deep neural networks to train new representations of tabular multi-class data. Unlike the typically…

Machine Learning · Computer Science 2023-12-19 Damian Horna , Lango Mateusz , Jerzy Stefanowski

Exploring the Potential of Large Language Models in Fine-Grained Review Comment Classification

Code review is a crucial practice in software development. As code review nowadays is lightweight, various issues can be identified, and sometimes, they can be trivial. Research has investigated automated approaches to classify review…

Software Engineering · Computer Science 2025-08-14 Linh Nguyen , Chunhua Liu , Hong Yi Lin , Patanamon Thongtanunam

Striking the Right Balance: Recall Loss for Semantic Segmentation

Class imbalance is a fundamental problem in computer vision applications such as semantic segmentation. Specifically, uneven class distributions in a training dataset often result in unsatisfactory performance on under-represented classes.…

Computer Vision and Pattern Recognition · Computer Science 2022-02-07 Junjiao Tian , Niluthpol Mithun , Zach Seymour , Han-Pang Chiu , Zsolt Kira

Deep Just-In-Time Inconsistency Detection Between Comments and Source Code

Natural language comments convey key aspects of source code such as implementation, usage, and pre- and post-conditions. Failure to update comments accordingly when the corresponding code is modified introduces inconsistencies, which is…

Software Engineering · Computer Science 2020-12-29 Sheena Panthaplackel , Junyi Jessy Li , Milos Gligoric , Raymond J. Mooney

Balancing the Scales: A Comprehensive Study on Tackling Class Imbalance in Binary Classification

Class imbalance in binary classification tasks remains a significant challenge in machine learning, often resulting in poor performance on minority classes. This study comprehensively evaluates three widely-used strategies for handling…

Machine Learning · Computer Science 2024-10-01 Mohamed Abdelhamid , Abhyuday Desai