Related papers: Automated Imbalanced Learning

When Imbalance Comes Twice: Active Learning under Simulated Class Imbalance and Label Shift in Binary Semantic Segmentation

The aim of Active Learning is to select the most informative samples from an unlabelled set of data. This is useful in cases where the amount of data is large and labelling is expensive, such as in machine vision or medical imaging. Two…

Computer Vision and Pattern Recognition · Computer Science 2026-01-13 Julien Combes, Alexandre Derville, Jean-François Coeurjolly

A Comprehensive Survey on Imbalanced Data Learning

With the expansion of data availability, machine learning (ML) has achieved remarkable breakthroughs in both academia and industry. However, imbalanced data distributions are prevalent in various types of raw data and severely hinder the…

Machine Learning · Computer Science 2025-09-15 Xinyi Gao, Dongting Xie, Yihang Zhang, Zhengren Wang, Chong Chen, Conghui He, Hongzhi Yin, Wentao Zhang

A Survey of Methods for Managing the Classification and Solution of Data Imbalance Problem

The problem of class imbalance is extensive for focusing on numerous applications in the real world. In such a situation, nearly all of the examples are labeled as one class called majority class, while far fewer examples are labeled as the…

Machine Learning · Computer Science 2020-12-23 Khan Md. Hasib, Md. Sadiq Iqbal, Faisal Muhammad Shah, Jubayer Al Mahmud, Mahmudul Hasan Popel, Md. Imran Hossain Showrov, Shakil Ahmed, Obaidur Rahman

A comparison of Deep Learning performances with other machine learning algorithms on credit scoring unbalanced data

Training models on highly unbalanced data is admitted to be a challenging task for machine learning algorithms. Current studies on deep learning mainly focus on data sets with balanced class labels or unbalanced data, but with massive…

Machine Learning · Computer Science 2020-02-27 Louis Marceau, Lingling Qiu, Nick Vandewiele, Eric Charton

An Empirical Analysis of the Efficacy of Different Sampling Techniques for Imbalanced Classification

Learning from imbalanced data is a challenging task. Standard classification algorithms tend to perform poorly when trained on imbalanced data. Some special strategies need to be adopted, either by modifying the data distribution or by…

Machine Learning · Computer Science 2022-08-26 Asif Newaz, Shahriar Hassan, Farhan Shahriyar Haq

Towards Label Imbalance in Multi-label Classification with Many Labels

In multi-label classification, an instance may be associated with a set of labels simultaneously. Recently, the research on multi-label classification has largely shifted its focus to the other end of the spectrum where the number of labels…

Machine Learning · Computer Science 2016-04-06 Li Li, Houfeng Wang

Multi-Label Bayesian Active Learning with Inter-Label Relationships

The primary challenge of multi-label active learning, differing it from multi-class active learning, lies in assessing the informativeness of an indefinite number of labels while also accounting for the inherited label correlation. Existing…

Machine Learning · Computer Science 2025-09-05 Yuanyuan Qi, Jueqing Lu, Xiaohao Yang, Joanne Enticott, Lan Du

A Review of Machine Learning Techniques in Imbalanced Data and Future Trends

For over two decades, detecting rare events has been a challenging task among researchers in the data mining and machine learning domain. Real-life problems inspire researchers to navigate and further improve data processing and algorithmic…

Machine Learning · Computer Science 2025-09-09 Elaheh Jafarigol, Theodore Trafalis, Neshat Mohammadi

Sampling Based Approaches to Handle Imbalances in Network Traffic Dataset for Machine Learning Techniques

Network traffic data is huge, varying and imbalanced because various classes are not equally distributed. Machine learning (ML) algorithms for traffic analysis use the samples from this data to recommend the actions to be taken by the…

Networking and Internet Architecture · Computer Science 2013-11-13 Raman Singh, Harish Kumar, R. K. Singla

Benchmark and Survey of Automated Machine Learning Frameworks

Machine learning (ML) has become a vital part in many aspects of our daily life. However, building well performing machine learning applications requires highly specialized data scientists and domain experts. Automated machine learning…

Machine Learning · Computer Science 2021-01-27 Marc-André Zöller, Marco F. Huber

Balanced Split: A new train-test data splitting strategy for imbalanced datasets

Classification data sets with skewed class proportions are called imbalanced. Class imbalance is a problem since most machine learning classification algorithms are built with an assumption of equal representation of all classes in the…

Machine Learning · Computer Science 2022-12-22 Azal Ahmad Khan

In real-world applications, as data availability increases, obtaining labeled data for machine learning (ML) projects remains challenging due to the high costs and intensive efforts required for data annotation. Many ML projects,…

Machine Learning · Computer Science 2024-12-24 Ismail Hakki Karaman, Gulser Koksal, Levent Eriskin, Salih Salihoglu

Towards Better Performance in Incomplete LDL: Addressing Data Imbalance

Label Distribution Learning (LDL) is a novel machine learning paradigm that addresses the problem of label ambiguity and has found widespread applications. Obtaining complete label distributions in real-world scenarios is challenging, which…

Machine Learning · Computer Science 2024-10-18 Zhiqiang Kou, Haoyuan Xuan, Jing Wang, Yuheng Jia, Xin Geng

Analyzing the Effects of Handling Data Imbalance on Learned Features from Medical Images by Looking Into the Models

One challenging property lurking in medical datasets is the imbalanced data distribution, where the frequency of the samples between the different classes is not balanced. Training a model on an imbalanced dataset can introduce unique…

Image and Video Processing · Electrical Eng. & Systems 2022-04-06 Ashkan Khakzar, Yawei Li, Yang Zhang, Mirac Sanisoglu, Seong Tae Kim, Mina Rezaei, Bernd Bischl, Nassir Navab

A Study imbalance handling by various data sampling methods in binary classification

The purpose of this research report is to present our learning curve and the exposure to the Machine Learning life cycle, with the use of a Kaggle binary classification data set and taking to explore various techniques from…

Machine Learning · Computer Science 2021-05-25 Mohamed Hamama

Automated Machine Learning for Multi-Label Classification

Automated machine learning (AutoML) aims to select and configure machine learning algorithms and combine them into machine learning pipelines tailored to a dataset at hand. For supervised learning tasks, most notably binary and multinomial…

Machine Learning · Computer Science 2024-02-29 Marcel Wever

Resampling strategies for imbalanced regression: a survey and empirical analysis

Imbalanced problems can arise in different real-world situations, and to address this, certain strategies in the form of resampling or balancing algorithms are proposed. This issue has largely been studied in the context of classification,…

Machine Learning · Computer Science 2025-07-17 Juscimara G. Avelino, George D. C. Cavalcanti, Rafael M. O. Cruz

Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools

There has been considerable growth and interest in industrial applications of machine learning (ML) in recent years. ML engineers, as a consequence, are in high demand across the industry, yet improving the efficiency of ML engineers…

Machine Learning · Computer Science 2020-05-05 Anh Truong, Austin Walters, Jeremy Goodsitt, Keegan Hines, C. Bayan Bruss, Reza Farivar

A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing

Many natural language processing (NLP) tasks are naturally imbalanced, as some target categories occur much more frequently than others in the real world. In such scenarios, current NLP models still tend to perform poorly on less frequent…

Computation and Language · Computer Science 2023-02-23 Sophie Henning, William Beluch, Alexander Fraser, Annemarie Friedrich

Review of Methods for Handling Class-Imbalanced in Classification Problems

Learning classifiers using skewed or imbalanced datasets can occasionally lead to classification issues; this is a serious issue. In some cases, one class contains the majority of examples while the other, which is frequently the more…

Machine Learning · Computer Science 2022-11-11 Satyendra Singh Rawat, Amit Kumar Mishra