Related papers: Data Augmentation Imbalance For Imbalanced Attribu…

Pedestrian Attribute Recognition as Label-balanced Multi-label Learning

Rooting in the scarcity of most attributes, realistic pedestrian attribute datasets exhibit unduly skewed data distribution, from which two types of model failures are delivered: (1) label imbalance: model predictions lean greatly towards…

Computer Vision and Pattern Recognition · Computer Science 2024-05-09 Yibo Zhou, Hai-Miao Hu, Yirong Xiang, Xiaokang Zhang, Haotian Wu

Towards Understanding How Data Augmentation Works with Imbalanced Data

Data augmentation forms the cornerstone of many modern machine learning training pipelines; yet, the mechanisms by which it works are not clearly understood. Much of the research on data augmentation (DA) has focused on improving existing…

Machine Learning · Computer Science 2023-04-13 Damien A. Dablain, Nitesh V. Chawla

Building Manufacturing Deep Learning Models with Minimal and Imbalanced Training Data Using Domain Adaptation and Data Augmentation

Deep learning (DL) techniques are highly effective for defect detection from images. Training DL classification models, however, requires vast amounts of labeled data which is often expensive to collect. In many cases, not only the…

Computer Vision and Pattern Recognition · Computer Science 2023-06-02 Adrian Shuai Li, Elisa Bertino, Rih-Teng Wu, Ting-Yan Wu

Imbalanced Classification via Explicit Gradient Learning From Augmented Data

Learning from imbalanced data is one of the most significant challenges in real-world classification tasks. In such cases, neural networks performance is substantially impaired due to preference towards the majority class. Existing…

Machine Learning · Computer Science 2022-11-13 Bronislav Yasinnik, Moshe Salhov, Ofir Lindenbaum, Amir Averbuch

Data Augmentation for Imbalanced Regression

In this work, we consider the problem of imbalanced data in a regression framework when the imbalanced phenomenon concerns continuous or discrete covariates. Such a situation can lead to biases in the estimates. In this case, we propose a…

Machine Learning · Statistics 2023-02-21 Samuel Stocksieker, Denys Pommeret, Arthur Charpentier

Fair for a few: Improving Fairness in Doubly Imbalanced Datasets

Fairness has been identified as an important aspect of Machine Learning and Artificial Intelligence solutions for decision making. Recent literature offers a variety of approaches for debiasing, however many of them fall short when the data…

Machine Learning · Computer Science 2025-06-18 Ata Yalcin, Asli Umay Ozturk, Yigit Sever, Viktoria Pauw, Stephan Hachinger, Ismail Hakki Toroslu, Pinar Karagoz

A review of ensemble learning and data augmentation models for class imbalanced problems: combination, implementation and evaluation

Class imbalance (CI) in classification problems arises when the number of observations belonging to one class is lower than the other. Ensemble learning combines multiple models to obtain a robust model and has been prominently used with…

Machine Learning · Computer Science 2023-11-28 Azal Ahmad Khan, Omkar Chaudhari, Rohitash Chandra

DatasetEquity: Are All Samples Created Equal? In The Quest For Equity Within Datasets

Data imbalance is a well-known issue in the field of machine learning, attributable to the cost of data collection, the difficulty of labeling, and the geographical distribution of the data. In computer vision, bias in data distribution…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Shubham Shrivastava, Xianling Zhang, Sushruth Nagesh, Armin Parchami

Deep Imbalanced Attribute Classification using Visual Attention Aggregation

For many computer vision applications, such as image description and human identification, recognizing the visual attributes of humans is an essential yet challenging problem. Its challenges originate from its multi-label nature, the large…

Computer Vision and Pattern Recognition · Computer Science 2018-07-27 Nikolaos Sarafianos, Xiang Xu, Ioannis A. Kakadiaris

Tackling Multilabel Imbalance through Label Decoupling and Data Resampling Hybridization

The learning from imbalanced data is a deeply studied problem in standard classification and, in recent times, also in multilabel classification. A handful of multilabel resampling methods have been proposed in late years, aiming to balance…

Machine Learning · Computer Science 2018-02-15 Francisco Charte, Antonio J. Rivera, María J. del Jesus, Francisco Herrera

Toward Robustness in Multi-label Classification: A Data Augmentation Strategy against Imbalance and Noise

Multi-label classification poses challenges due to imbalanced and noisy labels in training data. We propose a unified data augmentation method, named BalanceMix, to address these challenges. Our approach includes two samplers for imbalanced…

Machine Learning · Computer Science 2023-12-13 Hwanjun Song, Minseok Kim, Jae-Gil Lee

Credit Card Fraud Detection Using Autoencoder Neural Network

The imbalanced data classification problem has always been a popular topic in machine learning research. To balance the samples between the majority and minority classes, an oversampling algorithm is used to synthesize new minority…

Machine Learning · Computer Science 2019-09-02 Junyi Zou, Jinliang Zhang, Ping Jiang

Dealing with Difficult Minority Labels in Imbalanced Mutilabel Data Sets

Multilabel classification is an emergent data mining task with a broad range of real world applications. Learning from imbalanced multilabel data is being deeply studied latterly, and several resampling methods have been proposed in the…

Machine Learning · Computer Science 2018-02-15 Francisco Charte, Antonio J. Rivera, María J. del Jesus, Francisco Herrera

A Bilevel Optimization Framework for Imbalanced Data Classification

Data rebalancing techniques, including oversampling and undersampling, are a common approach to addressing the challenges of imbalanced data. To tackle unresolved problems related to both oversampling and undersampling, we propose a new…

Machine Learning · Computer Science 2025-07-11 Karen Medlin, Sven Leyffer, Krishnan Raghavan

Weak-supervision for Deep Representation Learning under Class Imbalance

Class imbalance is a pervasive issue among classification models including deep learning, whose capacity to extract task-specific features is affected in imbalanced settings. However, the challenges of handling imbalance among a large…

Machine Learning · Computer Science 2018-10-31 Shin Ando

Imbalanced Data Learning by Minority Class Augmentation using Capsule Adversarial Networks

The fact that image datasets are often imbalanced poses an intense challenge for deep learning techniques. In this paper, we propose a method to restore the balance in imbalanced images, by coalescing two concurrent methods, generative…

Machine Learning · Computer Science 2020-04-09 Pourya Shamsolmoali, Masoumeh Zareapoor, Linlin Shen, Abdul Hamid Sadka, Jie Yang

Synthetic Oversampling of Multi-Label Data based on Local Label Distribution

Class-imbalance is an inherent characteristic of multi-label data which affects the prediction accuracy of most multi-label learning methods. One efficient strategy to deal with this problem is to employ resampling techniques before…

Machine Learning · Computer Science 2021-05-18 Bin Liu, Grigorios Tsoumakas

In real-world applications, as data availability increases, obtaining labeled data for machine learning (ML) projects remains challenging due to the high costs and intensive efforts required for data annotation. Many ML projects,…

Machine Learning · Computer Science 2024-12-24 Ismail Hakki Karaman, Gulser Koksal, Levent Eriskin, Salih Salihoglu

Scalable Data Balancing for Unlabeled Satellite Imagery

Data imbalance is a ubiquitous problem in machine learning. In large scale collected and annotated datasets, data imbalance is either mitigated manually by undersampling frequent classes and oversampling rare classes, or planned for with…

Computer Vision and Pattern Recognition · Computer Science 2021-07-20 Deep Patel, Erin Gao, Anirudh Koul, Siddha Ganju, Meher Anand Kasam

DADA: Deep Adversarial Data Augmentation for Extremely Low Data Regime Classification

Deep learning has revolutionized the performance of classification, but meanwhile demands sufficient labeled data for training. Given insufficient data, while many techniques have been developed to help combat overfitting, the challenge…

Computer Vision and Pattern Recognition · Computer Science 2018-09-05 Xiaofeng Zhang, Zhangyang Wang, Dong Liu, Qing Ling