English
Related papers

Related papers: Data Augmentation Imbalance For Imbalanced Attribu…

200 papers

Rooting in the scarcity of most attributes, realistic pedestrian attribute datasets exhibit unduly skewed data distribution, from which two types of model failures are delivered: (1) label imbalance: model predictions lean greatly towards…

Computer Vision and Pattern Recognition · Computer Science 2024-05-09 Yibo Zhou , Hai-Miao Hu , Yirong Xiang , Xiaokang Zhang , Haotian Wu

Data augmentation forms the cornerstone of many modern machine learning training pipelines; yet, the mechanisms by which it works are not clearly understood. Much of the research on data augmentation (DA) has focused on improving existing…

Machine Learning · Computer Science 2023-04-13 Damien A. Dablain , Nitesh V. Chawla

Deep learning (DL) techniques are highly effective for defect detection from images. Training DL classification models, however, requires vast amounts of labeled data which is often expensive to collect. In many cases, not only the…

Computer Vision and Pattern Recognition · Computer Science 2023-06-02 Adrian Shuai Li , Elisa Bertino , Rih-Teng Wu , Ting-Yan Wu

Learning from imbalanced data is one of the most significant challenges in real-world classification tasks. In such cases, neural networks performance is substantially impaired due to preference towards the majority class. Existing…

Machine Learning · Computer Science 2022-11-13 Bronislav Yasinnik , Moshe Salhov , Ofir Lindenbaum , Amir Averbuch

In this work, we consider the problem of imbalanced data in a regression framework when the imbalanced phenomenon concerns continuous or discrete covariates. Such a situation can lead to biases in the estimates. In this case, we propose a…

Machine Learning · Statistics 2023-02-21 Samuel Stocksieker , Denys Pommeret , Arthur Charpentier

Fairness has been identified as an important aspect of Machine Learning and Artificial Intelligence solutions for decision making. Recent literature offers a variety of approaches for debiasing, however many of them fall short when the data…

Machine Learning · Computer Science 2025-06-18 Ata Yalcin , Asli Umay Ozturk , Yigit Sever , Viktoria Pauw , Stephan Hachinger , Ismail Hakki Toroslu , Pinar Karagoz

Class imbalance (CI) in classification problems arises when the number of observations belonging to one class is lower than the other. Ensemble learning combines multiple models to obtain a robust model and has been prominently used with…

Machine Learning · Computer Science 2023-11-28 Azal Ahmad Khan , Omkar Chaudhari , Rohitash Chandra

Data imbalance is a well-known issue in the field of machine learning, attributable to the cost of data collection, the difficulty of labeling, and the geographical distribution of the data. In computer vision, bias in data distribution…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Shubham Shrivastava , Xianling Zhang , Sushruth Nagesh , Armin Parchami

For many computer vision applications, such as image description and human identification, recognizing the visual attributes of humans is an essential yet challenging problem. Its challenges originate from its multi-label nature, the large…

Computer Vision and Pattern Recognition · Computer Science 2018-07-27 Nikolaos Sarafianos , Xiang Xu , Ioannis A. Kakadiaris

The learning from imbalanced data is a deeply studied problem in standard classification and, in recent times, also in multilabel classification. A handful of multilabel resampling methods have been proposed in late years, aiming to balance…

Machine Learning · Computer Science 2018-02-15 Francisco Charte , Antonio J. Rivera , María J. del Jesus , Francisco Herrera

Multi-label classification poses challenges due to imbalanced and noisy labels in training data. We propose a unified data augmentation method, named BalanceMix, to address these challenges. Our approach includes two samplers for imbalanced…

Machine Learning · Computer Science 2023-12-13 Hwanjun Song , Minseok Kim , Jae-Gil Lee

Imbalanced data classification problem has always been a popular topic in the field of machine learning research. In order to balance the samples between majority and minority class. Oversampling algorithm is used to synthesize new minority…

Machine Learning · Computer Science 2019-09-02 Junyi Zou , Jinliang Zhang , Ping Jiang

Multilabel classification is an emergent data mining task with a broad range of real world applications. Learning from imbalanced multilabel data is being deeply studied latterly, and several resampling methods have been proposed in the…

Machine Learning · Computer Science 2018-02-15 Francisco Charte , Antonio J. Rivera , María J. del Jesus , Francisco Herrera

Data rebalancing techniques, including oversampling and undersampling, are a common approach to addressing the challenges of imbalanced data. To tackle unresolved problems related to both oversampling and undersampling, we propose a new…

Machine Learning · Computer Science 2025-07-11 Karen Medlin , Sven Leyffer , Krishnan Raghavan

Class imbalance is a pervasive issue among classification models including deep learning, whose capacity to extract task-specific features is affected in imbalanced settings. However, the challenges of handling imbalance among a large…

Machine Learning · Computer Science 2018-10-31 Shin Ando

The fact that image datasets are often imbalanced poses an intense challenge for deep learning techniques. In this paper, we propose a method to restore the balance in imbalanced images, by coalescing two concurrent methods, generative…

Machine Learning · Computer Science 2020-04-09 Pourya Shamsolmoali , Masoumeh Zareapoor , Linlin Shen , Abdul Hamid Sadka , Jie Yang

Class-imbalance is an inherent characteristic of multi-label data which affects the prediction accuracy of most multi-label learning methods. One efficient strategy to deal with this problem is to employ resampling techniques before…

Machine Learning · Computer Science 2021-05-18 Bin Liu , Grigorios Tsoumakas

In real-world applications, as data availability increases, obtaining labeled data for machine learning (ML) projects remains challenging due to the high costs and intensive efforts required for data annotation. Many ML projects,…

Machine Learning · Computer Science 2024-12-24 Ismail Hakki Karaman , Gulser Koksal , Levent Eriskin , Salih Salihoglu

Data imbalance is a ubiquitous problem in machine learning. In large scale collected and annotated datasets, data imbalance is either mitigated manually by undersampling frequent classes and oversampling rare classes, or planned for with…

Computer Vision and Pattern Recognition · Computer Science 2021-07-20 Deep Patel , Erin Gao , Anirudh Koul , Siddha Ganju , Meher Anand Kasam

Deep learning has revolutionized the performance of classification, but meanwhile demands sufficient labeled data for training. Given insufficient data, while many techniques have been developed to help combat overfitting, the challenge…

Computer Vision and Pattern Recognition · Computer Science 2018-09-05 Xiaofeng Zhang , Zhangyang Wang , Dong Liu , Qing Ling
‹ Prev 1 2 3 10 Next ›