Related papers: Multi Instance Learning For Unbalanced Data

Bridging the Gap: Simultaneous Fine Tuning for Data Re-Balancing

There are many real-world classification problems wherein the issue of data imbalance (the case when a data set contains substantially more samples for one/many classes than the rest) is unavoidable. While under-sampling the problematic…

Computer Vision and Pattern Recognition · Computer Science 2018-01-09 John McKay , Isaac Gerg , Vishal Monga

Review of Methods for Handling Class-Imbalanced in Classification Problems

Learning classifiers using skewed or imbalanced datasets can occasionally lead to classification issues; this is a serious issue. In some cases, one class contains the majority of examples while the other, which is frequently the more…

Machine Learning · Computer Science 2022-11-11 Satyendra Singh Rawat , Amit Kumar Mishra

Self-paced Ensemble for Highly Imbalanced Massive Data Classification

Many real-world applications reveal difficulties in learning classifiers from imbalanced data. The rising big data era has been witnessing more classification tasks with large-scale but extremely imbalance and low-quality datasets. Most of…

Machine Learning · Computer Science 2020-10-20 Zhining Liu , Wei Cao , Zhifeng Gao , Jiang Bian , Hechang Chen , Yi Chang , Tie-Yan Liu

Multi-Instance Learning by Treating Instances As Non-I.I.D. Samples

Multi-instance learning attempts to learn from a training set consisting of labeled bags each containing many unlabeled instances. Previous studies typically treat the instances in the bags as independently and identically distributed.…

Machine Learning · Computer Science 2009-05-13 Zhi-Hua Zhou , Yu-Yin Sun , Yu-Feng Li

SC-MIL: Supervised Contrastive Multiple Instance Learning for Imbalanced Classification in Pathology

Multiple Instance learning (MIL) models have been extensively used in pathology to predict biomarkers and risk-stratify patients from gigapixel-sized images. Machine learning problems in medical imaging often deal with rare diseases, making…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Dinkar Juyal , Siddhant Shingi , Syed Ashar Javed , Harshith Padigela , Chintan Shah , Anand Sampat , Archit Khosla , John Abel , Amaro Taylor-Weiner

Classifying extremely imbalanced data sets

Imbalanced data sets containing much more background than signal instances are very common in particle physics, and will also be characteristic for the upcoming analyses of LHC data. Following up the work presented at ACAT 2008, we use the…

Data Analysis, Statistics and Probability · Physics 2011-08-11 Markward Britsch , Nikolai Gagunashvili , Michael Schmelling

A multi-instance learning algorithm based on a stacked ensemble of lazy learners

This document describes a novel learning algorithm that classifies "bags" of instances rather than individual instances. A bag is labeled positive if it contains at least one positive instance (which may or may not be specifically…

Machine Learning · Computer Science 2014-07-11 Ramasubramanian Sundararajan , Hima Patel , Manisha Srivastava

A Comparative Study of Calibration Methods for Imbalanced Class Incremental Learning

Deep learning approaches are successful in a wide range of AI problems and in particular for visual recognition tasks. However, there are still open problems among which is the capacity to handle streams of visual information and the…

Machine Learning · Computer Science 2022-02-02 Umang Aggarwal , Adrian Popescu , Eden Belouadah , Céline Hudelot

Imbalanced Classification via Explicit Gradient Learning From Augmented Data

Learning from imbalanced data is one of the most significant challenges in real-world classification tasks. In such cases, neural networks performance is substantially impaired due to preference towards the majority class. Existing…

Machine Learning · Computer Science 2022-11-13 Bronislav Yasinnik , Moshe Salhov , Ofir Lindenbaum , Amir Averbuch

Positive and Unlabeled Learning through Negative Selection and Imbalance-aware Classification

Motivated by applications in protein function prediction, we consider a challenging supervised classification setting in which positive labels are scarce and there are no explicit negative labels. The learning algorithm must thus select…

Machine Learning · Computer Science 2019-01-28 Marco Frasca , Nicolò Cesa-Bianchi

A comparison of Deep Learning performances with other machine learning algorithms on credit scoring unbalanced data

Training models on highly unbalanced data is admitted to be a challenging task for machine learning algorithms. Current studies on deep learning mainly focus on data sets with balanced class labels or unbalanced data, but with massive…

Machine Learning · Computer Science 2020-02-27 Louis Marceau , Lingling Qiu , Nick Vandewiele , Eric Charton

Deep Learning Meets Oversampling: A Learning Framework to Handle Imbalanced Classification

Despite extensive research spanning several decades, class imbalance is still considered a profound difficulty for both machine learning and deep learning models. While data oversampling is the foremost technique to address this issue,…

Machine Learning · Computer Science 2025-02-12 Sukumar Kishanthan , Asela Hevapathige

Box Drawings for Learning with Imbalanced Data

The vast majority of real world classification problems are imbalanced, meaning there are far fewer data from the class of interest (the positive class) than from other classes. We propose two machine learning algorithms to handle highly…

Machine Learning · Statistics 2014-06-10 Siong Thye Goh , Cynthia Rudin

Learning From Long-Tailed Data With Noisy Labels

Class imbalance and noisy labels are the norm rather than the exception in many large-scale classification datasets. Nevertheless, most works in machine learning typically assume balanced and clean data. There have been some recent attempts…

Computer Vision and Pattern Recognition · Computer Science 2021-09-14 Shyamgopal Karthik , Jérome Revaud , Boris Chidlovskii

An Empirical Analysis of the Efficacy of Different Sampling Techniques for Imbalanced Classification

Learning from imbalanced data is a challenging task. Standard classification algorithms tend to perform poorly when trained on imbalanced data. Some special strategies need to be adopted, either by modifying the data distribution or by…

Machine Learning · Computer Science 2022-08-26 Asif Newaz , Shahriar Hassan , Farhan Shahriyar Haq

Class-Imbalanced Semi-Supervised Learning

Semi-Supervised Learning (SSL) has achieved great success in overcoming the difficulties of labeling and making full use of unlabeled data. However, SSL has a limited assumption that the numbers of samples in different classes are balanced,…

Machine Learning · Computer Science 2020-02-18 Minsung Hyun , Jisoo Jeong , Nojun Kwak

Cross-Patient Pseudo Bags Generation and Curriculum Contrastive Learning for Imbalanced Multiclassification of Whole Slide Image

Pathology computing has dramatically improved pathologists' workflow and diagnostic decision-making processes. Although computer-aided diagnostic systems have shown considerable value in whole slide image (WSI) analysis, the problem of…

Computer Vision and Pattern Recognition · Computer Science 2024-11-19 Yonghuang Wu , Xuan Xie , Xinyuan Niu , Chengqian Zhao , Jinhua Yu

Evaluation of Multi- and Single-objective Learning Algorithms for Imbalanced Data

Many machine learning tasks aim to find models that work well not for a single, but for a group of criteria, often opposing ones. One such example is imbalanced data classification, where, on the one hand, we want to achieve the best…

Machine Learning · Computer Science 2025-11-18 Szymon Wojciechowski , Michał Woźniak

Deep Reinforcement Learning for Multi-class Imbalanced Training

With the rapid growth of memory and computing power, datasets are becoming increasingly complex and imbalanced. This is especially severe in the context of clinical data, where there may be one rare event for many cases in the majority…

Machine Learning · Computer Science 2022-05-25 Jenny Yang , Rasheed El-Bouri , Odhran O'Donoghue , Alexander S. Lachapelle , Andrew A. S. Soltan , David A. Clifton

Cost Sensitive Learning of Deep Feature Representations from Imbalanced Data

Class imbalance is a common problem in the case of real-world object detection and classification tasks. Data of some classes is abundant making them an over-represented majority, and data of other classes is scarce, making them an…

Computer Vision and Pattern Recognition · Computer Science 2017-03-24 Salman H. Khan , Munawar Hayat , Mohammed Bennamoun , Ferdous Sohel , Roberto Togneri