Related papers: Interactive Label Cleaning with Example-based Expl…

Label Noise-Robust Learning using a Confidence-Based Sieving Strategy

In learning tasks with label noise, improving model robustness against overfitting is a pivotal challenge because the model eventually memorizes labels, including the noisy ones. Identifying the samples with noisy labels and preventing the…

Machine Learning · Computer Science 2023-09-28 Reihaneh Torkzadehmahani , Reza Nasirigerdeh , Daniel Rueckert , Georgios Kaissis

Active label cleaning for improved dataset quality under resource constraints

Imperfections in data annotation, known as label noise, are detrimental to the training of machine learning models and have an often-overlooked confounding effect on the assessment of model performance. Nevertheless, employing experts to…

Computer Vision and Pattern Recognition · Computer Science 2022-04-25 Melanie Bernhardt , Daniel C. Castro , Ryutaro Tanno , Anton Schwaighofer , Kerem C. Tezcan , Miguel Monteiro , Shruthi Bannur , Matthew Lungren , Aditya Nori , Ben Glocker , Javier Alvarez-Valle , Ozan Oktay

Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels

Deep models trained with noisy labels are prone to over-fitting and struggle in generalization. Most existing solutions are based on an ideal assumption that the label noise is class-conditional, i.e., instances of the same class share the…

Computer Vision and Pattern Recognition · Computer Science 2022-08-01 Ganlong Zhao , Guanbin Li , Yipeng Qin , Feng Liu , Yizhou Yu

Learning to Detect Noisy Labels Using Model-Based Features

Label noise is ubiquitous in various machine learning scenarios such as self-labeling with model predictions and erroneous data annotation. Many existing approaches are based on heuristics such as sample losses, which might not be flexible…

Machine Learning · Computer Science 2022-12-29 Zhihao Wang , Zongyu Lin , Peiqi Liu , Guidong ZHeng , Junjie Wen , Xianxin Chen , Yujun Chen , Zhilin Yang

Resolving label uncertainty with implicit posterior models

We propose a method for jointly inferring labels across a collection of data samples, where each sample consists of an observation and a prior belief about the label. By implicitly assuming the existence of a generative model for which a…

Machine Learning · Computer Science 2022-06-22 Esther Rolf , Nikolay Malkin , Alexandros Graikos , Ana Jojic , Caleb Robinson , Nebojsa Jojic

Mitigating Instance-Dependent Label Noise: Integrating Self-Supervised Pretraining with Pseudo-Label Refinement

Deep learning models rely heavily on large volumes of labeled data to achieve high performance. However, real-world datasets often contain noisy labels due to human error, ambiguity, or resource constraints during the annotation process.…

Computer Vision and Pattern Recognition · Computer Science 2024-12-09 Gouranga Bala , Anuj Gupta , Subrat Kumar Behera , Amit Sethi

A Mathematical Programming approach to Binary Supervised Classification with Label Noise

In this paper we propose novel methodologies to construct Support Vector Machine -based classifiers that takes into account that label noises occur in the training sample. We propose different alternatives based on solving Mixed Integer…

Machine Learning · Computer Science 2020-04-22 Víctor Blanco , Alberto Japón , Justo Puerto

CHEF: A Cheap and Fast Pipeline for Iteratively Cleaning Label Uncertainties (Technical Report)

High-quality labels are expensive to obtain for many machine learning tasks, such as medical image classification tasks. Therefore, probabilistic (weak) labels produced by weak supervision tools are used to seed a process in which…

Databases · Computer Science 2021-07-27 Yinjun Wu , James Weimer , Susan B. Davidson

Exploiting Context for Robustness to Label Noise in Active Learning

Several works in computer vision have demonstrated the effectiveness of active learning for adapting the recognition model when new unlabeled data becomes available. Most of these works consider that labels obtained from the annotator are…

Computer Vision and Pattern Recognition · Computer Science 2020-10-20 Sudipta Paul , Shivkumar Chandrasekaran , B. S. Manjunath , Amit K. Roy-Chowdhury

Learning Discriminative Dynamics with Label Corruption for Noisy Label Detection

Label noise, commonly found in real-world datasets, has a detrimental impact on a model's generalization. To effectively detect incorrectly labeled instances, previous works have mostly relied on distinguishable training signals, such as…

Machine Learning · Computer Science 2024-05-31 Suyeon Kim , Dongha Lee , SeongKu Kang , Sukang Chae , Sanghwan Jang , Hwanjo Yu

Class Prototype-based Cleaner for Label Noise Learning

Semi-supervised learning based methods are current SOTA solutions to the noisy-label learning problem, which rely on learning an unsupervised label cleaner first to divide the training samples into a labeled set for clean data and an…

Computer Vision and Pattern Recognition · Computer Science 2022-12-22 Jingjia Huang , Yuanqi Chen , Jiashi Feng , Xinglong Wu

Instance-dependent Label-noise Learning under a Structural Causal Model

Label noise will degenerate the performance of deep learning algorithms because deep neural networks easily overfit label errors. Let X and Y denote the instance and clean label, respectively. When Y is a cause of X, according to which many…

Machine Learning · Statistics 2022-06-06 Yu Yao , Tongliang Liu , Mingming Gong , Bo Han , Gang Niu , Kun Zhang

Towards the Identifiability in Noisy Label Learning: A Multinomial Mixture Modelling Approach

Learning from noisy labels (LNL) is crucial in deep learning, in which one of the approaches is to identify clean-label samples from poorly-annotated datasets. Such an identification is challenging because the conventional LNL problem,…

Machine Learning · Computer Science 2025-09-26 Cuong Nguyen , Thanh-Toan Do , Gustavo Carneiro

Label Denoising through Cross-Model Agreement

Learning from corrupted labels is very common in real-world machine-learning applications. Memorizing such noisy labels could affect the learning of the model, leading to sub-optimal performances. In this work, we propose a novel framework…

Machine Learning · Computer Science 2023-12-20 Yu Wang , Xin Xin , Zaiqiao Meng , Joemon Jose , Fuli Feng

Label Noise Cleaning for Supervised Classification via Bernoulli Random Sampling

Label noise - incorrect labels assigned to observations - can substantially degrade the performance of supervised classifiers. This paper proposes a label noise cleaning method based on Bernoulli random sampling. We show that the mean label…

Methodology · Statistics 2026-03-17 Yuxin Liu , Xiong Jin , Yang Han

Reliable Label Correction is a Good Booster When Learning with Extremely Noisy Labels

Learning with noisy labels has aroused much research interest since data annotations, especially for large-scale datasets, may be inevitably imperfect. Recent approaches resort to a semi-supervised learning problem by dividing training…

Computer Vision and Pattern Recognition · Computer Science 2022-07-20 Kai Wang , Xiangyu Peng , Shuo Yang , Jianfei Yang , Zheng Zhu , Xinchao Wang , Yang You

Label Confusion Learning to Enhance Text Classification Models

Representing a true label as a one-hot vector is a common practice in training text classification models. However, the one-hot representation may not adequately reflect the relation between the instances and labels, as labels are often not…

Computation and Language · Computer Science 2020-12-10 Biyang Guo , Songqiao Han , Xiao Han , Hailiang Huang , Ting Lu

Unlocking the Power of Open Set : A New Perspective for Open-Set Noisy Label Learning

Learning from noisy data has attracted much attention, where most methods focus on closed-set label noise. However, a more common scenario in the real world is the presence of both open-set and closed-set noise. Existing methods typically…

Machine Learning · Computer Science 2024-02-26 Wenhai Wan , Xinrui Wang , Ming-Kun Xie , Shao-Yuan Li , Sheng-Jun Huang , Songcan Chen

Rethinking Consistent Multi-Label Classification Under Inexact Supervision

Partial multi-label learning and complementary multi-label learning are two popular weakly supervised multi-label classification paradigms that aim to alleviate the high annotation costs of collecting precisely annotated multi-label data.…

Machine Learning · Computer Science 2026-02-26 Wei Wang , Tianhao Ma , Ming-Kun Xie , Gang Niu , Masashi Sugiyama

CLIPCleaner: Cleaning Noisy Labels with CLIP

Learning with Noisy labels (LNL) poses a significant challenge for the Machine Learning community. Some of the most widely used approaches that select as clean samples for which the model itself (the in-training model) has high confidence,…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Chen Feng , Georgios Tzimiropoulos , Ioannis Patras