Related papers: Binary Classification with Instance and Label Depe…

Robust Learning Under Label Noise With Iterative Noise-Filtering

We consider the problem of training a model under the presence of label noise. Current approaches identify samples with potentially incorrect labels and reduce their influence on the learning process by either assigning lower weights to…

Machine Learning · Computer Science 2019-06-04 Duc Tam Nguyen , Thi-Phuong-Nhung Ngo , Zhongyu Lou , Michael Klar , Laura Beggel , Thomas Brox

Classification with unknown class-conditional label noise on non-compact feature spaces

We investigate the problem of classification in the presence of unknown class-conditional label noise in which the labels observed by the learner have been corrupted with some unknown class dependent probability. In order to obtain finite…

Machine Learning · Statistics 2019-06-11 Henry W J Reeve , Ata Kaban

A Second-Order Approach to Learning with Instance-Dependent Label Noise

The presence of label noise often misleads the training of deep neural networks. Departing from the recent literature which largely assumes the label noise rate is only determined by the true label class, the errors in human-annotated…

Machine Learning · Computer Science 2021-03-31 Zhaowei Zhu , Tongliang Liu , Yang Liu

The Exploration of Error Bounds in Classification with Noisy Labels

Numerous studies have shown that label noise can lead to poor generalization performance, negatively affecting classification accuracy. Therefore, understanding the effectiveness of classifiers trained using deep neural networks in the…

Machine Learning · Computer Science 2026-03-10 Haixia Liu , Boxiao Li , Can Yang , Yang Wang

Learning from Binary Labels with Instance-Dependent Corruption

Suppose we have a sample of instances paired with binary labels corrupted by arbitrary instance- and label-dependent noise. With sufficiently many such samples, can we optimally classify and rank instances with respect to the noise-free…

Machine Learning · Computer Science 2016-05-05 Aditya Krishna Menon , Brendan van Rooyen , Nagarajan Natarajan

Leveraging an Alignment Set in Tackling Instance-Dependent Label Noise

Noisy training labels can hurt model performance. Most approaches that aim to address label noise assume label noise is independent from the input features. In practice, however, label noise is often feature or \textit{instance-dependent},…

Machine Learning · Computer Science 2023-07-12 Donna Tjandra , Jenna Wiens

Towards Robustness to Label Noise in Text Classification via Noise Modeling

Large datasets in NLP suffer from noisy labels, due to erroneous automatic and human annotation procedures. We study the problem of text classification with label noise, and aim to capture this noise through an auxiliary noise model over…

Computation and Language · Computer Science 2022-06-22 Siddhant Garg , Goutham Ramakrishnan , Varun Thumbe

Tackling Instance-Dependent Label Noise via a Universal Probabilistic Model

The drastic increase of data quantity often brings the severe decrease of data quality, such as incorrect label annotations, which poses a great challenge for robustly training Deep Neural Networks (DNNs). Existing learning \mbox{methods}…

Machine Learning · Computer Science 2022-03-18 Qizhou Wang , Bo Han , Tongliang Liu , Gang Niu , Jian Yang , Chen Gong

Classification with Noisy Labels by Importance Reweighting

In this paper, we study a classification problem in which sample labels are randomly corrupted. In this scenario, there is an unobservable sample with noise-free labels. However, before being observed, the true labels are independently…

Machine Learning · Statistics 2015-07-21 Tongliang Liu , Dacheng Tao

An Instance-Dependent Simulation Framework for Learning with Label Noise

We propose a simulation framework for generating instance-dependent noisy labels via a pseudo-labeling paradigm. We show that the distribution of the synthetic noisy labels generated with our framework is closer to human labels compared to…

Machine Learning · Computer Science 2021-10-19 Keren Gu , Xander Masotto , Vandana Bachani , Balaji Lakshminarayanan , Jack Nikodem , Dong Yin

Risk bounds for PU learning under Selected At Random assumption

Positive-unlabeled learning (PU learning) is known as a special case of semi-supervised binary classification where only a fraction of positive examples are labeled. The challenge is then to find the correct classifier despite this lack of…

Statistics Theory · Mathematics 2022-01-19 Olivier Coudray , Christine Keribin , Pascal Massart , Patrick Pamphile

Robustness and Reliability When Training With Noisy Labels

Labelling of data for supervised learning can be costly and time-consuming and the risk of incorporating label noise in large data sets is imminent. When training a flexible discriminative model using a strictly proper loss, such noise will…

Machine Learning · Statistics 2022-05-13 Amanda Olmin , Fredrik Lindsten

Rethinking the Value of Labels for Instance-Dependent Label Noise Learning

Label noise widely exists in large-scale datasets and significantly degenerates the performances of deep learning algorithms. Due to the non-identifiability of the instance-dependent noise transition matrix, most existing algorithms address…

Machine Learning · Computer Science 2023-05-16 Hanwen Deng , Weijia Zhang , Min-Ling Zhang

Classification with Asymmetric Label Noise: Consistency and Maximal Denoising

In many real-world classification problems, the labels of training examples are randomly corrupted. Most previous theoretical work on classification with label noise assumes that the two classes are separable, that the label noise is…

Machine Learning · Statistics 2016-08-08 Gilles Blanchard , Marek Flaska , Gregory Handy , Sara Pozzi , Clayton Scott

Noise Tolerance under Risk Minimization

In this paper we explore noise tolerant learning of classifiers. We formulate the problem as follows. We assume that there is an ${\bf unobservable}$ training set which is noise-free. The actual training set given to the learning algorithm…

Machine Learning · Computer Science 2013-11-27 Naresh Manwani , P. S. Sastry

Noise tolerance of learning to rank under class-conditional label noise

Often, the data used to train ranking models is subject to label noise. For example, in web-search, labels created from clickstream data are noisy due to issues such as insufficient information in item descriptions on the SERP, query…

Information Retrieval · Computer Science 2022-08-18 Dany Haddad

Harmless label noise and informative soft-labels in supervised classification

Manual labelling of training examples is common practice in supervised learning. When the labelling task is of non-trivial difficulty, the supplied labels may not be equal to the ground-truth labels, and label noise is introduced into the…

Machine Learning · Statistics 2021-04-08 Daniel Ahfock , Geoffrey J. McLachlan

Limited Gradient Descent: Learning With Noisy Labels

Label noise may affect the generalization of classifiers, and the effective learning of main patterns from samples with noisy labels is an important challenge. Recent studies have shown that deep neural networks tend to prioritize the…

Machine Learning · Computer Science 2019-12-06 Yi Sun , Yan Tian , Yiping Xu , Jianxiang Li

Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels

Deep models trained with noisy labels are prone to over-fitting and struggle in generalization. Most existing solutions are based on an ideal assumption that the label noise is class-conditional, i.e., instances of the same class share the…

Computer Vision and Pattern Recognition · Computer Science 2022-08-01 Ganlong Zhao , Guanbin Li , Yipeng Qin , Feng Liu , Yizhou Yu

Rethinking Noisy Label Models: Labeler-Dependent Noise with Adversarial Awareness

Most studies on learning from noisy labels rely on unrealistic models of i.i.d. label noise, such as class-conditional transition matrices. More recent work on instance-dependent noise models are more realistic, but assume a single…

Machine Learning · Computer Science 2021-06-10 Glenn Dawson , Robi Polikar