Related papers: Exploiting Class Learnability in Noisy Data

A Survey on Deep Learning with Noisy Labels: How to train your model when you cannot trust on the annotations?

Noisy Labels are commonly present in data sets automatically collected from the internet, mislabeled by non-specialist annotators, or even specialists in a challenging task, such as in the medical field. Although deep learning models have…

Machine Learning · Computer Science 2020-12-08 Filipe R. Cordeiro , Gustavo Carneiro

Learning From Noisy Singly-labeled Data

Supervised learning depends on annotated examples, which are taken to be the \emph{ground truth}. But these labels often come from noisy crowdsourcing platforms, like Amazon Mechanical Turk. Practitioners typically collect multiple labels…

Machine Learning · Computer Science 2018-05-22 Ashish Khetan , Zachary C. Lipton , Anima Anandkumar

Learning to Learn from Noisy Web Videos

Understanding the simultaneously very diverse and intricately fine-grained set of possible human actions is a critical open problem in computer vision. Manually labeling training videos is feasible for some action classes but doesn't scale…

Computer Vision and Pattern Recognition · Computer Science 2017-06-12 Serena Yeung , Vignesh Ramanathan , Olga Russakovsky , Liyue Shen , Greg Mori , Li Fei-Fei

A Theoretical Analysis of Learning with Noisily Labeled Data

Noisy labels are very common in deep supervised learning. Although many studies tend to improve the robustness of deep training for noisy labels, rare works focus on theoretically explaining the training behaviors of learning with noisily…

Machine Learning · Computer Science 2021-04-12 Yi Xu , Qi Qian , Hao Li , Rong Jin

Learning with Noisy Labels through Learnable Weighting and Centroid Similarity

We introduce a novel method for training machine learning models in the presence of noisy labels, which are prevalent in domains such as medical diagnosis and autonomous driving and have the potential to degrade a model's generalization…

Machine Learning · Computer Science 2024-06-26 Farooq Ahmad Wani , Maria Sofia Bucarelli , Fabrizio Silvestri

Multi-Class Classification from Noisy-Similarity-Labeled Data

A similarity label indicates whether two instances belong to the same class while a class label shows the class of the instance. Without class labels, a multi-class classifier could be learned from similarity-labeled pairwise data by meta…

Machine Learning · Computer Science 2020-02-18 Songhua Wu , Xiaobo Xia , Tongliang Liu , Bo Han , Mingming Gong , Nannan Wang , Haifeng Liu , Gang Niu

Unleashing the Potential of Regularization Strategies in Learning with Noisy Labels

In recent years, research on learning with noisy labels has focused on devising novel algorithms that can achieve robustness to noisy training labels while generalizing to clean data. These algorithms often incorporate sophisticated…

Machine Learning · Computer Science 2023-07-12 Hui Kang , Sheng Liu , Huaxi Huang , Jun Yu , Bo Han , Dadong Wang , Tongliang Liu

Learning from Noisy Labels with Deep Neural Networks: A Survey

Deep learning has achieved remarkable success in numerous domains with help from large amounts of big data. However, the quality of data labels is a concern because of the lack of high-quality labels in many real-world scenarios. As noisy…

Machine Learning · Computer Science 2022-03-11 Hwanjun Song , Minseok Kim , Dongmin Park , Yooju Shin , Jae-Gil Lee

Error-Bounded Correction of Noisy Labels

To collect large scale annotated data, it is inevitable to introduce label noise, i.e., incorrect class labels. To be robust against label noise, many successful methods rely on the noisy classifiers (i.e., models trained on the noisy…

Computer Vision and Pattern Recognition · Computer Science 2020-11-23 Songzhu Zheng , Pengxiang Wu , Aman Goswami , Mayank Goswami , Dimitris Metaxas , Chao Chen

Learning with Neighbor Consistency for Noisy Labels

Recent advances in deep learning have relied on large, labelled datasets to train high-capacity models. However, collecting large datasets in a time- and cost-efficient manner often results in label noise. We present a method for learning…

Computer Vision and Pattern Recognition · Computer Science 2022-07-07 Ahmet Iscen , Jack Valmadre , Anurag Arnab , Cordelia Schmid

On information captured by neural networks: connections with memorization and generalization

Despite the popularity and success of deep learning, there is limited understanding of when, how, and why neural networks generalize to unseen examples. Since learning can be seen as extracting information from data, we formally study…

Machine Learning · Computer Science 2023-06-29 Hrayr Harutyunyan

Deep Self-Learning From Noisy Labels

ConvNets achieve good results when training from clean data, but learning from noisy labels significantly degrades performances and remains challenging. Unlike previous works constrained by many conditions, making them infeasible to real…

Computer Vision and Pattern Recognition · Computer Science 2019-08-21 Jiangfan Han , Ping Luo , Xiaogang Wang

Learning from Noisy Similar and Dissimilar Data

With the widespread use of machine learning for classification, it becomes increasingly important to be able to use weaker kinds of supervision for tasks in which it is hard to obtain standard labeled data. One such kind of supervision is…

Machine Learning · Computer Science 2020-02-05 Soham Dan , Han Bao , Masashi Sugiyama

Robust Learning Under Label Noise With Iterative Noise-Filtering

We consider the problem of training a model under the presence of label noise. Current approaches identify samples with potentially incorrect labels and reduce their influence on the learning process by either assigning lower weights to…

Machine Learning · Computer Science 2019-06-04 Duc Tam Nguyen , Thi-Phuong-Nhung Ngo , Zhongyu Lou , Michael Klar , Laura Beggel , Thomas Brox

Learning to Learn from Noisy Labeled Data

Despite the success of deep neural networks (DNNs) in image classification tasks, the human-level performance relies on massive training data with high-quality manual annotations, which are expensive and time-consuming to collect. There…

Machine Learning · Computer Science 2019-04-15 Junnan Li , Yongkang Wong , Qi Zhao , Mohan Kankanhalli

Self-semi-supervised Learning to Learn from NoisyLabeled Data

The remarkable success of today's deep neural networks highly depends on a massive number of correctly labeled data. However, it is rather costly to obtain high-quality human-labeled data, leading to the active research area of training…

Machine Learning · Computer Science 2020-11-04 Jiacheng Wang , Yue Ma , Shuang Gao

Mitigating Memorization in Sample Selection for Learning with Noisy Labels

Because deep learning is vulnerable to noisy labels, sample selection techniques, which train networks with only clean labeled data, have attracted a great attention. However, if the labels are dominantly corrupted by few classes, these…

Machine Learning · Computer Science 2021-07-16 Kyeongbo Kong , Junggi Lee , Youngchul Kwak , Young-Rae Cho , Seong-Eun Kim , Woo-Jin Song

Harmless label noise and informative soft-labels in supervised classification

Manual labelling of training examples is common practice in supervised learning. When the labelling task is of non-trivial difficulty, the supplied labels may not be equal to the ground-truth labels, and label noise is introduced into the…

Machine Learning · Statistics 2021-04-08 Daniel Ahfock , Geoffrey J. McLachlan

Training a Neural Network in a Low-Resource Setting on Automatically Annotated Noisy Data

Manually labeled corpora are expensive to create and often not available for low-resource languages or domains. Automatic labeling approaches are an alternative way to obtain labeled data in a quicker and cheaper way. However, these labels…

Machine Learning · Computer Science 2018-07-24 Michael A. Hedderich , Dietrich Klakow

The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels

Deep neural networks have incredible capacity and expressibility, and can seemingly memorize any training set. This introduces a problem when training in the presence of noisy labels, as the noisy examples cannot be distinguished from clean…

Machine Learning · Computer Science 2022-10-04 Daniel Shwartz , Uri Stern , Daphna Weinshall