Related papers: Generalization by Recognizing Confusion

Self-Adaptive Training: Bridging Supervised and Self-Supervised Learning

We propose self-adaptive training -- a unified training algorithm that dynamically calibrates and enhances training processes by model predictions without incurring an extra computational cost -- to advance both supervised and…

Machine Learning · Computer Science 2022-10-17 Lang Huang , Chao Zhang , Hongyang Zhang

Synergistic Network Learning and Label Correction for Noise-robust Image Classification

Large training datasets almost always contain examples with inaccurate or incorrect labels. Deep Neural Networks (DNNs) tend to overfit training label noise, resulting in poorer model performance in practice. To address this problem, we…

Computer Vision and Pattern Recognition · Computer Science 2022-03-01 Chen Gong , Kong Bin , Eric J. Seibel , Xin Wang , Youbing Yin , Qi Song

Self-Adaptive Training: beyond Empirical Risk Minimization

We propose self-adaptive training---a new training algorithm that dynamically corrects problematic training labels by model predictions without incurring extra computational cost---to improve generalization of deep learning for potentially…

Machine Learning · Computer Science 2020-10-01 Lang Huang , Chao Zhang , Hongyang Zhang

Learning with Neighbor Consistency for Noisy Labels

Recent advances in deep learning have relied on large, labelled datasets to train high-capacity models. However, collecting large datasets in a time- and cost-efficient manner often results in label noise. We present a method for learning…

Computer Vision and Pattern Recognition · Computer Science 2022-07-07 Ahmet Iscen , Jack Valmadre , Anurag Arnab , Cordelia Schmid

Understanding deep learning requires rethinking generalization

Despite their massive size, successful deep artificial neural networks can exhibit a remarkably small difference between training and test performance. Conventional wisdom attributes small generalization error either to properties of the…

Machine Learning · Computer Science 2017-02-28 Chiyuan Zhang , Samy Bengio , Moritz Hardt , Benjamin Recht , Oriol Vinyals

Foster Adaptivity and Balance in Learning with Noisy Labels

Label noise is ubiquitous in real-world scenarios, posing a practical challenge to supervised models due to its effect in hurting the generalization performance of deep neural networks. Existing methods primarily employ the sample selection…

Computer Vision and Pattern Recognition · Computer Science 2024-07-04 Mengmeng Sheng , Zeren Sun , Tao Chen , Shuchao Pang , Yucheng Wang , Yazhou Yao

Exploiting Class Learnability in Noisy Data

In many domains, collecting sufficient labeled training data for supervised machine learning requires easily accessible but noisy sources, such as crowdsourcing services or tagged Web data. Noisy labels occur frequently in data sets…

Machine Learning · Computer Science 2018-11-16 Matthew Klawonn , Eric Heim , James Hendler

On information captured by neural networks: connections with memorization and generalization

Despite the popularity and success of deep learning, there is limited understanding of when, how, and why neural networks generalize to unseen examples. Since learning can be seen as extracting information from data, we formally study…

Machine Learning · Computer Science 2023-06-29 Hrayr Harutyunyan

Mitigating Label Noise through Data Ambiguation

Label noise poses an important challenge in machine learning, especially in deep learning, in which large models with high expressive power dominate the field. Models of that kind are prone to memorizing incorrect labels, thereby harming…

Machine Learning · Computer Science 2024-01-26 Julian Lienen , Eyke Hüllermeier

Making Deep Neural Networks Robust to Label Noise: a Loss Correction Approach

We present a theoretically grounded approach to train deep neural networks, including recurrent networks, subject to class-dependent label noise. We propose two procedures for loss correction that are agnostic to both application domain and…

Machine Learning · Statistics 2017-03-23 Giorgio Patrini , Alessandro Rozza , Aditya Menon , Richard Nock , Lizhen Qu

Adaptive Regularization of Labels

Recently, a variety of regularization techniques have been widely applied in deep neural networks, such as dropout, batch normalization, data augmentation, and so on. These methods mainly focus on the regularization of weight parameters to…

Machine Learning · Computer Science 2019-08-16 Qianggang Ding , Sifan Wu , Hao Sun , Jiadong Guo , Shu-Tao Xia

How does Disagreement Help Generalization against Label Corruption?

Learning with noisy labels is one of the hottest problems in weakly-supervised learning. Based on memorization effects of deep neural networks, training on small-loss instances becomes very promising for handling noisy labels. This fosters…

Machine Learning · Computer Science 2019-05-14 Xingrui Yu , Bo Han , Jiangchao Yao , Gang Niu , Ivor W. Tsang , Masashi Sugiyama

Learning Deep Networks from Noisy Labels with Dropout Regularization

Large datasets often have unreliable labels-such as those obtained from Amazon's Mechanical Turk or social media platforms-and classifiers trained on mislabeled datasets often exhibit poor performance. We present a simple, effective…

Computer Vision and Pattern Recognition · Computer Science 2017-05-10 Ishan Jindal , Matthew Nokleby , Xuewen Chen

Robust Training under Label Noise by Over-parameterization

Recently, over-parameterized deep networks, with increasingly more network parameters than training samples, have dominated the performances of modern machine learning. However, when the training data is corrupted, it has been well-known…

Machine Learning · Computer Science 2022-08-04 Sheng Liu , Zhihui Zhu , Qing Qu , Chong You

Robust Feature Learning Against Noisy Labels

Supervised learning of deep neural networks heavily relies on large-scale datasets annotated by high-quality labels. In contrast, mislabeled samples can significantly degrade the generalization of models and result in memorizing samples,…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Tsung-Ming Tai , Yun-Jie Jhang , Wen-Jyi Hwang

Consistency Regularization Can Improve Robustness to Label Noise

Consistency regularization is a commonly-used technique for semi-supervised and self-supervised learning. It is an auxiliary objective function that encourages the prediction of the network to be similar in the vicinity of the observed…

Machine Learning · Computer Science 2021-10-05 Erik Englesson , Hossein Azizpour

A Simple yet Effective Baseline for Robust Deep Learning with Noisy Labels

Recently deep neural networks have shown their capacity to memorize training data, even with noisy labels, which hurts generalization performance. To mitigate this issue, we provide a simple but effective baseline method that is robust to…

Machine Learning · Computer Science 2019-09-30 Yucen Luo , Jun Zhu , Tomas Pfister

Simple and Effective Regularization Methods for Training on Noisily Labeled Data with Generalization Guarantee

Over-parameterized deep neural networks trained by simple first-order methods are known to be able to fit any labeling of data. Such over-fitting ability hinders generalization when mislabeled training examples are present. On the other…

Machine Learning · Computer Science 2020-10-06 Wei Hu , Zhiyuan Li , Dingli Yu

Image Classification with Deep Learning in the Presence of Noisy Labels: A Survey

Image classification systems recently made a giant leap with the advancement of deep neural networks. However, these systems require an excessive amount of labeled data to be adequately trained. Gathering a correctly annotated dataset is…

Machine Learning · Computer Science 2021-01-19 Görkem Algan , Ilkay Ulusoy

Decoding Generalization from Memorization in Deep Neural Networks

Overparameterized deep networks that generalize well have been key to the dramatic success of deep learning in recent years. The reasons for their remarkable ability to generalize are not well understood yet. When class labels in the…

Machine Learning · Computer Science 2026-02-03 Simran Ketha , Venkatakrishnan Ramaswamy