Related papers: When does loss-based prioritization fail?

Exponentiated Gradient Reweighting for Robust Training Under Label Noise and Beyond

Many learning tasks in machine learning can be viewed as taking a gradient step towards minimizing the average loss of a batch of examples in each training iteration. When noise is prevalent in the data, this uniform treatment of examples…

Machine Learning · Computer Science 2021-04-06 Negin Majidi , Ehsan Amid , Hossein Talebi , Manfred K. Warmuth

Accelerating Training of Deep Neural Networks with a Standardization Loss

A significant advance in accelerating neural network training has been the development of normalization methods, permitting the training of deep models both faster and with better accuracy. These advances come with practical challenges: for…

Machine Learning · Computer Science 2019-03-05 Jasmine Collins , Johannes Balle , Jonathon Shlens

Perception Prioritized Training of Diffusion Models

Diffusion models learn to restore noisy data, which is corrupted with different levels of noise, by optimizing the weighted sum of the corresponding loss terms, i.e., denoising score matching loss. In this paper, we show that restoring data…

Computer Vision and Pattern Recognition · Computer Science 2022-04-04 Jooyoung Choi , Jungbeom Lee , Chaehun Shin , Sungwon Kim , Hyunwoo Kim , Sungroh Yoon

Lazy vs hasty: linearization in deep networks impacts learning schedule based on example difficulty

Among attempts at giving a theoretical account of the success of deep neural networks, a recent line of work has identified a so-called lazy training regime in which the network can be well approximated by its linearization around…

Machine Learning · Computer Science 2024-04-19 Thomas George , Guillaume Lajoie , Aristide Baratin

Which Strategies Matter for Noisy Label Classification? Insight into Loss and Uncertainty

Label noise is a critical factor that degrades the generalization performance of deep neural networks, thus leading to severe issues in real-world problems. Existing studies have employed strategies based on either loss or uncertainty to…

Machine Learning · Computer Science 2020-08-17 Wonyoung Shin , Jung-Woo Ha , Shengzhe Li , Yongwoo Cho , Hoyean Song , Sunyoung Kwon

Sample Selection with Uncertainty of Losses for Learning with Noisy Labels

In learning with noisy labels, the sample selection approach is very popular, which regards small-loss data as correctly labeled during training. However, losses are generated on-the-fly based on the model being trained with noisy labels,…

Machine Learning · Computer Science 2021-06-02 Xiaobo Xia , Tongliang Liu , Bo Han , Mingming Gong , Jun Yu , Gang Niu , Masashi Sugiyama

Robust and On-the-fly Dataset Denoising for Image Classification

Memorization in over-parameterized neural networks could severely hurt generalization in the presence of mislabeled examples. However, mislabeled examples are hard to avoid in extremely large datasets collected with weak supervision. We…

Machine Learning · Computer Science 2020-04-10 Jiaming Song , Lunjia Hu , Michael Auli , Yann Dauphin , Tengyu Ma

Training Neural Networks on Data Sources with Unknown Reliability

When data is generated by multiple sources, conventional training methods update models assuming equal reliability for each source and do not consider their individual data quality. However, in many applications, sources have varied levels…

Machine Learning · Computer Science 2025-02-17 Alexander Capstick , Francesca Palermo , Tianyu Cui , Payam Barnaghi

Learning from Long-Tailed Noisy Data with Sample Selection and Balanced Loss

The success of deep learning depends on large-scale and well-curated training data, while data in real-world applications are commonly long-tailed and noisy. Many methods have been proposed to deal with long-tailed data or noisy data, while…

Machine Learning · Computer Science 2023-05-30 Lefan Zhang , Zhang-Hao Tian , Wujun Zhou , Wei Wang

A Noise-Sensitivity-Analysis-Based Test Prioritization Technique for Deep Neural Networks

Deep neural networks (DNNs) have been widely used in the fields such as natural language processing, computer vision and image recognition. But several studies have been shown that deep neural networks can be easily fooled by artificial…

Computer Vision and Pattern Recognition · Computer Science 2019-01-23 Long Zhang , Xuechao Sun , Yong Li , Zhenyu Zhang

Late Stopping: Avoiding Confidently Learning from Mislabeled Examples

Sample selection is a prevalent method in learning with noisy labels, where small-loss data are typically considered as correctly labeled data. However, this method may not effectively identify clean hard examples with large losses, which…

Machine Learning · Computer Science 2023-08-29 Suqin Yuan , Lei Feng , Tongliang Liu

Learning with Noisy Labels through Learnable Weighting and Centroid Similarity

We introduce a novel method for training machine learning models in the presence of noisy labels, which are prevalent in domains such as medical diagnosis and autonomous driving and have the potential to degrade a model's generalization…

Machine Learning · Computer Science 2024-06-26 Farooq Ahmad Wani , Maria Sofia Bucarelli , Fabrizio Silvestri

Towards Adversarial Training with Moderate Performance Improvement for Neural Network Classification

It has been demonstrated that deep neural networks are prone to noisy examples particular adversarial samples during inference process. The gap between robust deep learning systems in real world applications and vulnerable neural networks…

Machine Learning · Computer Science 2018-07-03 Xinhan Di , Pengqian Yu , Meng Tian

Robustness and Reliability When Training With Noisy Labels

Labelling of data for supervised learning can be costly and time-consuming and the risk of incorporating label noise in large data sets is imminent. When training a flexible discriminative model using a strictly proper loss, such noise will…

Machine Learning · Statistics 2022-05-13 Amanda Olmin , Fredrik Lindsten

Learning to Learn from Noisy Labeled Data

Despite the success of deep neural networks (DNNs) in image classification tasks, the human-level performance relies on massive training data with high-quality manual annotations, which are expensive and time-consuming to collect. There…

Machine Learning · Computer Science 2019-04-15 Junnan Li , Yongkang Wong , Qi Zhao , Mohan Kankanhalli

Tackling Instance-Dependent Label Noise via a Universal Probabilistic Model

The drastic increase of data quantity often brings the severe decrease of data quality, such as incorrect label annotations, which poses a great challenge for robustly training Deep Neural Networks (DNNs). Existing learning \mbox{methods}…

Machine Learning · Computer Science 2022-03-18 Qizhou Wang , Bo Han , Tongliang Liu , Gang Niu , Jian Yang , Chen Gong

Training With Data Dependent Dynamic Learning Rates

Recently many first and second order variants of SGD have been proposed to facilitate training of Deep Neural Networks (DNNs). A common limitation of these works stem from the fact that they use the same learning rate across all instances…

Machine Learning · Computer Science 2021-05-31 Shreyas Saxena , Nidhi Vyas , Dennis DeCoste

On Emergence of Clean-Priority Learning in Early Stopped Neural Networks

When random label noise is added to a training dataset, the prediction error of a neural network on a label-noise-free test dataset initially improves during early training but eventually deteriorates, following a U-shaped dependence on…

Machine Learning · Computer Science 2023-06-06 Chaoyue Liu , Amirhesam Abedsoltan , Mikhail Belkin

Learning to Reweight Examples for Robust Deep Learning

Deep neural networks have been shown to be very powerful modeling tools for many supervised learning tasks involving complex input patterns. However, they can also easily overfit to training set biases and label noises. In addition to…

Machine Learning · Computer Science 2019-05-07 Mengye Ren , Wenyuan Zeng , Bin Yang , Raquel Urtasun

When Does Re-initialization Work?

Re-initializing a neural network during training has been observed to improve generalization in recent works. Yet it is neither widely adopted in deep learning practice nor is it often used in state-of-the-art training protocols. This…

Machine Learning · Computer Science 2023-04-04 Sheheryar Zaidi , Tudor Berariu , Hyunjik Kim , Jörg Bornschein , Claudia Clopath , Yee Whye Teh , Razvan Pascanu