Related papers: Regularization via Structural Label Smoothing

Label Smoothing Improves Neural Source Code Summarization

Label smoothing is a regularization technique for neural networks. Normally neural models are trained to an output distribution that is a vector with a single 1 for the correct prediction, and 0 for all other elements. Label smoothing…

Software Engineering · Computer Science 2023-03-29 Sakib Haque, Aakash Bansal, Collin McMillan

Regularization via Adaptive Pairwise Label Smoothing

Label Smoothing (LS) is an effective regularizer to improve the generalization of state-of-the-art deep models. For each training sample the LS strategy smooths the one-hot encoded training signal by distributing its distribution mass over…

Machine Learning · Computer Science 2020-12-04 Hongyu Guo

Label Smoothing++: Enhanced Label Regularization for Training Neural Networks

Training neural networks with one-hot target labels often results in overconfidence and overfitting. Label smoothing addresses this issue by perturbing the one-hot target labels by adding a uniform probability vector to create a regularized…

Computer Vision and Pattern Recognition · Computer Science 2025-09-09 Sachin Chhabra, Hemanth Venkateswara, Baoxin Li

Adaptive Label Smoothing with Self-Knowledge in Natural Language Generation

Overconfidence has been shown to impair generalization and calibration of a neural network. Previous studies remedy this issue by adding a regularization term to a loss function, preventing a model from making a peaked distribution. Label…

Machine Learning · Computer Science 2022-10-26 Dongkyu Lee, Ka Chun Cheung, Nevin L. Zhang

Midpoint Regularization: from High Uncertainty Training to Conservative Classification

Label Smoothing (LS) improves model generalization through penalizing models from generating overconfident output distributions. For each training sample the LS strategy smooths the one-hot encoded training signal by distributing its…

Machine Learning · Computer Science 2021-06-29 Hongyu Guo

To Smooth or Not? When Label Smoothing Meets Noisy Labels

Label smoothing (LS) is an arising learning paradigm that uses the positively weighted average of both the hard training labels and uniformly distributed soft labels. It was shown that LS serves as a regularizer for training data with hard…

Machine Learning · Computer Science 2022-06-28 Jiaheng Wei, Hangyu Liu, Tongliang Liu, Gang Niu, Masashi Sugiyama, Yang Liu

LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization

Regularization techniques are crucial to improving the generalization performance and training efficiency of deep neural networks. Many deep learning algorithms rely on weight decay, dropout, batch/layer normalization to converge faster and…

Machine Learning · Computer Science 2025-05-23 Peng Lu, Ahmad Rashid, Ivan Kobyzev, Mehdi Rezagholizadeh, Philippe Langlais

Delving Deep into Label Smoothing

Label smoothing is an effective regularization tool for deep neural networks (DNNs), which generates soft labels by applying a weighted average between the uniform distribution and the hard label. It is often used to reduce the overfitting…

Computer Vision and Pattern Recognition · Computer Science 2021-07-23 Chang-Bin Zhang, Peng-Tao Jiang, Qibin Hou, Yunchao Wei, Qi Han, Zhen Li, Ming-Ming Cheng

An Investigation of how Label Smoothing Affects Generalization

It has been hypothesized that label smoothing can reduce overfitting and improve generalization, and current empirical evidence seems to corroborate these effects. However, there is a lack of mathematical understanding of when and why such…

Machine Learning · Computer Science 2020-10-27 Blair Chen, Liu Ziyin, Zihao Wang, Paul Pu Liang

Towards Understanding Why Label Smoothing Degrades Selective Classification and How to Fix It

Label smoothing (LS) is a popular regularisation method for training neural networks as it is effective in improving test accuracy and is simple to implement. "Hard" one-hot labels are "smoothed" by uniformly distributing probability…

Machine Learning · Computer Science 2025-02-21 Guoxuan Xia, Olivier Laurent, Gianni Franchi, Christos-Savvas Bouganis

When Does Label Smoothing Help?

The generalization and learning speed of a multi-class neural network can often be significantly improved by using soft targets that are a weighted average of the hard targets and the uniform distribution over labels. Smoothing the labels…

Machine Learning · Computer Science 2020-06-12 Rafael Müller, Simon Kornblith, Geoffrey Hinton

Generalizing Few Data to Unseen Domains Flexibly Based on Label Smoothing Integrated with Distributionally Robust Optimization

Overfitting commonly occurs when applying deep neural networks (DNNs) on small-scale datasets, where DNNs do not generalize well from existing data to unseen data. The main reason resulting in overfitting is that small-scale datasets cannot…

Machine Learning · Computer Science 2024-08-12 Yangdi Wang, Zhi-Hai Zhang, Su Xiu Xu, Wenming Guo

Regularizing CNNs using Confusion Penalty Based Label Smoothing for Histopathology Images

Deep Learning, particularly Convolutional Neural Networks (CNN), has been successful in computer vision tasks and medical image analysis. However, modern CNNs can be overconfident, making them difficult to deploy in real-world scenarios.…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Somenath Kuiry, Alaka Das, Mita Nasipuri, Nibaran Das

Regularization in network optimization via trimmed stochastic gradient descent with noisy label

Regularization is essential for avoiding over-fitting to training data in network optimization, leading to better generalization of the trained networks. The label noise provides a strong implicit regularization by replacing the target…

Machine Learning · Computer Science 2022-05-04 Kensuke Nakamura, Bong-Soo Sohn, Kyoung-Jae Won, Byung-Woo Hong

An Exploration into why Output Regularization Mitigates Label Noise

Label noise presents a real challenge for supervised learning algorithms. Consequently, mitigating label noise has attracted immense research in recent years. Noise robust losses is one of the more promising approaches for dealing with…

Machine Learning · Computer Science 2021-04-27 Neta Shoham, Tomer Avidor, Nadav Israel

Semantic Label Smoothing for Sequence to Sequence Problems

Label smoothing has been shown to be an effective regularization strategy in classification, that prevents overfitting and helps in label de-noising. However, extending such methods directly to seq2seq settings, such as Machine Translation,…

Computation and Language · Computer Science 2020-10-16 Michal Lukasik, Himanshu Jain, Aditya Krishna Menon, Seungyeon Kim, Srinadh Bhojanapalli, Felix Yu, Sanjiv Kumar

MaxSup: Overcoming Representation Collapse in Label Smoothing

Label Smoothing (LS) is widely adopted to reduce overconfidence in neural network predictions and improve generalization. Despite these benefits, recent studies reveal two critical issues with LS. First, LS induces overconfidence in…

Machine Learning · Computer Science 2026-02-06 Yuxuan Zhou, Heng Li, Zhi-Qi Cheng, Xudong Yan, Yifei Dong, Mario Fritz, Margret Keuper

Consistency Regularization Can Improve Robustness to Label Noise

Consistency regularization is a commonly-used technique for semi-supervised and self-supervised learning. It is an auxiliary objective function that encourages the prediction of the network to be similar in the vicinity of the observed…

Machine Learning · Computer Science 2021-10-05 Erik Englesson, Hossein Azizpour

Towards Understanding Label Smoothing

Label smoothing regularization (LSR) has a great success in training deep neural networks by stochastic algorithms such as stochastic gradient descent and its variants. However, the theoretical understanding of its power from the view of…

Machine Learning · Computer Science 2020-10-06 Yi Xu, Yuanhong Xu, Qi Qian, Hao Li, Rong Jin

Sparse Label Smoothing Regularization for Person Re-Identification

Person re-identification (re-id) is a cross-camera retrieval task which establishes a correspondence between images of a person from multiple cameras. Deep Learning methods have been successfully applied to this problem and have achieved…

Computer Vision and Pattern Recognition · Computer Science 2019-03-06 Jean-Paul Ainam, Ke Qin, Guisong Liu, Guangchun Luo