English
Related papers

Related papers: When Does Label Smoothing Help?

200 papers

Label smoothing is widely used in deep neural networks for multi-class classification. While it enhances model generalization and reduces overconfidence by aiming to lower the probability for the predicted class, it distorts the predicted…

Machine Learning · Computer Science 2021-10-12 Mohamed Maher , Meelis Kull

Training neural networks with one-hot target labels often results in overconfidence and overfitting. Label smoothing addresses this issue by perturbing the one-hot target labels by adding a uniform probability vector to create a regularized…

Computer Vision and Pattern Recognition · Computer Science 2025-09-09 Sachin Chhabra , Hemanth Venkateswara , Baoxin Li

Label smoothing is commonly used in training deep learning models, wherein one-hot training labels are mixed with uniform label vectors. Empirically, smoothing has been shown to improve both predictive performance and model calibration. In…

Machine Learning · Computer Science 2020-03-06 Michal Lukasik , Srinadh Bhojanapalli , Aditya Krishna Menon , Sanjiv Kumar

It has been hypothesized that label smoothing can reduce overfitting and improve generalization, and current empirical evidence seems to corroborate these effects. However, there is a lack of mathematical understanding of when and why such…

Machine Learning · Computer Science 2020-10-27 Blair Chen , Liu Ziyin , Zihao Wang , Paul Pu Liang

Overconfidence has been shown to impair generalization and calibration of a neural network. Previous studies remedy this issue by adding a regularization term to a loss function, preventing a model from making a peaked distribution. Label…

Machine Learning · Computer Science 2022-10-26 Dongkyu Lee , Ka Chun Cheung , Nevin L. Zhang

Label smoothing is a regularization technique for neural networks. Normally neural models are trained to an output distribution that is a vector with a single 1 for the correct prediction, and 0 for all other elements. Label smoothing…

Software Engineering · Computer Science 2023-03-29 Sakib Haque , Aakash Bansal , Collin McMillan

Label smoothing is a widely used technique in various domains, such as text classification, image classification and speech recognition, known for effectively combating model overfitting. However, there is little fine-grained analysis on…

Computation and Language · Computer Science 2024-02-26 Yijie Gao , Shijing Si , Hua Luo , Haixia Sun , Yugui Zhang

Generating confidence calibrated outputs is of utmost importance for the applications of deep neural networks in safety-critical decision-making systems. The output of a neural network is a probability distribution where the scores are…

Machine Learning · Computer Science 2021-09-17 Chihuang Liu , Joseph JaJa

Label smoothing (LS) is an arising learning paradigm that uses the positively weighted average of both the hard training labels and uniformly distributed soft labels. It was shown that LS serves as a regularizer for training data with hard…

Machine Learning · Computer Science 2022-06-28 Jiaheng Wei , Hangyu Liu , Tongliang Liu , Gang Niu , Masashi Sugiyama , Yang Liu

Label smoothing (LS) is a popular regularisation method for training neural networks as it is effective in improving test accuracy and is simple to implement. ``Hard'' one-hot labels are ``smoothed'' by uniformly distributing probability…

Machine Learning · Computer Science 2025-02-21 Guoxuan Xia , Olivier Laurent , Gianni Franchi , Christos-Savvas Bouganis

Label smoothing loss is a widely adopted technique to mitigate overfitting in deep neural networks. This paper studies label smoothing from the perspective of Neural Collapse (NC), a powerful empirical and theoretical framework which…

Machine Learning · Computer Science 2025-09-30 Li Guo , George Andriopoulos , Zifan Zhao , Shuyang Ling , Zixuan Dong , Keith Ross

Label smoothing and vocabulary sharing are two widely used techniques in neural machine translation models. However, we argue that simply applying both techniques can be conflicting and even leads to sub-optimal performance. When allocating…

Computation and Language · Computer Science 2022-03-14 Liang Chen , Runxin Xu , Baobao Chang

Label Smoothing (LS) improves model generalization through penalizing models from generating overconfident output distributions. For each training sample the LS strategy smooths the one-hot encoded training signal by distributing its…

Machine Learning · Computer Science 2021-06-29 Hongyu Guo

Training modern neural networks is an inherently noisy process that can lead to high \emph{prediction churn} -- disagreements between re-trainings of the same model due to factors such as randomization in the parameter initialization and…

Machine Learning · Computer Science 2021-06-15 Dara Bahri , Heinrich Jiang

Label smoothing is an effective regularization tool for deep neural networks (DNNs), which generates soft labels by applying a weighted average between the uniform distribution and the hard label. It is often used to reduce the overfitting…

Computer Vision and Pattern Recognition · Computer Science 2021-07-23 Chang-Bin Zhang , Peng-Tao Jiang , Qibin Hou , Yunchao Wei , Qi Han , Zhen Li , Ming-Ming Cheng

It has been recently demonstrated that multi-generational self-distillation can improve generalization. Despite this intriguing observation, reasons for the enhancement remain poorly understood. In this paper, we first demonstrate…

Machine Learning · Computer Science 2020-10-23 Zhilu Zhang , Mert R. Sabuncu

Label smoothing is a widely studied regularization technique in machine learning. However, its potential for node classification in graph-structured data, spanning homophilic to heterophilic graphs, remains largely unexplored. We introduce…

Machine Learning · Computer Science 2026-02-02 Jaeseung Heo , Moonjeong Park , Dongwoo Kim

Label smoothing -- using softened labels instead of hard ones -- is a widely adopted regularization method for deep learning, showing diverse benefits such as enhanced generalization and calibration. Its implications for preserving model…

Machine Learning · Computer Science 2024-07-09 Lukas Struppek , Dominik Hintersdorf , Kristian Kersting

Despite the great success of state-of-the-art deep neural networks, several studies have reported models to be over-confident in predictions, indicating miscalibration. Label Smoothing has been proposed as a solution to the over-confidence…

Computer Vision and Pattern Recognition · Computer Science 2023-01-31 Shuang Ao , Stefan Rueger , Advaith Siddharthan

Regularization is an effective way to promote the generalization performance of machine learning models. In this paper, we focus on label smoothing, a form of output distribution regularization that prevents overfitting of a neural network…

Machine Learning · Computer Science 2020-07-07 Weizhi Li , Gautam Dasarathy , Visar Berisha
‹ Prev 1 2 3 10 Next ›