Related papers: Self-Balanced Dropout

Bayesian Dropout

Dropout has recently emerged as a powerful and simple method for training neural networks preventing co-adaptation by stochastically omitting neurons. Dropout is currently not grounded in explicit modelling assumptions which so far has…

Machine Learning · Statistics 2022-05-18 Tue Herlau , Morten Mørup , Mikkel N. Schmidt

A Bayesian encourages dropout

Dropout is one of the key techniques to prevent the learning from overfitting. It is explained that dropout works as a kind of modified L2 regularization. Here, we shed light on the dropout from Bayesian standpoint. Bayesian interpretation…

Machine Learning · Computer Science 2014-12-31 Shin-ichi Maeda

A Survey on Dropout Methods and Experimental Verification in Recommendation

Overfitting is a common problem in machine learning, which means the model too closely fits the training data while performing poorly in the test data. Among various methods of coping with overfitting, dropout is one of the representative…

Machine Learning · Computer Science 2022-05-17 Yangkun Li , Weizhi Ma , Chong Chen , Min Zhang , Yiqun Liu , Shaoping Ma , Yuekui Yang

Understanding Dropout as an Optimization Trick

As one of standard approaches to train deep neural networks, dropout has been applied to regularize large models to avoid overfitting, and the improvement in performance by dropout has been explained as avoiding co-adaptation between nodes.…

Machine Learning · Computer Science 2019-10-10 Sangchul Hahn , Heeyoul Choi

From Random to Supervised: A Novel Dropout Mechanism Integrated with Global Information

Dropout is used to avoid overfitting by randomly dropping units from the neural networks during training. Inspired by dropout, this paper presents GI-Dropout, a novel dropout method integrating with global information to improve neural…

Computation and Language · Computer Science 2018-10-11 Hengru Xu , Shen Li , Renfen Hu , Si Li , Sheng Gao

Dropout Reduces Underfitting

Introduced by Hinton et al. in 2012, dropout has stood the test of time as a regularizer for preventing overfitting in neural networks. In this study, we demonstrate that dropout can also mitigate underfitting when used at the start of…

Machine Learning · Computer Science 2023-06-01 Zhuang Liu , Zhiqiu Xu , Joseph Jin , Zhiqiang Shen , Trevor Darrell

Continuous Dropout

Dropout has been proven to be an effective algorithm for training robust deep networks because of its ability to prevent overfitting by avoiding the co-adaptation of feature detectors. Current explanations of dropout include bagging, naive…

Computer Vision and Pattern Recognition · Computer Science 2019-12-02 Xu Shen , Xinmei Tian , Tongliang Liu , Fang Xu , Dacheng Tao

Deep Augmentation: Dropout as Augmentation for Self-Supervised Learning

Despite dropout's ubiquity in machine learning, its effectiveness as a form of data augmentation remains under-explored. We address two key questions: (i) When is dropout effective as an augmentation strategy? (ii) Is dropout uniquely…

Machine Learning · Computer Science 2025-06-02 Rickard Brüel-Gabrielsson , Tongzhou Wang , Manel Baradad , Justin Solomon

STLM Engineering Report: Dropout

In this work we explore the relevance of dropout for modern language models, particularly in the context of models on the scale of <100M parameters. We explore it's relevance firstly in the regime of improving the sample efficiency of…

Computation and Language · Computer Science 2024-09-10 Dylan Hillier , Leon Guertler , Bobby Cheng , Cheston Tan

Analytic theory of dropout regularization

Dropout is a regularization technique widely used in training artificial neural networks to mitigate overfitting. It consists of dynamically deactivating subsets of the network during training to promote more robust representations. Despite…

Machine Learning · Statistics 2025-09-10 Francesco Mori , Francesca Mignacco

AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-Tuning

Fine-tuning large pre-trained language models on downstream tasks is apt to suffer from overfitting when limited training data is available. While dropout proves to be an effective antidote by randomly dropping a proportion of units,…

Computation and Language · Computer Science 2022-10-13 Tao Yang , Jinghao Deng , Xiaojun Quan , Qifan Wang , Shaoliang Nie

Enhancing Transformer Training Efficiency with Dynamic Dropout

We introduce Dynamic Dropout, a novel regularization technique designed to enhance the training efficiency of Transformer models by dynamically adjusting the dropout rate based on training epochs or validation loss improvements. This…

Machine Learning · Computer Science 2024-11-06 Hanrui Yan , Dan Shao

Dropout Inference with Non-Uniform Weight Scaling

Dropout as regularization has been used extensively to prevent overfitting for training neural networks. During training, units and their connections are randomly dropped, which could be considered as sampling many different submodels from…

Machine Learning · Computer Science 2022-04-28 Zhaoyuan Yang , Arpit Jain

Curriculum Dropout

Dropout is a very effective way of regularizing neural networks. Stochastically "dropping out" units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network…

Neural and Evolutionary Computing · Computer Science 2017-08-04 Pietro Morerio , Jacopo Cavazza , Riccardo Volpi , Rene Vidal , Vittorio Murino

Regularizing Meta-Learning via Gradient Dropout

With the growing attention on learning-to-learn new tasks using only a few examples, meta-learning has been widely used in numerous problems such as few-shot classification, reinforcement learning, and domain generalization. However,…

Computer Vision and Pattern Recognition · Computer Science 2020-04-14 Hung-Yu Tseng , Yi-Wen Chen , Yi-Hsuan Tsai , Sifei Liu , Yen-Yu Lin , Ming-Hsuan Yang

Adaptive Data Dropout: Towards Self-Regulated Learning in Deep Neural Networks

Deep neural networks are typically trained by uniformly sampling large datasets across epochs, despite evidence that not all samples contribute equally throughout learning. Recent work shows that progressively reducing the amount of…

Machine Learning · Computer Science 2026-04-15 Amar Gahir , Varshil Patel , Shreyank N Gowda

Lyapunov-Based Dropout Deep Neural Network (Lb-DDNN) Controller

Deep neural network (DNN)-based adaptive controllers can be used to compensate for unstructured uncertainties in nonlinear dynamic systems. However, DNNs are also very susceptible to overfitting and co-adaptation. Dropout regularization is…

Systems and Control · Electrical Eng. & Systems 2023-11-01 Saiedeh Akbari , Emily J. Griffis , Omkar Sudhir Patil , Warren E. Dixon

Ising-Dropout: A Regularization Method for Training and Compression of Deep Neural Networks

Overfitting is a major problem in training machine learning models, specifically deep neural networks. This problem may be caused by imbalanced datasets and initialization of the model parameters, which conforms the model too closely to the…

Neural and Evolutionary Computing · Computer Science 2019-02-26 Hojjat Salehinejad , Shahrokh Valaee

Multi-Sample Dropout for Accelerated Training and Better Generalization

Dropout is a simple but efficient regularization technique for achieving better generalization of deep neural networks (DNNs); hence it is widely used in tasks based on DNNs. During training, dropout randomly discards a portion of the…

Neural and Evolutionary Computing · Computer Science 2020-10-22 Hiroshi Inoue

Dropout with Expectation-linear Regularization

Dropout, a simple and effective way to train deep neural networks, has led to a number of impressive empirical successes and spawned many recent theoretical investigations. However, the gap between dropout's training and inference phases,…

Machine Learning · Computer Science 2017-02-17 Xuezhe Ma , Yingkai Gao , Zhiting Hu , Yaoliang Yu , Yuntian Deng , Eduard Hovy