Related papers: Dropout Training as Adaptive Regularization

Analytic theory of dropout regularization

Dropout is a regularization technique widely used in training artificial neural networks to mitigate overfitting. It consists of dynamically deactivating subsets of the network during training to promote more robust representations. Despite…

Machine Learning · Statistics 2025-09-10 Francesco Mori , Francesca Mignacco

On the Inductive Bias of Dropout

Dropout is a simple but effective technique for learning in neural networks and other settings. A sound theoretical understanding of dropout is needed to determine when dropout should be applied and how to use it most effectively. In this…

Machine Learning · Computer Science 2017-02-21 David P. Helmbold , Philip M. Long

Curriculum Dropout

Dropout is a very effective way of regularizing neural networks. Stochastically "dropping out" units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network…

Neural and Evolutionary Computing · Computer Science 2017-08-04 Pietro Morerio , Jacopo Cavazza , Riccardo Volpi , Rene Vidal , Vittorio Murino

Data Dropout in Arbitrary Basis for Deep Network Regularization

An important problem in training deep networks with high capacity is to ensure that the trained network works well when presented with new inputs outside the training dataset. Dropout is an effective regularization technique to boost the…

Computer Vision and Pattern Recognition · Computer Science 2017-12-06 Mostafa Rahmani , George Atia

Dropout Reduces Underfitting

Introduced by Hinton et al. in 2012, dropout has stood the test of time as a regularizer for preventing overfitting in neural networks. In this study, we demonstrate that dropout can also mitigate underfitting when used at the start of…

Machine Learning · Computer Science 2023-06-01 Zhuang Liu , Zhiqiu Xu , Joseph Jin , Zhiqiang Shen , Trevor Darrell

Information Guided Regularization for Fine-tuning Language Models

The pretraining-fine-tuning paradigm has been the de facto strategy for transfer learning in modern language modeling. With the understanding that task adaptation in LMs is often a function of parameters shared across tasks, we argue that a…

Computation and Language · Computer Science 2024-06-24 Mandar Sharma , Nikhil Muralidhar , Shengzhe Xu , Raquib Bin Yousuf , Naren Ramakrishnan

The Implicit and Explicit Regularization Effects of Dropout

Dropout is a widely-used regularization technique, often required to obtain state-of-the-art for a number of architectures. This work demonstrates that dropout introduces two distinct but entangled regularization effects: an explicit effect…

Machine Learning · Computer Science 2020-10-16 Colin Wei , Sham Kakade , Tengyu Ma

Generalized Dropout

Deep Neural Networks often require good regularizers to generalize well. Dropout is one such regularizer that is widely used among Deep Learning practitioners. Recent work has shown that Dropout can also be viewed as performing Approximate…

Machine Learning · Computer Science 2016-11-22 Suraj Srinivas , R. Venkatesh Babu

Explicit Dropout: Deterministic Regularization for Transformer Architectures

Dropout is a widely used regularization technique in deep learning, but its effects are typically realized through stochastic masking rather than explicit optimization objectives. We propose a deterministic formulation that expresses…

Machine Learning · Computer Science 2026-04-23 Vidhi Agrawal , Illia Oleksiienko , Alexandros Iosifidis

On Fast Dropout and its Applicability to Recurrent Networks

Recurrent Neural Networks (RNNs) are rich models for the processing of sequential data. Recent work on advancing the state of the art has been focused on the optimization or modelling of RNNs, mostly motivated by adressing the problems of…

Machine Learning · Statistics 2014-03-06 Justin Bayer , Christian Osendorfer , Daniela Korhammer , Nutan Chen , Sebastian Urban , Patrick van der Smagt

Adaptive Data Dropout: Towards Self-Regulated Learning in Deep Neural Networks

Deep neural networks are typically trained by uniformly sampling large datasets across epochs, despite evidence that not all samples contribute equally throughout learning. Recent work shows that progressively reducing the amount of…

Machine Learning · Computer Science 2026-04-15 Amar Gahir , Varshil Patel , Shreyank N Gowda

Revisiting Structured Dropout

Large neural networks are often overparameterised and prone to overfitting, Dropout is a widely used regularization technique to combat overfitting and improve model generalization. However, unstructured Dropout is not always effective for…

Machine Learning · Computer Science 2022-10-07 Yiren Zhao , Oluwatomisin Dada , Xitong Gao , Robert D Mullins

DropCluster: A structured dropout for convolutional networks

Dropout as a common regularizer to prevent overfitting in deep neural networks has been less effective in convolutional layers than in fully connected layers. This is because Dropout drops features randomly, without considering local…

Machine Learning · Computer Science 2025-06-05 Liyan Chen , Philippos Mordohai , Sergul Aydore

Adaptive Dropout: Unleashing Dropout across Layers for Generalizable Image Super-Resolution

Blind Super-Resolution (blind SR) aims to enhance the model's generalization ability with unknown degradation, yet it still encounters severe overfitting issues. Some previous methods inspired by dropout, which enhances generalization by…

Computer Vision and Pattern Recognition · Computer Science 2025-06-17 Hang Xu , Wei Yu , Jiangtong Tan , Zhen Zou , Feng Zhao

An Analysis of Dropout for Matrix Factorization

Dropout is a simple yet effective algorithm for regularizing neural networks by randomly dropping out units through Bernoulli multiplicative noise, and for some restricted problem classes, such as linear or logistic regression, several…

Machine Learning · Computer Science 2017-10-12 Jacopo Cavazza , Connor Lane , Benjamin D. Haeffele , Vittorio Murino , René Vidal

On the Regularization Properties of Structured Dropout

Dropout and its extensions (eg. DropBlock and DropConnect) are popular heuristics for training neural networks, which have been shown to improve generalization performance in practice. However, a theoretical understanding of their…

Machine Learning · Computer Science 2020-06-23 Ambar Pal , Connor Lane , René Vidal , Benjamin D. Haeffele

Ising-Dropout: A Regularization Method for Training and Compression of Deep Neural Networks

Overfitting is a major problem in training machine learning models, specifically deep neural networks. This problem may be caused by imbalanced datasets and initialization of the model parameters, which conforms the model too closely to the…

Neural and Evolutionary Computing · Computer Science 2019-02-26 Hojjat Salehinejad , Shahrokh Valaee

Adaptive Regularization via Residual Smoothing in Deep Learning Optimization

We present an adaptive regularization algorithm that can be effectively applied to the optimization problem in deep learning framework. Our regularization algorithm aims to take into account the fitness of data to the current state of model…

Machine Learning · Computer Science 2019-09-02 Junghee Cho , Junseok Kwon , Byung-Woo Hong

Dropout with Expectation-linear Regularization

Dropout, a simple and effective way to train deep neural networks, has led to a number of impressive empirical successes and spawned many recent theoretical investigations. However, the gap between dropout's training and inference phases,…

Machine Learning · Computer Science 2017-02-17 Xuezhe Ma , Yingkai Gao , Zhiting Hu , Yaoliang Yu , Yuntian Deng , Eduard Hovy

Implicit regularization of dropout

It is important to understand how dropout, a popular regularization method, aids in achieving a good generalization solution during neural network training. In this work, we present a theoretical derivation of an implicit regularization of…

Machine Learning · Computer Science 2023-04-11 Zhongwang Zhang , Zhi-Qin John Xu