Related papers: Dropout: Explicit Forms and Capacity Control

Explicit Dropout: Deterministic Regularization for Transformer Architectures

Dropout is a widely used regularization technique in deep learning, but its effects are typically realized through stochastic masking rather than explicit optimization objectives. We propose a deterministic formulation that expresses…

Machine Learning · Computer Science 2026-04-23 Vidhi Agrawal , Illia Oleksiienko , Alexandros Iosifidis

An Analysis of Dropout for Matrix Factorization

Dropout is a simple yet effective algorithm for regularizing neural networks by randomly dropping out units through Bernoulli multiplicative noise, and for some restricted problem classes, such as linear or logistic regression, several…

Machine Learning · Computer Science 2017-10-12 Jacopo Cavazza , Connor Lane , Benjamin D. Haeffele , Vittorio Murino , René Vidal

Beyond Dropout: Feature Map Distortion to Regularize Deep Neural Networks

Deep neural networks often consist of a great number of trainable parameters for extracting powerful features from given datasets. On one hand, massive trainable parameters significantly enhance the performance of these deep networks. On…

Machine Learning · Computer Science 2020-02-26 Yehui Tang , Yunhe Wang , Yixing Xu , Boxin Shi , Chao Xu , Chunjing Xu , Chang Xu

Dropout Rademacher Complexity of Deep Neural Networks

Great successes of deep neural networks have been witnessed in various real applications. Many algorithmic and implementation techniques have been developed, however, theoretical understanding of many aspects of deep neural networks is far…

Neural and Evolutionary Computing · Computer Science 2020-07-07 Wei Gao , Zhi-Hua Zhou

The Implicit and Explicit Regularization Effects of Dropout

Dropout is a widely-used regularization technique, often required to obtain state-of-the-art for a number of architectures. This work demonstrates that dropout introduces two distinct but entangled regularization effects: an explicit effect…

Machine Learning · Computer Science 2020-10-16 Colin Wei , Sham Kakade , Tengyu Ma

Investigating the Relationship Between Dropout Regularization and Model Complexity in Neural Networks

Dropout Regularization, serving to reduce variance, is nearly ubiquitous in Deep Learning models. We explore the relationship between the dropout rate and model complexity by training 2,000 neural networks configured with random…

Machine Learning · Computer Science 2021-08-30 Christopher Sun , Jai Sharma , Milind Maiti

MaxDropout: Deep Neural Network Regularization Based on Maximum Output Values

Different techniques have emerged in the deep learning scenario, such as Convolutional Neural Networks, Deep Belief Networks, and Long Short-Term Memory Networks, to cite a few. In lockstep, regularization methods, which aim to prevent…

Machine Learning · Computer Science 2020-07-28 Claudio Filipi Goncalves do Santos , Danilo Colombo , Mateus Roder , João Paulo Papa

Dropout with Expectation-linear Regularization

Dropout, a simple and effective way to train deep neural networks, has led to a number of impressive empirical successes and spawned many recent theoretical investigations. However, the gap between dropout's training and inference phases,…

Machine Learning · Computer Science 2017-02-17 Xuezhe Ma , Yingkai Gao , Zhiting Hu , Yaoliang Yu , Yuntian Deng , Eduard Hovy

On Fast Dropout and its Applicability to Recurrent Networks

Recurrent Neural Networks (RNNs) are rich models for the processing of sequential data. Recent work on advancing the state of the art has been focused on the optimization or modelling of RNNs, mostly motivated by adressing the problems of…

Machine Learning · Statistics 2014-03-06 Justin Bayer , Christian Osendorfer , Daniela Korhammer , Nutan Chen , Sebastian Urban , Patrick van der Smagt

Neuron-Specific Dropout: A Deterministic Regularization Technique to Prevent Neural Networks from Overfitting & Reduce Dependence on Large Training Samples

In order to develop complex relationships between their inputs and outputs, deep neural networks train and adjust large number of parameters. To make these networks work at high accuracy, vast amounts of data are needed. Sometimes, however,…

Machine Learning · Computer Science 2022-01-19 Joshua Shunk

Data Dropout in Arbitrary Basis for Deep Network Regularization

An important problem in training deep networks with high capacity is to ensure that the trained network works well when presented with new inputs outside the training dataset. Dropout is an effective regularization technique to boost the…

Computer Vision and Pattern Recognition · Computer Science 2017-12-06 Mostafa Rahmani , George Atia

Enhancing Transformer Training Efficiency with Dynamic Dropout

We introduce Dynamic Dropout, a novel regularization technique designed to enhance the training efficiency of Transformer models by dynamically adjusting the dropout rate based on training epochs or validation loss improvements. This…

Machine Learning · Computer Science 2024-11-06 Hanrui Yan , Dan Shao

Do Neural Topic Models Really Need Dropout? Analysis of the Effect of Dropout in Topic Modeling

Dropout is a widely used regularization trick to resolve the overfitting issue in large feedforward neural networks trained on a small dataset, which performs poorly on the held-out test subset. Although the effectiveness of this…

Computation and Language · Computer Science 2023-03-29 Suman Adhya , Avishek Lahiri , Debarshi Kumar Sanyal

SoftTarget Regularization: An Effective Technique to Reduce Over-Fitting in Neural Networks

Deep neural networks are learning models with a very high capacity and therefore prone to over-fitting. Many regularization techniques such as Dropout, DropConnect, and weight decay all attempt to solve the problem of over-fitting by…

Machine Learning · Computer Science 2016-12-06 Armen Aghajanyan

Dropout as a Low-Rank Regularizer for Matrix Factorization

Regularization for matrix factorization (MF) and approximation problems has been carried out in many different ways. Due to its popularity in deep learning, dropout has been applied also for this class of problems. Despite its solid…

Machine Learning · Computer Science 2017-10-17 Jacopo Cavazza , Pietro Morerio , Benjamin Haeffele , Connor Lane , Vittorio Murino , Rene Vidal

Do deep nets really need weight decay and dropout?

The impressive success of modern deep neural networks on computer vision tasks has been achieved through models of very large capacity compared to the number of available training examples. This overparameterization is often said to be…

Computer Vision and Pattern Recognition · Computer Science 2018-07-13 Alex Hernández-García , Peter König

Excitation Dropout: Encouraging Plasticity in Deep Neural Networks

We propose a guided dropout regularizer for deep networks based on the evidence of a network prediction defined as the firing of neurons in specific paths. In this work, we utilize the evidence at each neuron to determine the probability of…

Computer Vision and Pattern Recognition · Computer Science 2021-01-22 Andrea Zunino , Sarah Adel Bargal , Pietro Morerio , Jianming Zhang , Stan Sclaroff , Vittorio Murino

Information Geometry of Dropout Training

Dropout is one of the most popular regularization techniques in neural network training. Because of its power and simplicity of idea, dropout has been analyzed extensively and many variants have been proposed. In this paper, several…

Machine Learning · Statistics 2022-06-23 Masanari Kimura , Hideitsu Hino

Dropout Training as Adaptive Regularization

Dropout and other feature noising schemes control overfitting by artificially corrupting the training data. For generalized linear models, dropout performs a form of adaptive regularization. Using this viewpoint, we show that the dropout…

Machine Learning · Statistics 2013-11-04 Stefan Wager , Sida Wang , Percy Liang

Analytic theory of dropout regularization

Dropout is a regularization technique widely used in training artificial neural networks to mitigate overfitting. It consists of dynamically deactivating subsets of the network during training to promote more robust representations. Despite…

Machine Learning · Statistics 2025-09-10 Francesco Mori , Francesca Mignacco