English
Related papers

Related papers: Revisiting Structured Dropout

200 papers

Dropout as a common regularizer to prevent overfitting in deep neural networks has been less effective in convolutional layers than in fully connected layers. This is because Dropout drops features randomly, without considering local…

Machine Learning · Computer Science 2025-06-05 Liyan Chen , Philippos Mordohai , Sergul Aydore

Deep neural networks often work well when they are over-parameterized and trained with a massive amount of noise and regularization, such as weight decay and dropout. Although dropout is widely used as a regularization technique for fully…

Computer Vision and Pattern Recognition · Computer Science 2018-10-31 Golnaz Ghiasi , Tsung-Yi Lin , Quoc V. Le

Dropout is a very effective way of regularizing neural networks. Stochastically "dropping out" units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network…

Neural and Evolutionary Computing · Computer Science 2017-08-04 Pietro Morerio , Jacopo Cavazza , Riccardo Volpi , Rene Vidal , Vittorio Murino

Neural networks are often over-parameterized and hence benefit from aggressive regularization. Conventional regularization methods, such as Dropout or weight decay, do not leverage the structures of the network's inputs and hidden states.…

Machine Learning · Computer Science 2021-01-07 Hieu Pham , Quoc V. Le

This paper proposes a new regularization algorithm referred to as macro-block dropout. The overfitting issue has been a difficult problem in training large neural network models. The dropout technique has proven to be simple yet very…

Machine Learning · Computer Science 2023-01-02 Chanwoo Kim , Sathish Indurti , Jinhwan Park , Wonyong Sung

Dropout is a simple but efficient regularization technique for achieving better generalization of deep neural networks (DNNs); hence it is widely used in tasks based on DNNs. During training, dropout randomly discards a portion of the…

Neural and Evolutionary Computing · Computer Science 2020-10-22 Hiroshi Inoue

In order to develop complex relationships between their inputs and outputs, deep neural networks train and adjust large number of parameters. To make these networks work at high accuracy, vast amounts of data are needed. Sometimes, however,…

Machine Learning · Computer Science 2022-01-19 Joshua Shunk

Dropout is often used in deep neural networks to prevent over-fitting. Conventionally, dropout training invokes \textit{random drop} of nodes from the hidden layers of a Neural Network. It is our hypothesis that a guided selection of nodes…

Machine Learning · Computer Science 2018-12-11 Rohit Keshari , Richa Singh , Mayank Vatsa

An important problem in training deep networks with high capacity is to ensure that the trained network works well when presented with new inputs outside the training dataset. Dropout is an effective regularization technique to boost the…

Computer Vision and Pattern Recognition · Computer Science 2017-12-06 Mostafa Rahmani , George Atia

Dropout is a regularization technique widely used in training artificial neural networks to mitigate overfitting. It consists of dynamically deactivating subsets of the network during training to promote more robust representations. Despite…

Machine Learning · Statistics 2025-09-10 Francesco Mori , Francesca Mignacco

Dropout is a widely used regularization technique which improves the generalization ability of a model by randomly dropping neurons. In light of this, we propose Dropout Prompt Learning, which aims for applying dropout to improve the…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Biao Chen , Lin Zuo , Mengmeng Jing , Kunbin He , Yuchen Wang

Overparameterized transformer networks have obtained state of the art results in various natural language processing tasks, such as machine translation, language modeling, and question answering. These models contain hundreds of millions of…

Machine Learning · Computer Science 2019-09-26 Angela Fan , Edouard Grave , Armand Joulin

Overfitting is a common problem in machine learning, which means the model too closely fits the training data while performing poorly in the test data. Among various methods of coping with overfitting, dropout is one of the representative…

Machine Learning · Computer Science 2022-05-17 Yangkun Li , Weizhi Ma , Chong Chen , Min Zhang , Yiqun Liu , Shaoping Ma , Yuekui Yang

Introduced by Hinton et al. in 2012, dropout has stood the test of time as a regularizer for preventing overfitting in neural networks. In this study, we demonstrate that dropout can also mitigate underfitting when used at the start of…

Machine Learning · Computer Science 2023-06-01 Zhuang Liu , Zhiqiu Xu , Joseph Jin , Zhiqiang Shen , Trevor Darrell

The pretraining-fine-tuning paradigm has been the de facto strategy for transfer learning in modern language modeling. With the understanding that task adaptation in LMs is often a function of parameters shared across tasks, we argue that a…

Computation and Language · Computer Science 2024-06-24 Mandar Sharma , Nikhil Muralidhar , Shengzhe Xu , Raquib Bin Yousuf , Naren Ramakrishnan

Recurrent neural networks (RNNs) are important class of architectures among neural networks useful for language modeling and sequential prediction. However, optimizing RNNs is known to be harder compared to feed-forward neural networks. A…

Machine Learning · Statistics 2018-03-29 Konrad Zolna , Devansh Arpit , Dendi Suhubdy , Yoshua Bengio

Dropout, a simple and effective way to train deep neural networks, has led to a number of impressive empirical successes and spawned many recent theoretical investigations. However, the gap between dropout's training and inference phases,…

Machine Learning · Computer Science 2017-02-17 Xuezhe Ma , Yingkai Gao , Zhiting Hu , Yaoliang Yu , Yuntian Deng , Eduard Hovy

The big breakthrough on the ImageNet challenge in 2012 was partially due to the `dropout' technique used to avoid overfitting. Here, we introduce a new approach called `Spectral Dropout' to improve the generalization ability of deep neural…

Computer Vision and Pattern Recognition · Computer Science 2017-11-27 Salman Khan , Munawar Hayat , Fatih Porikli

Using a large number of parameters , deep neural networks have achieved remarkable performance on computer vison and natural language processing tasks. However the networks usually suffer from overfitting by using too much parameters.…

Computer Vision and Pattern Recognition · Computer Science 2018-10-24 Zhengsu Chen Jianwei Niu Qi Tian

Dropout regularization has been widely used in deep learning but performs less effective for convolutional neural networks since the spatially correlated features allow dropped information to still flow through the networks. Some structured…

Computer Vision and Pattern Recognition · Computer Science 2020-10-22 Hui Zhu , Xiaofang Zhao
‹ Prev 1 2 3 10 Next ›