English
Related papers

Related papers: Dropout as a Bayesian Approximation: Appendix

200 papers

Deep learning tools have gained tremendous attention in applied machine learning. However such tools for regression and classification do not capture model uncertainty. In comparison, Bayesian models offer a mathematically grounded…

Machine Learning · Statistics 2016-10-05 Yarin Gal , Zoubin Ghahramani

Dropout has recently emerged as a powerful and simple method for training neural networks preventing co-adaptation by stochastically omitting neurons. Dropout is currently not grounded in explicit modelling assumptions which so far has…

Machine Learning · Statistics 2022-05-18 Tue Herlau , Morten Mørup , Mikkel N. Schmidt

Dropout regularization of deep neural networks has been a mysterious yet effective tool to prevent overfitting. Explanations for its success range from the prevention of "co-adapted" weights to it being a form of cheap Bayesian inference.…

Machine Learning · Statistics 2019-05-30 Eric Nalisnick , José Miguel Hernández-Lobato , Padhraic Smyth

Dropout, a simple and effective way to train deep neural networks, has led to a number of impressive empirical successes and spawned many recent theoretical investigations. However, the gap between dropout's training and inference phases,…

Machine Learning · Computer Science 2017-02-17 Xuezhe Ma , Yingkai Gao , Zhiting Hu , Yaoliang Yu , Yuntian Deng , Eduard Hovy

Algorithmic approaches endow deep learning systems with implicit bias that helps them generalize even in over-parametrized settings. In this paper, we focus on understanding such a bias induced in learning through dropout, a popular…

Machine Learning · Computer Science 2018-06-27 Poorya Mianjy , Raman Arora , Rene Vidal

Recurrent neural networks (RNNs) stand at the forefront of many recent developments in deep learning. Yet a major difficulty with these models is their tendency to overfit, with dropout shown to fail when applied to recurrent layers. Recent…

Machine Learning · Statistics 2016-10-06 Yarin Gal , Zoubin Ghahramani

Due to lack of data, overfitting ubiquitously exists in real-world applications of deep neural networks (DNNs). We propose advanced dropout, a model-free methodology, to mitigate overfitting and improve the performance of DNNs. The advanced…

Machine Learning · Computer Science 2021-08-11 Jiyang Xie , Zhanyu Ma , and Jianjun Lei , Guoqiang Zhang , Jing-Hao Xue , Zheng-Hua Tan , Jun Guo

We study the Automatic Relevance Determination procedure applied to deep neural networks. We show that ARD applied to Bayesian DNNs with Gaussian approximate posterior distributions leads to a variational bound similar to that of…

Machine Learning · Statistics 2018-11-29 Valery Kharitonov , Dmitry Molchanov , Dmitry Vetrov

Dropout is one of the key techniques to prevent the learning from overfitting. It is explained that dropout works as a kind of modified L2 regularization. Here, we shed light on the dropout from Bayesian standpoint. Bayesian interpretation…

Machine Learning · Computer Science 2014-12-31 Shin-ichi Maeda

Deep Neural Networks often require good regularizers to generalize well. Dropout is one such regularizer that is widely used among Deep Learning practitioners. Recent work has shown that Dropout can also be viewed as performing Approximate…

Machine Learning · Computer Science 2016-11-22 Suraj Srinivas , R. Venkatesh Babu

We investigate the convergence and convergence rate of stochastic training algorithms for Neural Networks (NNs) that have been inspired by Dropout (Hinton et al., 2012). With the goal of avoiding overfitting during training of NNs, dropout…

Optimization and Control · Mathematics 2023-03-24 Albert Senen-Cerda , Jaron Sanders

Dropout, a stochastic regularisation technique for training of neural networks, has recently been reinterpreted as a specific type of approximate inference algorithm for Bayesian neural networks. The main contribution of the…

Machine Learning · Statistics 2018-07-06 Jiri Hron , Alexander G. de G. Matthews , Zoubin Ghahramani

Bayesian Neural Networks (BNN) have recently emerged in the Deep Learning world for dealing with uncertainty estimation in classification tasks, and are used in many application domains such as astrophysics, autonomous driving...BNN assume…

Machine Learning · Computer Science 2021-02-04 Claire Theobald , Frédéric Pennerath , Brieuc Conan-Guez , Miguel Couceiro , Amedeo Napoli

Despite dropout's ubiquity in machine learning, its effectiveness as a form of data augmentation remains under-explored. We address two key questions: (i) When is dropout effective as an augmentation strategy? (ii) Is dropout uniquely…

Machine Learning · Computer Science 2025-06-02 Rickard Brüel-Gabrielsson , Tongzhou Wang , Manel Baradad , Justin Solomon

Deep neural networks have become the default choice for many of the machine learning tasks such as classification and regression. Dropout, a method commonly used to improve the convergence of deep neural networks, generates an ensemble of…

Machine Learning · Statistics 2019-04-11 Tal Kachman , Michal Moshkovitz , Michal Rosen-Zvi

We present a new method to approximate posterior probabilities of Bayesian Network using Deep Neural Network. Experiment results on several public Bayesian Network datasets shows that Deep Neural Network is capable of learning joint…

Machine Learning · Computer Science 2018-01-12 Jie Jia , Honggang Zhou , Yunchun Li

In classification applications, we often want probabilistic predictions to reflect confidence or uncertainty. Dropout, a commonly used training technique, has recently been linked to Bayesian inference, yielding an efficient way to quantify…

Machine Learning · Computer Science 2019-06-25 Zhilu Zhang , Adrian V. Dalca , Mert R. Sabuncu

Current approaches in approximate inference for Bayesian neural networks minimise the Kullback-Leibler divergence to approximate the true posterior over the weights. However, this approximation is without knowledge of the final application,…

Machine Learning · Statistics 2018-05-11 Adam D. Cobb , Stephen J. Roberts , Yarin Gal

The key distinguishing property of a Bayesian approach is marginalization instead of optimization, not the prior, or Bayes rule. Bayesian inference is especially compelling for deep neural networks. (1) Neural networks are typically…

Machine Learning · Computer Science 2020-01-30 Andrew Gordon Wilson

To obtain uncertainty estimates with real-world Bayesian deep learning models, practical inference approximations are needed. Dropout variational inference (VI) for example has been used for machine vision and medical applications, but VI…

Machine Learning · Computer Science 2017-03-09 Yingzhen Li , Yarin Gal
‹ Prev 1 2 3 10 Next ›