English

Dropout as a Bayesian Approximation: Appendix

Machine Learning 2016-05-26 v5

Abstract

We show that a neural network with arbitrary depth and non-linearities, with dropout applied before every weight layer, is mathematically equivalent to an approximation to a well known Bayesian model. This interpretation might offer an explanation to some of dropout's key properties, such as its robustness to over-fitting. Our interpretation allows us to reason about uncertainty in deep learning, and allows the introduction of the Bayesian machinery into existing deep learning frameworks in a principled way. This document is an appendix for the main paper "Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning" by Gal and Ghahramani, 2015.

Keywords

Cite

@article{arxiv.1506.02157,
  title  = {Dropout as a Bayesian Approximation: Appendix},
  author = {Yarin Gal and Zoubin Ghahramani},
  journal= {arXiv preprint arXiv:1506.02157},
  year   = {2016}
}

Comments

20 pages, 1 figure; ICML proceedings version

R2 v1 2026-06-22T09:48:29.467Z