English
Related papers

Related papers: Variational Nested Dropout

200 papers

Recently, nested dropout was proposed as a method for ordering representation units in autoencoders by their information content, without diminishing reconstruction cost. However, it has only been applied to training fully-connected…

Computer Vision and Pattern Recognition · Computer Science 2015-04-13 Chelsea Finn , Lisa Anne Hendricks , Trevor Darrell

In this paper, we study ordered representations of data in which different dimensions have different degrees of importance. To learn these representations we introduce nested dropout, a procedure for stochastically removing coherent nested…

Machine Learning · Statistics 2014-02-06 Oren Rippel , Michael A. Gelbart , Ryan P. Adams

Approximate inference in Bayesian deep networks exhibits a dilemma of how to yield high fidelity posterior approximations while maintaining computational efficiency and scalability. We tackle this challenge by introducing a novel…

Machine Learning · Computer Science 2021-11-01 Son Nguyen , Duong Nguyen , Khai Nguyen , Khoat Than , Hung Bui , Nhat Ho

Standard deep neural networks (DNNs) are commonly trained in an end-to-end fashion for specific tasks such as object recognition, face identification, or character recognition, among many examples. This specificity often leads to…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Raphaël Achddou , J. Matias di Martino , Guillermo Sapiro

In convolutional neural network (CNN), dropout cannot work well because dropped information is not entirely obscured in convolutional layers where features are correlated spatially. Except randomly discarding regions or channels, many…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Tianshu Xie , Minghui Liu , Jiali Deng , Xuan Cheng , Xiaomin Wang , Ming Liu

Variational dropout (VD) is a generalization of Gaussian dropout, which aims at inferring the posterior of network weights based on a log-uniform prior on them to learn these weights as well as dropout rate simultaneously. The log-uniform…

Machine Learning · Computer Science 2019-04-05 Yuhang Liu , Wenyong Dong , Lei Zhang , Dong Gong , Qinfeng Shi

In order to develop complex relationships between their inputs and outputs, deep neural networks train and adjust large number of parameters. To make these networks work at high accuracy, vast amounts of data are needed. Sometimes, however,…

Machine Learning · Computer Science 2022-01-19 Joshua Shunk

Dropout is often used in deep neural networks to prevent over-fitting. Conventionally, dropout training invokes \textit{random drop} of nodes from the hidden layers of a Neural Network. It is our hypothesis that a guided selection of nodes…

Machine Learning · Computer Science 2018-12-11 Rohit Keshari , Richa Singh , Mayank Vatsa

This paper proposes a novel stable learning theory for recurrent neural networks (RNNs), so-called variational adaptive noise and dropout (VAND). As stabilizing factors for RNNs, noise and dropout on the internal state of RNNs have been…

Machine Learning · Computer Science 2026-02-25 Taisuke Kobayashi , Shingo Murata

Dropout is a simple but efficient regularization technique for achieving better generalization of deep neural networks (DNNs); hence it is widely used in tasks based on DNNs. During training, dropout randomly discards a portion of the…

Neural and Evolutionary Computing · Computer Science 2020-10-22 Hiroshi Inoue

Neural networks are easier to optimise when they have many more weights than are required for modelling the mapping from inputs to outputs. This suggests a two-stage learning procedure that first learns a large net and then prunes away…

Machine Learning · Computer Science 2019-09-10 Aidan N. Gomez , Ivan Zhang , Siddhartha Rao Kamalakara , Divyam Madaan , Kevin Swersky , Yarin Gal , Geoffrey E. Hinton

Learning to infer the conditional posterior model is a key step for robust meta-learning. This paper presents a new Bayesian meta-learning approach called Neural Variational Dropout Processes (NVDPs). NVDPs model the conditional posterior…

Machine Learning · Computer Science 2025-10-23 Insu Jeon , Youngjin Park , Gunhee Kim

Overparameterized transformer networks have obtained state of the art results in various natural language processing tasks, such as machine translation, language modeling, and question answering. These models contain hundreds of millions of…

Machine Learning · Computer Science 2019-09-26 Angela Fan , Edouard Grave , Armand Joulin

An important problem in training deep networks with high capacity is to ensure that the trained network works well when presented with new inputs outside the training dataset. Dropout is an effective regularization technique to boost the…

Computer Vision and Pattern Recognition · Computer Science 2017-12-06 Mostafa Rahmani , George Atia

Recurrent neural networks (RNNs) are important class of architectures among neural networks useful for language modeling and sequential prediction. However, optimizing RNNs is known to be harder compared to feed-forward neural networks. A…

Machine Learning · Statistics 2018-03-29 Konrad Zolna , Devansh Arpit , Dendi Suhubdy , Yoshua Bengio

Recently, there have been increasing demands to construct compact deep architectures to remove unnecessary redundancy and to improve the inference speed. While many recent works focus on reducing the redundancy by eliminating unneeded…

Computer Vision and Pattern Recognition · Computer Science 2018-03-28 Eunwoo Kim , Chanho Ahn , Songhwai Oh

One of the most fundamental design choices in neural networks is layer width: it affects the capacity of what a network can learn and determines the complexity of the solution. This latter property is often exploited when introducing…

Machine Learning · Computer Science 2022-05-04 Edward W. Staley , Jared Markowitz

We explore a recently proposed Variational Dropout technique that provided an elegant Bayesian interpretation to Gaussian Dropout. We extend Variational Dropout to the case when dropout rates are unbounded, propose a way to reduce the…

Machine Learning · Statistics 2017-06-14 Dmitry Molchanov , Arsenii Ashukha , Dmitry Vetrov

Dropout is typically interpreted as bagging a large number of models sharing parameters. We show that using dropout in a network can also be interpreted as a kind of data augmentation in the input space without domain knowledge. We present…

Machine Learning · Statistics 2016-01-11 Xavier Bouthillier , Kishore Konda , Pascal Vincent , Roland Memisevic

Bayesian Neural Networks (BNN) have recently emerged in the Deep Learning world for dealing with uncertainty estimation in classification tasks, and are used in many application domains such as astrophysics, autonomous driving...BNN assume…

Machine Learning · Computer Science 2021-02-04 Claire Theobald , Frédéric Pennerath , Brieuc Conan-Guez , Miguel Couceiro , Amedeo Napoli
‹ Prev 1 2 3 10 Next ›