Related papers: Uncertainty Estimation via Stochastic Batch Normal…

200 papers

We show that training a deep network using batch normalization is equivalent to approximate inference in Bayesian models. We further demonstrate that this finding allows us to make meaningful estimates of the model uncertainty using…

Machine Learning · Statistics · 2018-07-17 · Mattias Teye, Hossein Azizpour, Kevin Smith

In this work we investigate the reasons why Batch Normalization (BN) improves the generalization performance of deep networks. We argue that one major reason, distinguishing it from data-independent normalization methods, is randomness of…

Machine Learning · Computer Science · 2018-11-05 · Alexander Shekhovtsov, Boris Flach

A key component of most neural network architectures is the use of normalization layers, such as Batch Normalization. Despite its common use and large utility in optimizing deep architectures, it has been challenging both to generically…

Machine Learning · Computer Science · 2020-02-17 · Cecilia Summers, Michael J. Dinneen

Randomized algorithms, such as randomized sketching or stochastic optimization, are a promising approach to ease the computational burden in analyzing large datasets. However, randomized algorithms also produce non-deterministic outputs,…

Methodology · Statistics · 2025-05-13 · Zhixiang Zhang, Sokbae Lee, Edgar Dobriban

We propose a simple method that combines neural networks and Gaussian processes. The proposed method can estimate the uncertainty of outputs and flexibly adjust target functions where training data exist, which are advantages of Gaussian…

Machine Learning · Statistics · 2017-07-20 · Tomoharu Iwata, Zoubin Ghahramani

Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates…

Machine Learning · Computer Science · 2015-03-03 · Sergey Ioffe, Christian Szegedy

Batch Normalization (BatchNorm) is a widely adopted technique that enables faster and more stable training of deep neural networks (DNNs). Despite its pervasiveness, the exact reasons for BatchNorm's effectiveness are still poorly…

Machine Learning · Statistics · 2019-04-16 · Shibani Santurkar, Dimitris Tsipras, Andrew Ilyas, Aleksander Madry

This paper concerns the study of optimal (supremum and infimum) uncertainty bounds for systems where the input (or prior) probability measure is only partially/imperfectly known (e.g., with only statistical moments and/or on a coarse…

Machine Learning · Computer Science · 2023-01-02 · Xingsheng Sun, Burigede Liu

Batch Normalization (BN) has become a cornerstone of deep learning across diverse architectures, appearing to help optimization as well as generalization. While the idea makes intuitive sense, theoretical analysis of its effectiveness has…

Machine Learning · Computer Science · 2018-12-11 · Sanjeev Arora, Zhiyuan Li, Kaifeng Lyu

Batch Normalization (BN) is an important preprocessing step to many deep learning applications. Since it is a data-dependent process, for some homogeneous datasets it is a redundant or even a performance-degrading process. In this paper, we…

Machine Learning · Computer Science · 2022-12-01 · Wael Alsobhi, Tarik Alafif, Alaa Abdel-Hakim, Weiwei Zong

Covariate shift has been shown to sharply degrade both predictive accuracy and the calibration of uncertainty estimates for deep learning models. This is worrying, because covariate shift is prevalent in a wide range of real world…

Machine Learning · Computer Science · 2021-01-18 · Zachary Nado, Shreyas Padhy, D. Sculley, Alexander D'Amour, Balaji Lakshminarayanan, Jasper Snoek

Batch normalization was introduced in 2015 to speed up training of deep convolution networks by normalizing the activations across the current batch to have zero mean and unity variance. The results presented here show an interesting aspect…

Computer Vision and Pattern Recognition · Computer Science · 2018-02-22 · Mohamed Hajaj, Duncan Gillies

While many real-world data streams imply that they change frequently in a nonstationary way, most of deep learning methods optimize neural networks on training data, and this leads to severe performance degradation when dataset shift…

Machine Learning · Computer Science · 2021-07-02 · Wonju Lee, Seok-Yong Byun, Jooeun Kim, Minje Park, Kirill Chechil

We consider the problem of uncertainty estimation in the context of (non-Bayesian) deep neural classification. In this context, all known methods are based on extracting uncertainty signals from a trained network optimized to solve the…

Machine Learning · Computer Science · 2019-04-25 · Yonatan Geifman, Guy Uziel, Ran El-Yaniv

Uncertainty quantification is an important and challenging problem in deep learning. Previous methods rely on dropout layers which are not present in modern deep architectures or batch normalization which is sensitive to batch sizes. In…

Computer Vision and Pattern Recognition · Computer Science · 2020-07-10 · Lukasz Wandzik, Raul Vicente Garcia, Jörg Krüger

Batch normalization (batch norm) is often used in an attempt to stabilize and accelerate training in deep neural networks. In many cases it indeed decreases the number of parameter updates required to achieve low training error. However, it…

Machine Learning · Computer Science · 2019-05-31 · Angus Galloway, Anna Golubeva, Thomas Tanay, Medhat Moussa, Graham W. Taylor

Neural networks predictions are unreliable when the input sample is out of the training distribution or corrupted by noise. Being able to detect such failures automatically is fundamental to integrate deep learning algorithms into robotics.…

Computer Vision and Pattern Recognition · Computer Science · 2020-02-18 · Antonio Loquercio, Mattia Segù, Davide Scaramuzza

Batch normalization (BN) is a milestone technique in deep learning. It normalizes the activation using mini-batch statistics during training but the estimated population statistics during inference. This paper focuses on investigating the…

Computer Vision and Pattern Recognition · Computer Science · 2022-03-22 · Lei Huang, Yi Zhou, Tian Wang, Jie Luo, Xianglong Liu

We propose a first-order stochastic optimization algorithm incorporating adaptive regularization applicable to machine learning problems in deep learning framework. The adaptive regularization is imposed by stochastic process in determining…

Machine Learning · Computer Science · 2020-04-15 · Kensuke Nakamura, Stefano Soatto, Byung-Woo Hong

Batch normalization (BN) is a technique to normalize activations in intermediate layers of deep neural networks. Its tendency to improve accuracy and speed up training have established BN as a favorite technique in deep learning. Yet,…

Machine Learning · Computer Science · 2018-12-03 · Johan Bjorck, Carla Gomes, Bart Selman, Kilian Q. Weinberger