Related papers: Uncertainty Estimation via Stochastic Batch Normal…

Bayesian Uncertainty Estimation for Batch Normalized Deep Networks

We show that training a deep network using batch normalization is equivalent to approximate inference in Bayesian models. We further demonstrate that this finding allows us to make meaningful estimates of the model uncertainty using…

Machine Learning · Statistics 2018-07-17 Mattias Teye , Hossein Azizpour , Kevin Smith

Stochastic Normalizations as Bayesian Learning

In this work we investigate the reasons why Batch Normalization (BN) improves the generalization performance of deep networks. We argue that one major reason, distinguishing it from data-independent normalization methods, is randomness of…

Machine Learning · Computer Science 2018-11-05 Alexander Shekhovtsov , Boris Flach

Four Things Everyone Should Know to Improve Batch Normalization

A key component of most neural network architectures is the use of normalization layers, such as Batch Normalization. Despite its common use and large utility in optimizing deep architectures, it has been challenging both to generically…

Machine Learning · Computer Science 2020-02-17 Cecilia Summers , Michael J. Dinneen

A Framework for Statistical Inference via Randomized Algorithms

Randomized algorithms, such as randomized sketching or stochastic optimization, are a promising approach to ease the computational burden in analyzing large datasets. However, randomized algorithms also produce non-deterministic outputs,…

Methodology · Statistics 2025-05-13 Zhixiang Zhang , Sokbae Lee , Edgar Dobriban

Improving Output Uncertainty Estimation and Generalization in Deep Learning via Neural Network Gaussian Processes

We propose a simple method that combines neural networks and Gaussian processes. The proposed method can estimate the uncertainty of outputs and flexibly adjust target functions where training data exist, which are advantages of Gaussian…

Machine Learning · Statistics 2017-07-20 Tomoharu Iwata , Zoubin Ghahramani

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates…

Machine Learning · Computer Science 2015-03-03 Sergey Ioffe , Christian Szegedy

How Does Batch Normalization Help Optimization?

Batch Normalization (BatchNorm) is a widely adopted technique that enables faster and more stable training of deep neural networks (DNNs). Despite its pervasiveness, the exact reasons for BatchNorm's effectiveness are still poorly…

Machine Learning · Statistics 2019-04-16 Shibani Santurkar , Dimitris Tsipras , Andrew Ilyas , Aleksander Madry

A Learning-Based Optimal Uncertainty Quantification Method and Its Application to Ballistic Impact Problems

This paper concerns the study of optimal (supremum and infimum) uncertainty bounds for systems where the input (or prior) probability measure is only partially/imperfectly known (e.g., with only statistical moments and/or on a coarse…

Machine Learning · Computer Science 2023-01-02 Xingsheng Sun , Burigede Liu

Theoretical Analysis of Auto Rate-Tuning by Batch Normalization

Batch Normalization (BN) has become a cornerstone of deep learning across diverse architectures, appearing to help optimization as well as generalization. While the idea makes intuitive sense, theoretical analysis of its effectiveness has…

Machine Learning · Computer Science 2018-12-11 Sanjeev Arora , Zhiyuan Li , Kaifeng Lyu

Adaptive Batch Normalization for Training Data with Heterogeneous Features

Batch Normalization (BN) is an important preprocessing step to many deep learning applications. Since it is a data-dependent process, for some homogeneous datasets it is a redundant or even a performance-degrading process. In this paper, we…

Machine Learning · Computer Science 2022-12-01 Wael Alsobhi , Tarik Alafif , Alaa Abdel-Hakim , Weiwei Zong

Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift

Covariate shift has been shown to sharply degrade both predictive accuracy and the calibration of uncertainty estimates for deep learning models. This is worrying, because covariate shift is prevalent in a wide range of real world…

Machine Learning · Computer Science 2021-01-18 Zachary Nado , Shreyas Padhy , D. Sculley , Alexander D'Amour , Balaji Lakshminarayanan , Jasper Snoek

Batch Normalization and the impact of batch structure on the behavior of deep convolution networks

Batch normalization was introduced in 2015 to speed up training of deep convolution networks by normalizing the activations across the current batch to have zero mean and unity variance. The results presented here show an interesting aspect…

Computer Vision and Pattern Recognition · Computer Science 2018-02-22 Mohamed Hajaj , Duncan Gillies

Unsupervised Model Drift Estimation with Batch Normalization Statistics for Dataset Shift Detection and Model Selection

While many real-world data streams imply that they change frequently in a nonstationary way, most of deep learning methods optimize neural networks on training data, and this leads to severe performance degradation when dataset shift…

Machine Learning · Computer Science 2021-07-02 Wonju Lee , Seok-Yong Byun , Jooeun Kim , Minje Park , Kirill Chechil

Bias-Reduced Uncertainty Estimation for Deep Neural Classifiers

We consider the problem of uncertainty estimation in the context of (non-Bayesian) deep neural classification. In this context, all known methods are based on extracting uncertainty signals from a trained network optimized to solve the…

Machine Learning · Computer Science 2019-04-25 Yonatan Geifman , Guy Uziel , Ran El-Yaniv

Uncertainty Quantification in Deep Residual Neural Networks

Uncertainty quantification is an important and challenging problem in deep learning. Previous methods rely on dropout layers which are not present in modern deep architectures or batch normalization which is sensitive to batch sizes. In…

Computer Vision and Pattern Recognition · Computer Science 2020-07-10 Lukasz Wandzik , Raul Vicente Garcia , Jörg Krüger

Batch Normalization is a Cause of Adversarial Vulnerability

Batch normalization (batch norm) is often used in an attempt to stabilize and accelerate training in deep neural networks. In many cases it indeed decreases the number of parameter updates required to achieve low training error. However, it…

Machine Learning · Computer Science 2019-05-31 Angus Galloway , Anna Golubeva , Thomas Tanay , Medhat Moussa , Graham W. Taylor

A General Framework for Uncertainty Estimation in Deep Learning

Neural networks predictions are unreliable when the input sample is out of the training distribution or corrupted by noise. Being able to detect such failures automatically is fundamental to integrate deep learning algorithms into robotics.…

Computer Vision and Pattern Recognition · Computer Science 2020-02-18 Antonio Loquercio , Mattia Segù , Davide Scaramuzza

Delving into the Estimation Shift of Batch Normalization in a Network

Batch normalization (BN) is a milestone technique in deep learning. It normalizes the activation using mini-batch statistics during training but the estimated population statistics during inference. This paper focuses on investigating the…

Computer Vision and Pattern Recognition · Computer Science 2022-03-22 Lei Huang , Yi Zhou , Tian Wang , Jie Luo , Xianglong Liu

Stochastic batch size for adaptive regularization in deep network optimization

We propose a first-order stochastic optimization algorithm incorporating adaptive regularization applicable to machine learning problems in deep learning framework. The adaptive regularization is imposed by stochastic process in determining…

Machine Learning · Computer Science 2020-04-15 Kensuke Nakamura , Stefano Soatto , Byung-Woo Hong

Understanding Batch Normalization

Batch normalization (BN) is a technique to normalize activations in intermediate layers of deep neural networks. Its tendency to improve accuracy and speed up training have established BN as a favorite technique in deep learning. Yet,…

Machine Learning · Computer Science 2018-12-03 Johan Bjorck , Carla Gomes , Bart Selman , Kilian Q. Weinberger