Related papers: Stochastic Normalizations as Bayesian Learning

Understanding Batch Normalization

Batch normalization (BN) is a technique to normalize activations in intermediate layers of deep neural networks. Its tendency to improve accuracy and speed up training have established BN as a favorite technique in deep learning. Yet,…

Machine Learning · Computer Science 2018-12-03 Johan Bjorck , Carla Gomes , Bart Selman , Kilian Q. Weinberger

Bayesian Uncertainty Estimation for Batch Normalized Deep Networks

We show that training a deep network using batch normalization is equivalent to approximate inference in Bayesian models. We further demonstrate that this finding allows us to make meaningful estimates of the model uncertainty using…

Machine Learning · Statistics 2018-07-17 Mattias Teye , Hossein Azizpour , Kevin Smith

Theoretical Insight into Batch Normalization: Data Dependant Auto-Tuning of Regularization Rate

Batch normalization is widely used in deep learning to normalize intermediate activations. Deep networks suffer from notoriously increased training complexity, mandating careful initialization of weights, requiring lower learning rates,…

Machine Learning · Statistics 2022-10-19 Lakshmi Annamalai , Chetan Singh Thakur

Towards Understanding Regularization in Batch Normalization

Batch Normalization (BN) improves both convergence and generalization in training neural networks. This work understands these phenomena theoretically. We analyze BN by using a basic block of neural networks, consisting of a kernel layer, a…

Machine Learning · Computer Science 2019-04-25 Ping Luo , Xinjiang Wang , Wenqi Shao , Zhanglin Peng

Batch Normalization Preconditioning for Neural Network Training

Batch normalization (BN) is a popular and ubiquitous method in deep learning that has been shown to decrease training time and improve generalization performance of neural networks. Despite its success, BN is not theoretically well…

Machine Learning · Computer Science 2022-01-21 Susanna Lange , Kyle Helfrich , Qiang Ye

An Investigation into the Stochasticity of Batch Whitening

Batch Normalization (BN) is extensively employed in various network architectures by performing standardization within mini-batches. A full understanding of the process has been a central target in the deep learning communities. Unlike…

Computer Vision and Pattern Recognition · Computer Science 2020-03-30 Lei Huang , Lei Zhao , Yi Zhou , Fan Zhu , Li Liu , Ling Shao

How Does Batch Normalization Help Optimization?

Batch Normalization (BatchNorm) is a widely adopted technique that enables faster and more stable training of deep neural networks (DNNs). Despite its pervasiveness, the exact reasons for BatchNorm's effectiveness are still poorly…

Machine Learning · Statistics 2019-04-16 Shibani Santurkar , Dimitris Tsipras , Andrew Ilyas , Aleksander Madry

Mode Normalization

Normalization methods are a central building block in the deep learning toolbox. They accelerate and stabilize training, while decreasing the dependence on manually tuned learning rate schedules. When learning from multi-modal…

Machine Learning · Computer Science 2018-10-15 Lucas Deecke , Iain Murray , Hakan Bilen

Delving into the Estimation Shift of Batch Normalization in a Network

Batch normalization (BN) is a milestone technique in deep learning. It normalizes the activation using mini-batch statistics during training but the estimated population statistics during inference. This paper focuses on investigating the…

Computer Vision and Pattern Recognition · Computer Science 2022-03-22 Lei Huang , Yi Zhou , Tian Wang , Jie Luo , Xianglong Liu

Batch Normalization Biases Residual Blocks Towards the Identity Function in Deep Networks

Batch normalization dramatically increases the largest trainable depth of residual networks, and this benefit has been crucial to the empirical success of deep residual networks on a wide range of benchmarks. We show that this key benefit…

Machine Learning · Computer Science 2020-12-10 Soham De , Samuel L. Smith

Batch Normalization Explained

A critically important, ubiquitous, and yet poorly understood ingredient in modern deep networks (DNs) is batch normalization (BN), which centers and normalizes the feature maps. To date, only limited progress has been made understanding…

Machine Learning · Computer Science 2022-09-30 Randall Balestriero , Richard G. Baraniuk

Supervised Batch Normalization

Batch Normalization (BN), a widely-used technique in neural networks, enhances generalization and expedites training by normalizing each mini-batch to the same mean and variance. However, its effectiveness diminishes when confronted with…

Machine Learning · Computer Science 2024-05-28 Bilal Faye , Mustapha Lebbah , Hanane Azzag

Adaptive Batch Normalization for Training Data with Heterogeneous Features

Batch Normalization (BN) is an important preprocessing step to many deep learning applications. Since it is a data-dependent process, for some homogeneous datasets it is a redundant or even a performance-degrading process. In this paper, we…

Machine Learning · Computer Science 2022-12-01 Wael Alsobhi , Tarik Alafif , Alaa Abdel-Hakim , Weiwei Zong

Why Regularized Auto-Encoders learn Sparse Representation?

While the authors of Batch Normalization (BN) identify and address an important problem involved in training deep networks-- \textit{Internal Covariate Shift}-- the current solution has certain drawbacks. For instance, BN depends on batch…

Machine Learning · Statistics 2016-06-21 Devansh Arpit , Yingbo Zhou , Hung Ngo , Venu Govindaraju

Revisit Batch Normalization: New Understanding from an Optimization View and a Refinement via Composition Optimization

Batch Normalization (BN) has been used extensively in deep learning to achieve faster training process and better resulting models. However, whether BN works strongly depends on how the batches are constructed during training and it may not…

Optimization and Control · Mathematics 2018-10-16 Xiangru Lian , Ji Liu

Theoretical Analysis of Auto Rate-Tuning by Batch Normalization

Batch Normalization (BN) has become a cornerstone of deep learning across diverse architectures, appearing to help optimization as well as generalization. While the idea makes intuitive sense, theoretical analysis of its effectiveness has…

Machine Learning · Computer Science 2018-12-11 Sanjeev Arora , Zhiyuan Li , Kaifeng Lyu

Generalized Batch Normalization: Towards Accelerating Deep Neural Networks

Utilizing recently introduced concepts from statistics and quantitative risk management, we present a general variant of Batch Normalization (BN) that offers accelerated convergence of Neural Network training compared to conventional BN. In…

Machine Learning · Computer Science 2018-12-11 Xiaoyong Yuan , Zheng Feng , Matthew Norton , Xiaolin Li

Batch Group Normalization

Deep Convolutional Neural Networks (DCNNs) are hard and time-consuming to train. Normalization is one of the effective solutions. Among previous normalization methods, Batch Normalization (BN) performs well at medium and large batch sizes…

Machine Learning · Computer Science 2020-12-10 Xiao-Yun Zhou , Jiacheng Sun , Nanyang Ye , Xu Lan , Qijun Luo , Bo-Lin Lai , Pedro Esperanca , Guang-Zhong Yang , Zhenguo Li

Making Batch Normalization Great in Federated Deep Learning

Batch Normalization (BN) is widely used in {centralized} deep learning to improve convergence and generalization. However, in {federated} learning (FL) with decentralized data, prior work has observed that training with BN could hinder…

Machine Learning · Computer Science 2024-04-01 Jike Zhong , Hong-You Chen , Wei-Lun Chao

Exponential convergence rates for Batch Normalization: The power of length-direction decoupling in non-convex optimization

Normalization techniques such as Batch Normalization have been applied successfully for training deep neural networks. Yet, despite its apparent empirical benefits, the reasons behind the success of Batch Normalization are mostly…

Machine Learning · Statistics 2018-10-09 Jonas Kohler , Hadi Daneshmand , Aurelien Lucchi , Ming Zhou , Klaus Neymeyr , Thomas Hofmann