English
Related papers

Related papers: Understanding Batch Normalization

200 papers

Batch Normalization (BN) improves both convergence and generalization in training neural networks. This work understands these phenomena theoretically. We analyze BN by using a basic block of neural networks, consisting of a kernel layer, a…

Machine Learning · Computer Science 2019-04-25 Ping Luo , Xinjiang Wang , Wenqi Shao , Zhanglin Peng

Batch normalization (BN) is a popular and ubiquitous method in deep learning that has been shown to decrease training time and improve generalization performance of neural networks. Despite its success, BN is not theoretically well…

Machine Learning · Computer Science 2022-01-21 Susanna Lange , Kyle Helfrich , Qiang Ye

A critically important, ubiquitous, and yet poorly understood ingredient in modern deep networks (DNs) is batch normalization (BN), which centers and normalizes the feature maps. To date, only limited progress has been made understanding…

Machine Learning · Computer Science 2022-09-30 Randall Balestriero , Richard G. Baraniuk

Batch Normalization (BatchNorm) is a widely adopted technique that enables faster and more stable training of deep neural networks (DNNs). Despite its pervasiveness, the exact reasons for BatchNorm's effectiveness are still poorly…

Machine Learning · Statistics 2019-04-16 Shibani Santurkar , Dimitris Tsipras , Andrew Ilyas , Aleksander Madry

Batch normalization is widely used in deep learning to normalize intermediate activations. Deep networks suffer from notoriously increased training complexity, mandating careful initialization of weights, requiring lower learning rates,…

Machine Learning · Statistics 2022-10-19 Lakshmi Annamalai , Chetan Singh Thakur

Batch Normalization (BN) has become a cornerstone of deep learning across diverse architectures, appearing to help optimization as well as generalization. While the idea makes intuitive sense, theoretical analysis of its effectiveness has…

Machine Learning · Computer Science 2018-12-11 Sanjeev Arora , Zhiyuan Li , Kaifeng Lyu

Batch normalization (BN) is a fundamental unit in modern deep networks, in which a linear transformation module was designed for improving BN's flexibility of fitting complex data distributions. In this paper, we demonstrate properly…

Computer Vision and Pattern Recognition · Computer Science 2020-12-01 Yuhui Xu , Lingxi Xie , Cihang Xie , Jieru Mei , Siyuan Qiao , Wei Shen , Hongkai Xiong , Alan Yuille

Batch Normalization (BN) has been used extensively in deep learning to achieve faster training process and better resulting models. However, whether BN works strongly depends on how the batches are constructed during training and it may not…

Optimization and Control · Mathematics 2018-10-16 Xiangru Lian , Ji Liu

In this work we investigate the reasons why Batch Normalization (BN) improves the generalization performance of deep networks. We argue that one major reason, distinguishing it from data-independent normalization methods, is randomness of…

Machine Learning · Computer Science 2018-11-05 Alexander Shekhovtsov , Boris Flach

Training neural networks is an optimization problem, and finding a decent set of parameters through gradient descent can be a difficult task. A host of techniques has been developed to aid this process before and during the training phase.…

Machine Learning · Computer Science 2020-08-19 Divya Gaur , Joachim Folz , Andreas Dengel

Batch Normalization (BN) is a commonly used technique to accelerate and stabilize training of deep neural networks. Despite its empirical success, a full theoretical understanding of BN is yet to be developed. In this work, we analyze BN…

Machine Learning · Computer Science 2022-03-22 Tolga Ergen , Arda Sahiner , Batu Ozturkler , John Pauly , Morteza Mardani , Mert Pilanci

In this paper, we propose a generalization of the Batch Normalization (BN) algorithm, diminishing batch normalization (DBN), where we update the BN parameters in a diminishing moving average way. BN is very effective in accelerating the…

Machine Learning · Computer Science 2019-02-20 Yintai Ma , Diego Klabjan

Batch Normalization (BN) is widely used in {centralized} deep learning to improve convergence and generalization. However, in {federated} learning (FL) with decentralized data, prior work has observed that training with BN could hinder…

Machine Learning · Computer Science 2024-04-01 Jike Zhong , Hong-You Chen , Wei-Lun Chao

Batch Normalization (BN) is a milestone technique in the development of deep learning, enabling various networks to train. However, normalizing along the batch dimension introduces problems --- BN's error increases rapidly when the batch…

Computer Vision and Pattern Recognition · Computer Science 2018-06-13 Yuxin Wu , Kaiming He

Normalization is a pre-processing step that converts the data into a more usable representation. As part of the deep neural networks (DNNs), the batch normalization (BN) technique uses normalization to address the problem of internal…

Computer Vision and Pattern Recognition · Computer Science 2024-09-10 Bilal Faye , Mohamed-Djallel Dilmi , Hanane Azzag , Mustapha Lebbah , Djamel Bouchaffra

Batch normalization (BN) is comprised of a normalization component followed by an affine transformation and has become essential for training deep neural networks. Standard initialization of each BN in a network sets the affine…

Computer Vision and Pattern Recognition · Computer Science 2022-07-18 Jim Davis , Logan Frank

Utilizing recently introduced concepts from statistics and quantitative risk management, we present a general variant of Batch Normalization (BN) that offers accelerated convergence of Neural Network training compared to conventional BN. In…

Machine Learning · Computer Science 2018-12-11 Xiaoyong Yuan , Zheng Feng , Matthew Norton , Xiaolin Li

Training state-of-the-art, deep neural networks is computationally expensive. One way to reduce the training time is to normalize the activities of the neurons. A recently introduced technique called batch normalization uses the…

Machine Learning · Statistics 2016-07-22 Jimmy Lei Ba , Jamie Ryan Kiros , Geoffrey E. Hinton

Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates…

Machine Learning · Computer Science 2015-03-03 Sergey Ioffe , Christian Szegedy

Batch Normalization (BN) is one of the most widely used techniques in Deep Learning field. But its performance can awfully degrade with insufficient batch size. This weakness limits the usage of BN on many computer vision tasks like…

Computer Vision and Pattern Recognition · Computer Science 2020-06-17 Junjie Yan , Ruosi Wan , Xiangyu Zhang , Wei Zhang , Yichen Wei , Jian Sun
‹ Prev 1 2 3 10 Next ›