English
Related papers

Related papers: Restructuring Batch Normalization to Accelerate CN…

200 papers

In this paper, we propose a generalization of the Batch Normalization (BN) algorithm, diminishing batch normalization (DBN), where we update the BN parameters in a diminishing moving average way. BN is very effective in accelerating the…

Machine Learning · Computer Science 2019-02-20 Yintai Ma , Diego Klabjan

Batch normalization (BN) is a fundamental unit in modern deep networks, in which a linear transformation module was designed for improving BN's flexibility of fitting complex data distributions. In this paper, we demonstrate properly…

Computer Vision and Pattern Recognition · Computer Science 2020-12-01 Yuhui Xu , Lingxi Xie , Cihang Xie , Jieru Mei , Siyuan Qiao , Wei Shen , Hongkai Xiong , Alan Yuille

Batch normalization (BN) is a popular and ubiquitous method in deep learning that has been shown to decrease training time and improve generalization performance of neural networks. Despite its success, BN is not theoretically well…

Machine Learning · Computer Science 2022-01-21 Susanna Lange , Kyle Helfrich , Qiang Ye

Batch normalization (BN) is a technique to normalize activations in intermediate layers of deep neural networks. Its tendency to improve accuracy and speed up training have established BN as a favorite technique in deep learning. Yet,…

Machine Learning · Computer Science 2018-12-03 Johan Bjorck , Carla Gomes , Bart Selman , Kilian Q. Weinberger

Batch Normalization (BN) has become an essential technique in contemporary neural network design, enhancing training stability. Specifically, BN employs centering and scaling operations to standardize features along the batch dimension and…

Computer Vision and Pattern Recognition · Computer Science 2024-03-11 Shaobo Wang , Xiangdong Zhang , Dongrui Liu , Junchi Yan

Batch Normalization (BN) improves both convergence and generalization in training neural networks. This work understands these phenomena theoretically. We analyze BN by using a basic block of neural networks, consisting of a kernel layer, a…

Machine Learning · Computer Science 2019-04-25 Ping Luo , Xinjiang Wang , Wenqi Shao , Zhanglin Peng

This study introduces a new normalization layer termed Batch Layer Normalization (BLN) to reduce the problem of internal covariate shift in deep neural network layers. As a combined version of batch and layer normalization, BLN adaptively…

Machine Learning · Computer Science 2023-01-16 Amir Ziaee , Erion Çano

Batch Normalization (BN) is essential to effectively train state-of-the-art deep Convolutional Neural Networks (CNN). It normalizes inputs to the layers during training using the statistics of each mini-batch. In this work, we study BN from…

Machine Learning · Computer Science 2018-11-16 Mahdi M. Kalayeh , Mubarak Shah

Training state-of-the-art, deep neural networks is computationally expensive. One way to reduce the training time is to normalize the activities of the neurons. A recently introduced technique called batch normalization uses the…

Machine Learning · Statistics 2016-07-22 Jimmy Lei Ba , Jamie Ryan Kiros , Geoffrey E. Hinton

Batch Normalization (BN) is a commonly used technique to accelerate and stabilize training of deep neural networks. Despite its empirical success, a full theoretical understanding of BN is yet to be developed. In this work, we analyze BN…

Machine Learning · Computer Science 2022-03-22 Tolga Ergen , Arda Sahiner , Batu Ozturkler , John Pauly , Morteza Mardani , Mert Pilanci

Deep Convolutional Neural Networks (DCNNs) are hard and time-consuming to train. Normalization is one of the effective solutions. Among previous normalization methods, Batch Normalization (BN) performs well at medium and large batch sizes…

Machine Learning · Computer Science 2020-12-10 Xiao-Yun Zhou , Jiacheng Sun , Nanyang Ye , Xu Lan , Qijun Luo , Bo-Lin Lai , Pedro Esperanca , Guang-Zhong Yang , Zhenguo Li

Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates…

Machine Learning · Computer Science 2015-03-03 Sergey Ioffe , Christian Szegedy

Batch Normalization (BN) has become an out-of-box technique to improve deep network training. However, its effectiveness is limited for micro-batch training, i.e., each GPU typically has only 1-2 images for training, which is inevitable for…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Siyuan Qiao , Huiyu Wang , Chenxi Liu , Wei Shen , Alan Yuille

In this paper, we have studied how the training of the convolutional neural networks (CNNs) can be affected by changing the position of the batch normalization (BN) layer. Three different convolutional neural networks have been chosen for…

Computer Vision and Pattern Recognition · Computer Science 2020-04-24 Moein Hasani , Hassan Khotanlou

Recurrent Neural Networks (RNNs) are powerful models for sequential data that have the potential to learn long-term dependencies. However, they are computationally expensive to train and difficult to parallelize. Recent work has shown that…

Machine Learning · Statistics 2015-10-07 César Laurent , Gabriel Pereyra , Philémon Brakel , Ying Zhang , Yoshua Bengio

A critically important, ubiquitous, and yet poorly understood ingredient in modern deep networks (DNs) is batch normalization (BN), which centers and normalizes the feature maps. To date, only limited progress has been made understanding…

Machine Learning · Computer Science 2022-09-30 Randall Balestriero , Richard G. Baraniuk

Batch Normalization (BN) has been proven to be quite effective at accelerating and improving the training of deep neural networks (DNNs). However, BN brings additional computation, consumes more memory and generally slows down the training…

Machine Learning · Computer Science 2019-05-23 Shuang Wu , Guoqi Li , Lei Deng , Liu Liu , Yuan Xie , Luping Shi

Batch Normalization (BN) is widely used in {centralized} deep learning to improve convergence and generalization. However, in {federated} learning (FL) with decentralized data, prior work has observed that training with BN could hinder…

Machine Learning · Computer Science 2024-04-01 Jike Zhong , Hong-You Chen , Wei-Lun Chao

Batch Normalization has become one of the essential components in CNN. It allows the network to use a higher learning rate and speed up training. And the network doesn't need to be initialized carefully. However, in our work, we find that a…

Computer Vision and Pattern Recognition · Computer Science 2021-01-18 Chuan Liu , Yi Gao , Jiancheng Lv

Normalization techniques have been widely used in the field of deep learning due to their capability of enabling higher learning rates and are less careful in initialization. However, the effectiveness of popular normalization technologies…

Computer Vision and Pattern Recognition · Computer Science 2023-12-04 Afifa Khaled , Chao Li , Jia Ning , Kun He
‹ Prev 1 2 3 10 Next ›