English
Related papers

Related papers: Cross-Iteration Batch Normalization

200 papers

Batch Normalization (BN) has become an out-of-box technique to improve deep network training. However, its effectiveness is limited for micro-batch training, i.e., each GPU typically has only 1-2 images for training, which is inevitable for…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Siyuan Qiao , Huiyu Wang , Chenxi Liu , Wei Shen , Alan Yuille

Batch normalization (BN) has become a standard technique for training the modern deep networks. However, its effectiveness diminishes when the batch size becomes smaller, since the batch statistics estimation becomes inaccurate. That…

Computer Vision and Pattern Recognition · Computer Science 2020-03-13 Chunjie Luo , Jianfeng Zhan , Lei Wang , Wanling Gao

Batch Normalization (BN) is a milestone technique in the development of deep learning, enabling various networks to train. However, normalizing along the batch dimension introduces problems --- BN's error increases rapidly when the batch…

Computer Vision and Pattern Recognition · Computer Science 2018-06-13 Yuxin Wu , Kaiming He

Over the past few years, Batch-Normalization has been commonly used in deep networks, allowing faster training and high performance for a wide variety of applications. However, the reasons behind its merits remained unanswered, with several…

Machine Learning · Statistics 2019-02-08 Elad Hoffer , Ron Banner , Itay Golan , Daniel Soudry

Normalization techniques have been widely used in the field of deep learning due to their capability of enabling higher learning rates and are less careful in initialization. However, the effectiveness of popular normalization technologies…

Computer Vision and Pattern Recognition · Computer Science 2023-12-04 Afifa Khaled , Chao Li , Jia Ning , Kun He

Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates…

Machine Learning · Computer Science 2015-03-03 Sergey Ioffe , Christian Szegedy

Batch Normalization is an important approach to advancing deep learning since it allows multiple networks to train simultaneously. A problem arises when normalizing along the batch dimension because B.N.'s error increases significantly as…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Gousia Habib , Ishfaq Ahmed Malik , Jameel Ahmad , Imtiaz Ahmed , Shaima Qureshi

Existing continual learning methods use Batch Normalization (BN) to facilitate training and improve generalization across tasks. However, the non-i.i.d and non-stationary nature of continual learning data, especially in the online setting,…

Machine Learning · Computer Science 2022-03-31 Quang Pham , Chenghao Liu , Steven Hoi

Deep Neural Networks (DNNs) thrive in recent years in which Batch Normalization (BN) plays an indispensable role. However, it has been observed that BN is costly due to the reduction operations. In this paper, we propose alleviating this…

Machine Learning · Computer Science 2018-11-05 Zhaodong Chen , Lei Deng , Guoqi Li , Jiawei Sun , Xing Hu , Xin Ma , Yuan Xie

While the authors of Batch Normalization (BN) identify and address an important problem involved in training deep networks-- \textit{Internal Covariate Shift}-- the current solution has certain drawbacks. For instance, BN depends on batch…

Machine Learning · Statistics 2016-06-21 Devansh Arpit , Yingbo Zhou , Hung Ngo , Venu Govindaraju

Batch normalization (BN) has become a de facto standard for training deep convolutional networks. However, BN accounts for a significant fraction of training run-time and is difficult to accelerate, since it is a memory-bandwidth bounded…

Computer Vision and Pattern Recognition · Computer Science 2017-10-10 Igor Gitman , Boris Ginsburg

This study introduces a new normalization layer termed Batch Layer Normalization (BLN) to reduce the problem of internal covariate shift in deep neural network layers. As a combined version of batch and layer normalization, BLN adaptively…

Machine Learning · Computer Science 2023-01-16 Amir Ziaee , Erion Çano

In real-world scenarios, the number of training samples across classes usually subjects to a long-tailed distribution. The conventionally trained network may achieve unexpected inferior performance on the rare class compared to the frequent…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Yuxiang Bao , Guoliang Kang , Linlin Yang , Xiaoyue Duan , Bo Zhao , Baochang Zhang

Batch normalization (BN) is a popular and ubiquitous method in deep learning that has been shown to decrease training time and improve generalization performance of neural networks. Despite its success, BN is not theoretically well…

Machine Learning · Computer Science 2022-01-21 Susanna Lange , Kyle Helfrich , Qiang Ye

While the authors of Batch Normalization (BN) identify and address an important problem involved in training deep networks-- Internal Covariate Shift-- the current solution has certain drawbacks. Specifically, BN depends on batch statistics…

Machine Learning · Statistics 2016-07-13 Devansh Arpit , Yingbo Zhou , Bhargava U. Kota , Venu Govindaraju

This paper proposes a novel batch normalization strategy for test-time adaptation. Recent test-time adaptation methods heavily rely on the modified batch normalization, i.e., transductive batch normalization (TBN), which calculates the mean…

Computer Vision and Pattern Recognition · Computer Science 2023-02-21 Hyesu Lim , Byeonggeun Kim , Jaegul Choo , Sungha Choi

Recurrent Neural Networks (RNNs) are powerful models for sequential data that have the potential to learn long-term dependencies. However, they are computationally expensive to train and difficult to parallelize. Recent work has shown that…

Machine Learning · Statistics 2015-10-07 César Laurent , Gabriel Pereyra , Philémon Brakel , Ying Zhang , Yoshua Bengio

Training state-of-the-art, deep neural networks is computationally expensive. One way to reduce the training time is to normalize the activities of the neurons. A recently introduced technique called batch normalization uses the…

Machine Learning · Statistics 2016-07-22 Jimmy Lei Ba , Jamie Ryan Kiros , Geoffrey E. Hinton

Normalization is a pre-processing step that converts the data into a more usable representation. As part of the deep neural networks (DNNs), the batch normalization (BN) technique uses normalization to address the problem of internal…

Computer Vision and Pattern Recognition · Computer Science 2024-09-10 Bilal Faye , Mohamed-Djallel Dilmi , Hanane Azzag , Mustapha Lebbah , Djamel Bouchaffra

Deep Convolutional Neural Networks (DCNNs) are hard and time-consuming to train. Normalization is one of the effective solutions. Among previous normalization methods, Batch Normalization (BN) performs well at medium and large batch sizes…

Machine Learning · Computer Science 2020-12-10 Xiao-Yun Zhou , Jiacheng Sun , Nanyang Ye , Xu Lan , Qijun Luo , Bo-Lin Lai , Pedro Esperanca , Guang-Zhong Yang , Zhenguo Li
‹ Prev 1 2 3 10 Next ›