Related papers: WeightAlign: Normalizing Activations by Weight Ali…

Mean Shift Rejection: Training Deep Neural Networks Without Minibatch Statistics or Normalization

Deep convolutional neural networks are known to be unstable during training at high learning rate unless normalization techniques are employed. Normalizing weights or activations allows the use of higher learning rates, resulting in faster…

Machine Learning · Computer Science 2019-12-02 Brendan Ruff , Taylor Beck , Joscha Bach

Improvement of Batch Normalization in Imbalanced Data

In this study, we consider classification problems based on neural networks in data-imbalanced environment. Learning from an imbalanced data set is one of the most important and practical problems in the field of machine learning. A…

Machine Learning · Statistics 2019-12-02 Muneki Yasuda , Seishirou Ueno

Theoretical Insight into Batch Normalization: Data Dependant Auto-Tuning of Regularization Rate

Batch normalization is widely used in deep learning to normalize intermediate activations. Deep networks suffer from notoriously increased training complexity, mandating careful initialization of weights, requiring lower learning rates,…

Machine Learning · Statistics 2022-10-19 Lakshmi Annamalai , Chetan Singh Thakur

Comparison of Batch Normalization and Weight Normalization Algorithms for the Large-scale Image Classification

Batch normalization (BN) has become a de facto standard for training deep convolutional networks. However, BN accounts for a significant fraction of training run-time and is difficult to accelerate, since it is a memory-bandwidth bounded…

Computer Vision and Pattern Recognition · Computer Science 2017-10-10 Igor Gitman , Boris Ginsburg

Consistent Batch Normalization for Weighted Loss in Imbalanced-Data Environment

In this study, classification problems based on feedforward neural networks in a data-imbalanced environment are considered. Learning from an imbalanced dataset is one of the most important practical problems in the field of machine…

Machine Learning · Statistics 2020-12-23 Muneki Yasuda , Yeo Xian En , Seishirou Ueno

Batch Layer Normalization, A new normalization layer for CNNs and RNN

This study introduces a new normalization layer termed Batch Layer Normalization (BLN) to reduce the problem of internal covariate shift in deep neural network layers. As a combined version of batch and layer normalization, BLN adaptively…

Machine Learning · Computer Science 2023-01-16 Amir Ziaee , Erion Çano

Batch Normalization Sampling

Deep Neural Networks (DNNs) thrive in recent years in which Batch Normalization (BN) plays an indispensable role. However, it has been observed that BN is costly due to the reduction operations. In this paper, we propose alleviating this…

Machine Learning · Computer Science 2018-11-05 Zhaodong Chen , Lei Deng , Guoqi Li , Jiawei Sun , Xing Hu , Xin Ma , Yuan Xie

"BNN - BN = ?": Training Binary Neural Networks without Batch Normalization

Batch normalization (BN) is a key facilitator and considered essential for state-of-the-art binary neural networks (BNN). However, the BN layer is costly to calculate and is typically implemented with non-binary parameters, leaving a hurdle…

Machine Learning · Computer Science 2021-04-19 Tianlong Chen , Zhenyu Zhang , Xu Ouyang , Zechun Liu , Zhiqiang Shen , Zhangyang Wang

Controlling Covariate Shift using Balanced Normalization of Weights

We introduce a new normalization technique that exhibits the fast convergence properties of batch normalization using a transformation of layer weights instead of layer outputs. The proposed technique keeps the contribution of positive and…

Machine Learning · Computer Science 2019-05-13 Aaron Defazio , Léon Bottou

Norm matters: efficient and accurate normalization schemes in deep networks

Over the past few years, Batch-Normalization has been commonly used in deep networks, allowing faster training and high performance for a wide variety of applications. However, the reasons behind its merits remained unanswered, with several…

Machine Learning · Statistics 2019-02-08 Elad Hoffer , Ron Banner , Itay Golan , Daniel Soudry

On the Effects of Batch and Weight Normalization in Generative Adversarial Networks

Generative adversarial networks (GANs) are highly effective unsupervised learning frameworks that can generate very sharp data, even for data such as images with complex, highly multimodal distributions. However GANs are known to be very…

Machine Learning · Statistics 2017-12-05 Sitao Xiang , Hao Li

MimicNorm: Weight Mean and Last BN Layer Mimic the Dynamic of Batch Normalization

Substantial experiments have validated the success of Batch Normalization (BN) Layer in benefiting convergence and generalization. However, BN requires extra memory and float-point calculation. Moreover, BN would be inaccurate on…

Machine Learning · Computer Science 2024-10-30 Wen Fei , Wenrui Dai , Chenglin Li , Junni Zou , Hongkai Xiong

Cross-Iteration Batch Normalization

A well-known issue of Batch Normalization is its significantly reduced effectiveness in the case of small mini-batch sizes. When a mini-batch contains few examples, the statistics upon which the normalization is defined cannot be reliably…

Machine Learning · Computer Science 2021-03-26 Zhuliang Yao , Yue Cao , Shuxin Zheng , Gao Huang , Stephen Lin

Filter Response Normalization Layer: Eliminating Batch Dependence in the Training of Deep Neural Networks

Batch Normalization (BN) uses mini-batch statistics to normalize the activations during training, introducing dependence between mini-batch elements. This dependency can hurt the performance if the mini-batch size is too small, or if the…

Machine Learning · Computer Science 2020-04-02 Saurabh Singh , Shankar Krishnan

Normalization Propagation: A Parametric Technique for Removing Internal Covariate Shift in Deep Networks

While the authors of Batch Normalization (BN) identify and address an important problem involved in training deep networks-- Internal Covariate Shift-- the current solution has certain drawbacks. Specifically, BN depends on batch statistics…

Machine Learning · Statistics 2016-07-13 Devansh Arpit , Yingbo Zhou , Bhargava U. Kota , Venu Govindaraju

Riemannian approach to batch normalization

Batch Normalization (BN) has proven to be an effective algorithm for deep neural network training by normalizing the input to each neuron and reducing the internal covariate shift. The space of weight vectors in the BN layer can be…

Machine Learning · Computer Science 2017-11-01 Minhyung Cho , Jaehyung Lee

Micro-Batch Training with Batch-Channel Normalization and Weight Standardization

Batch Normalization (BN) has become an out-of-box technique to improve deep network training. However, its effectiveness is limited for micro-batch training, i.e., each GPU typically has only 1-2 images for training, which is inevitable for…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Siyuan Qiao , Huiyu Wang , Chenxi Liu , Wei Shen , Alan Yuille

Diminishing Batch Normalization

In this paper, we propose a generalization of the Batch Normalization (BN) algorithm, diminishing batch normalization (DBN), where we update the BN parameters in a diminishing moving average way. BN is very effective in accelerating the…

Machine Learning · Computer Science 2019-02-20 Yintai Ma , Diego Klabjan

Rebalancing Batch Normalization for Exemplar-based Class-Incremental Learning

Batch Normalization (BN) and its variants has been extensively studied for neural nets in various computer vision tasks, but relatively little work has been dedicated to studying the effect of BN in continual learning. To that end, we…

Computer Vision and Pattern Recognition · Computer Science 2023-04-19 Sungmin Cha , Sungjun Cho , Dasol Hwang , Sunwon Hong , Moontae Lee , Taesup Moon

Adaptive Batch Normalization for Training Data with Heterogeneous Features

Batch Normalization (BN) is an important preprocessing step to many deep learning applications. Since it is a data-dependent process, for some homogeneous datasets it is a redundant or even a performance-degrading process. In this paper, we…

Machine Learning · Computer Science 2022-12-01 Wael Alsobhi , Tarik Alafif , Alaa Abdel-Hakim , Weiwei Zong