English
Related papers

Related papers: Counterbalancing Teacher: Regularizing Batch Norma…

200 papers

Batch normalization (BN) is a technique to normalize activations in intermediate layers of deep neural networks. Its tendency to improve accuracy and speed up training have established BN as a favorite technique in deep learning. Yet,…

Machine Learning · Computer Science 2018-12-03 Johan Bjorck , Carla Gomes , Bart Selman , Kilian Q. Weinberger

Batch Normalization (BN) improves both convergence and generalization in training neural networks. This work understands these phenomena theoretically. We analyze BN by using a basic block of neural networks, consisting of a kernel layer, a…

Machine Learning · Computer Science 2019-04-25 Ping Luo , Xinjiang Wang , Wenqi Shao , Zhanglin Peng

Existing continual learning methods use Batch Normalization (BN) to facilitate training and improve generalization across tasks. However, the non-i.i.d and non-stationary nature of continual learning data, especially in the online setting,…

Machine Learning · Computer Science 2022-03-31 Quang Pham , Chenghao Liu , Steven Hoi

Batch Normalization (BN) is widely used in {centralized} deep learning to improve convergence and generalization. However, in {federated} learning (FL) with decentralized data, prior work has observed that training with BN could hinder…

Machine Learning · Computer Science 2024-04-01 Jike Zhong , Hong-You Chen , Wei-Lun Chao

Batch Normalization (BN) has become a cornerstone of deep learning across diverse architectures, appearing to help optimization as well as generalization. While the idea makes intuitive sense, theoretical analysis of its effectiveness has…

Machine Learning · Computer Science 2018-12-11 Sanjeev Arora , Zhiyuan Li , Kaifeng Lyu

Batch normalization is widely used in deep learning to normalize intermediate activations. Deep networks suffer from notoriously increased training complexity, mandating careful initialization of weights, requiring lower learning rates,…

Machine Learning · Statistics 2022-10-19 Lakshmi Annamalai , Chetan Singh Thakur

In real-world scenarios, the number of training samples across classes usually subjects to a long-tailed distribution. The conventionally trained network may achieve unexpected inferior performance on the rare class compared to the frequent…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Yuxiang Bao , Guoliang Kang , Linlin Yang , Xiaoyue Duan , Bo Zhao , Baochang Zhang

Extensive researches have applied deep neural networks (DNNs) in class incremental learning (Class-IL). As building blocks of DNNs, batch normalization (BN) standardizes intermediate feature maps and has been widely validated to improve…

Machine Learning · Computer Science 2022-02-17 Minghao Zhou , Quanziang Wang , Jun Shu , Qian Zhao , Deyu Meng

Batch Normalization (BN) is a commonly used technique to accelerate and stabilize training of deep neural networks. Despite its empirical success, a full theoretical understanding of BN is yet to be developed. In this work, we analyze BN…

Machine Learning · Computer Science 2022-03-22 Tolga Ergen , Arda Sahiner , Batu Ozturkler , John Pauly , Morteza Mardani , Mert Pilanci

Batch normalization (BN) is a popular and ubiquitous method in deep learning that has been shown to decrease training time and improve generalization performance of neural networks. Despite its success, BN is not theoretically well…

Machine Learning · Computer Science 2022-01-21 Susanna Lange , Kyle Helfrich , Qiang Ye

Batch Normalization (BN) has been used extensively in deep learning to achieve faster training process and better resulting models. However, whether BN works strongly depends on how the batches are constructed during training and it may not…

Optimization and Control · Mathematics 2018-10-16 Xiangru Lian , Ji Liu

Batch Normalization (BN) and its variants has been extensively studied for neural nets in various computer vision tasks, but relatively little work has been dedicated to studying the effect of BN in continual learning. To that end, we…

Computer Vision and Pattern Recognition · Computer Science 2023-04-19 Sungmin Cha , Sungjun Cho , Dasol Hwang , Sunwon Hong , Moontae Lee , Taesup Moon

Normalization methods are a central building block in the deep learning toolbox. They accelerate and stabilize training, while decreasing the dependence on manually tuned learning rate schedules. When learning from multi-modal…

Machine Learning · Computer Science 2018-10-15 Lucas Deecke , Iain Murray , Hakan Bilen

Despite the significant success of deep learning in computer vision tasks, cross-domain tasks still present a challenge in which the model's performance will degrade when the training set and the test set follow different distributions.…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Lei Qi , Dongjia Zhao , Yinghuan Shi , Xin Geng

Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates…

Machine Learning · Computer Science 2015-03-03 Sergey Ioffe , Christian Szegedy

Batch Normalization (BN) is a core and prevalent technique in accelerating the training of deep neural networks and improving the generalization on Computer Vision (CV) tasks. However, it fails to defend its position in Natural Language…

Computation and Language · Computer Science 2022-10-14 Jiaxi Wang , Ji Wu , Lei Huang

In this paper, we propose a generalization of the Batch Normalization (BN) algorithm, diminishing batch normalization (DBN), where we update the BN parameters in a diminishing moving average way. BN is very effective in accelerating the…

Machine Learning · Computer Science 2019-02-20 Yintai Ma , Diego Klabjan

Batch Normalization (BN) is essential to effectively train state-of-the-art deep Convolutional Neural Networks (CNN). It normalizes inputs to the layers during training using the statistics of each mini-batch. In this work, we study BN from…

Machine Learning · Computer Science 2018-11-16 Mahdi M. Kalayeh , Mubarak Shah

Batch normalization (BN) has been widely used in modern deep neural networks (DNNs) due to improved convergence. BN is observed to increase the model accuracy while at the cost of adversarial robustness. There is an increasing interest in…

Machine Learning · Computer Science 2021-10-08 Philipp Benz , Chaoning Zhang , In So Kweon

Batch normalization (BN) is a key facilitator and considered essential for state-of-the-art binary neural networks (BNN). However, the BN layer is costly to calculate and is typically implemented with non-binary parameters, leaving a hurdle…

Machine Learning · Computer Science 2021-04-19 Tianlong Chen , Zhenyu Zhang , Xu Ouyang , Zechun Liu , Zhiqiang Shen , Zhangyang Wang
‹ Prev 1 2 3 10 Next ›