Related papers: Differentiable Learning-to-Normalize via Switchabl…

Switchable Normalization for Learning-to-Normalize Deep Representation

We address a learning-to-normalize problem by proposing Switchable Normalization (SN), which learns to select different normalizers for different normalization layers of a deep neural network. SN employs three distinct scopes to compute…

Computer Vision and Pattern Recognition · Computer Science 2019-07-25 Ping Luo , Ruimao Zhang , Jiamin Ren , Zhanglin Peng , Jingyu Li

SSN: Learning Sparse Switchable Normalization via SparsestMax

Normalization methods improve both optimization and generalization of ConvNets. To further boost performance, the recently-proposed switchable normalization (SN) provides a new perspective for deep learning: it learns to select different…

Computer Vision and Pattern Recognition · Computer Science 2019-03-12 Wenqi Shao , Tianjian Meng , Jingyu Li , Ruimao Zhang , Yudian Li , Xiaogang Wang , Ping Luo

Exemplar Normalization for Learning Deep Representation

Normalization techniques are important in different advanced neural networks and different tasks. This work investigates a novel dynamic learning-to-normalize (L2N) problem by proposing Exemplar Normalization (EN), which is able to learn…

Computer Vision and Pattern Recognition · Computer Science 2020-03-23 Ruimao Zhang , Zhanglin Peng , Lingyun Wu , Zhen Li , Ping Luo

Switchable Whitening for Deep Representation Learning

Normalization methods are essential components in convolutional neural networks (CNNs). They either standardize or whiten data using statistics estimated in predefined sets of pixels. Unlike existing works that design normalization…

Computer Vision and Pattern Recognition · Computer Science 2019-12-13 Xingang Pan , Xiaohang Zhan , Jianping Shi , Xiaoou Tang , Ping Luo

Training Deep Neural Networks Without Batch Normalization

Training neural networks is an optimization problem, and finding a decent set of parameters through gradient descent can be a difficult task. A host of techniques has been developed to aid this process before and during the training phase.…

Machine Learning · Computer Science 2020-08-19 Divya Gaur , Joachim Folz , Andreas Dengel

Mean Shift Rejection: Training Deep Neural Networks Without Minibatch Statistics or Normalization

Deep convolutional neural networks are known to be unstable during training at high learning rate unless normalization techniques are employed. Normalizing weights or activations allows the use of higher learning rates, resulting in faster…

Machine Learning · Computer Science 2019-12-02 Brendan Ruff , Taylor Beck , Joscha Bach

Batch Layer Normalization, A new normalization layer for CNNs and RNN

This study introduces a new normalization layer termed Batch Layer Normalization (BLN) to reduce the problem of internal covariate shift in deep neural network layers. As a combined version of batch and layer normalization, BLN adaptively…

Machine Learning · Computer Science 2023-01-16 Amir Ziaee , Erion Çano

New Interpretations of Normalization Methods in Deep Learning

In recent years, a variety of normalization methods have been proposed to help train neural networks, such as batch normalization (BN), layer normalization (LN), weight normalization (WN), group normalization (GN), etc. However,…

Machine Learning · Computer Science 2020-06-17 Jiacheng Sun , Xiangyong Cao , Hanwen Liang , Weiran Huang , Zewei Chen , Zhenguo Li

Do Normalization Layers in a Deep ConvNet Really Need to Be Distinct?

Yes, they do. This work investigates a perspective for deep learning: whether different normalization layers in a ConvNet require different normalizers. This is the first step towards understanding this phenomenon. We allow each…

Computer Vision and Pattern Recognition · Computer Science 2018-11-20 Ping Luo , Zhanglin Peng , Jiamin Ren , Ruimao Zhang

Learning to Learn Parameterized Classification Networks for Scalable Input Images

Convolutional Neural Networks (CNNs) do not have a predictable recognition behavior with respect to the input resolution change. This prevents the feasibility of deployment on different input image resolutions for a specific model. To…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Duo Li , Anbang Yao , Qifeng Chen

Overcoming Recency Bias of Normalization Statistics in Continual Learning: Balance and Adaptation

Continual learning entails learning a sequence of tasks and balancing their knowledge appropriately. With limited access to old training samples, much of the current work in deep neural networks has focused on overcoming catastrophic…

Machine Learning · Computer Science 2023-10-16 Yilin Lyu , Liyuan Wang , Xingxing Zhang , Zicheng Sun , Hang Su , Jun Zhu , Liping Jing

Mean Spectral Normalization of Deep Neural Networks for Embedded Automation

Deep Neural Networks (DNNs) have begun to thrive in the field of automation systems, owing to the recent advancements in standardising various aspects such as architecture, optimization techniques, and regularization. In this paper, we take…

Machine Learning · Computer Science 2019-07-10 Anand Krishnamoorthy Subramanian , Nak Young Chong

Layer Normalization

Training state-of-the-art, deep neural networks is computationally expensive. One way to reduce the training time is to normalize the activities of the neurons. A recently introduced technique called batch normalization uses the…

Machine Learning · Statistics 2016-07-22 Jimmy Lei Ba , Jamie Ryan Kiros , Geoffrey E. Hinton

Deep Adaptive Input Normalization for Time Series Forecasting

Deep Learning (DL) models can be used to tackle time series analysis tasks with great success. However, the performance of DL models can degenerate rapidly if the data are not appropriately normalized. This issue is even more apparent when…

Computational Finance · Quantitative Finance 2019-09-24 Nikolaos Passalis , Anastasios Tefas , Juho Kanniainen , Moncef Gabbouj , Alexandros Iosifidis

Normalizing Batch Normalization for Long-Tailed Recognition

In real-world scenarios, the number of training samples across classes usually subjects to a long-tailed distribution. The conventionally trained network may achieve unexpected inferior performance on the rare class compared to the frequent…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Yuxiang Bao , Guoliang Kang , Linlin Yang , Xiaoyue Duan , Bo Zhao , Baochang Zhang

Batch Normalized Recurrent Neural Networks

Recurrent Neural Networks (RNNs) are powerful models for sequential data that have the potential to learn long-term dependencies. However, they are computationally expensive to train and difficult to parallelize. Recent work has shown that…

Machine Learning · Statistics 2015-10-07 César Laurent , Gabriel Pereyra , Philémon Brakel , Ying Zhang , Yoshua Bengio

Supervised Batch Normalization

Batch Normalization (BN), a widely-used technique in neural networks, enhances generalization and expedites training by normalizing each mini-batch to the same mean and variance. However, its effectiveness diminishes when confronted with…

Machine Learning · Computer Science 2024-05-28 Bilal Faye , Mustapha Lebbah , Hanane Azzag

STN: Scalable Tensorizing Networks via Structure-Aware Training and Adaptive Compression

Deep neural networks (DNNs) have delivered a remarkable performance in many tasks of computer vision. However, over-parameterized representations of popular architectures dramatically increase their computational complexity and storage…

Computer Vision and Pattern Recognition · Computer Science 2022-05-31 Chang Nie , Huan Wang , Lu Zhao

BCN: Batch Channel Normalization for Image Classification

Normalization techniques have been widely used in the field of deep learning due to their capability of enabling higher learning rates and are less careful in initialization. However, the effectiveness of popular normalization technologies…

Computer Vision and Pattern Recognition · Computer Science 2023-12-04 Afifa Khaled , Chao Li , Jia Ning , Kun He

Differentiable Sorting Networks for Scalable Sorting and Ranking Supervision

Sorting and ranking supervision is a method for training neural networks end-to-end based on ordering constraints. That is, the ground truth order of sets of samples is known, while their absolute values remain unsupervised. For that, we…

Machine Learning · Computer Science 2021-07-15 Felix Petersen , Christian Borgelt , Hilde Kuehne , Oliver Deussen