Related papers: LocalNorm: Robust Image Classification through Dyn…

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates…

Machine Learning · Computer Science 2015-03-03 Sergey Ioffe, Christian Szegedy

How Does Batch Normalization Help Optimization?

Batch Normalization (BatchNorm) is a widely adopted technique that enables faster and more stable training of deep neural networks (DNNs). Despite its pervasiveness, the exact reasons for BatchNorm's effectiveness are still poorly…

Machine Learning · Statistics 2019-04-16 Shibani Santurkar, Dimitris Tsipras, Andrew Ilyas, Aleksander Madry

Impact of Batch Normalization on Convolutional Network Representations

Batch normalization (BatchNorm) is a popular layer normalization technique used when training deep neural networks. It has been shown to enhance the training speed and accuracy of deep learning models. However, the mechanics by which…

Machine Learning · Computer Science 2025-02-14 Hermanus L. Potgieter, Coenraad Mouton, Marelie H. Davel

Robust and Generalizable Visual Representation Learning via Random Convolutions

While successful for various computer vision tasks, deep neural networks have shown to be vulnerable to texture style shifts and small perturbations to which humans are robust. In this work, we show that the robustness of neural networks…

Computer Vision and Pattern Recognition · Computer Science 2021-05-04 Zhenlin Xu, Deyi Liu, Junlin Yang, Colin Raffel, Marc Niethammer

DropBlock: A regularization method for convolutional networks

Deep neural networks often work well when they are over-parameterized and trained with a massive amount of noise and regularization, such as weight decay and dropout. Although dropout is widely used as a regularization technique for fully…

Computer Vision and Pattern Recognition · Computer Science 2018-10-31 Golnaz Ghiasi, Tsung-Yi Lin, Quoc V. Le

Learning Robust Global Representations by Penalizing Local Predictive Power

Despite their renowned predictive power on i.i.d. data, convolutional neural networks are known to rely more on high-frequency patterns that humans deem superficial than on low-frequency patterns that agree better with intuitions about what…

Computer Vision and Pattern Recognition · Computer Science 2019-11-06 Haohan Wang, Songwei Ge, Eric P. Xing, Zachary C. Lipton

Frequency Dropout: Feature-Level Regularization via Randomized Filtering

Deep convolutional neural networks have shown remarkable performance on various computer vision tasks, and yet, they are susceptible to picking up spurious correlations from the training signal. So-called 'shortcuts' can occur during…

Computer Vision and Pattern Recognition · Computer Science 2022-09-21 Mobarakol Islam, Ben Glocker

TaskNorm: Rethinking Batch Normalization for Meta-Learning

Modern meta-learning approaches for image classification rely on increasingly deep networks to achieve state-of-the-art performance, making batch normalization an essential component of meta-learning pipelines. However, the hierarchical…

Machine Learning · Statistics 2020-07-14 John Bronskill, Jonathan Gordon, James Requeima, Sebastian Nowozin, Richard E. Turner

Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training

Normalization techniques have become a basic component in modern convolutional neural networks (ConvNets). In particular, many recent works demonstrate that promoting the orthogonality of the weights helps train deep models and improve…

Computer Vision and Pattern Recognition · Computer Science 2022-01-05 Sheng Liu, Xiao Li, Yuexiang Zhai, Chong You, Zhihui Zhu, Carlos Fernandez-Granda, Qing Qu

Learning Robust Representations of Text

Deep neural networks have achieved remarkable results across many language processing tasks, however these methods are highly sensitive to noise and adversarial attacks. We present a regularization based method for limiting network…

Computation and Language · Computer Science 2016-09-21 Yitong Li, Trevor Cohn, Timothy Baldwin

High-Performance Large-Scale Image Recognition Without Normalization

Batch normalization is a key component of most image classification models, but it has many undesirable properties stemming from its dependence on the batch size and interactions between examples. Although recent work has succeeded in…

Computer Vision and Pattern Recognition · Computer Science 2021-02-12 Andrew Brock, Soham De, Samuel L. Smith, Karen Simonyan

Layer Normalization

Training state-of-the-art, deep neural networks is computationally expensive. One way to reduce the training time is to normalize the activities of the neurons. A recently introduced technique called batch normalization uses the…

Machine Learning · Statistics 2016-07-22 Jimmy Lei Ba, Jamie Ryan Kiros, Geoffrey E. Hinton

Robust and interpretable blind image denoising via bias-free convolutional neural networks

Deep convolutional networks often append additive constant ("bias") terms to their convolution operations, enabling a richer repertoire of functional mappings. Biases are also used to facilitate training, by subtracting mean response over…

Image and Video Processing · Electrical Eng. & Systems 2020-02-11 Sreyas Mohan, Zahra Kadkhodaie, Eero P. Simoncelli, Carlos Fernandez-Granda

Towards an Adversarially Robust Normalization Approach

Batch Normalization (BatchNorm) is effective for improving the performance and accelerating the training of deep neural networks. However, it has also shown to be a cause of adversarial vulnerability, i.e., networks without it are more…

Machine Learning · Computer Science 2020-06-22 Muhammad Awais, Fahad Shamshad, Sung-Ho Bae

Local Unsupervised Learning for Image Analysis

Local Hebbian learning is believed to be inferior in performance to end-to-end training using a backpropagation algorithm. We question this popular belief by designing a local algorithm that can learn convolutional filters at scale on large…

Computer Vision and Pattern Recognition · Computer Science 2019-08-27 Leopold Grinberg, John Hopfield, Dmitry Krotov

Convolutional Neural Networks with Dynamic Regularization

Regularization is commonly used for alleviating overfitting in machine learning. For convolutional neural networks (CNNs), regularization methods, such as DropBlock and Shake-Shake, have illustrated the improvement in the generalization…

Computer Vision and Pattern Recognition · Computer Science 2021-01-01 Yi Wang, Zhen-Peng Bian, Junhui Hou, Lap-Pui Chau

On the ability of CNNs to extract color invariant intensity based features for image classification

Convolutional neural networks (CNNs) have demonstrated remarkable success in vision-related tasks. However, their susceptibility to failing when inputs deviate from the training distribution is well-documented. Recent studies suggest that…

Computer Vision and Pattern Recognition · Computer Science 2023-07-14 Pradyumna Elavarthi, James Lee, Anca Ralescu

PatchShuffle Regularization

This paper focuses on regularizing the training of the convolutional neural network (CNN). We propose a new regularization approach named "PatchShuffle" that can be adopted in any classification-oriented CNN models. It is easy to…

Computer Vision and Pattern Recognition · Computer Science 2017-07-25 Guoliang Kang, Xuanyi Dong, Liang Zheng, Yi Yang

TargetDrop: A Targeted Regularization Method for Convolutional Neural Networks

Dropout regularization has been widely used in deep learning but performs less effective for convolutional neural networks since the spatially correlated features allow dropped information to still flow through the networks. Some structured…

Computer Vision and Pattern Recognition · Computer Science 2020-10-22 Hui Zhu, Xiaofang Zhao

Generalization by Recognizing Confusion

A recently-proposed technique called self-adaptive training augments modern neural networks by allowing them to adjust training labels on the fly, to avoid overfitting to samples that may be mislabeled or otherwise non-representative. By…

Machine Learning · Computer Science 2020-06-16 Daniel Chiu, Franklyn Wang, Scott Duke Kominers