English
Related papers

Related papers: LocalNorm: Robust Image Classification through Dyn…

200 papers

Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates…

Machine Learning · Computer Science 2015-03-03 Sergey Ioffe , Christian Szegedy

Batch Normalization (BatchNorm) is a widely adopted technique that enables faster and more stable training of deep neural networks (DNNs). Despite its pervasiveness, the exact reasons for BatchNorm's effectiveness are still poorly…

Machine Learning · Statistics 2019-04-16 Shibani Santurkar , Dimitris Tsipras , Andrew Ilyas , Aleksander Madry

Batch normalization (BatchNorm) is a popular layer normalization technique used when training deep neural networks. It has been shown to enhance the training speed and accuracy of deep learning models. However, the mechanics by which…

Machine Learning · Computer Science 2025-02-14 Hermanus L. Potgieter , Coenraad Mouton , Marelie H. Davel

While successful for various computer vision tasks, deep neural networks have shown to be vulnerable to texture style shifts and small perturbations to which humans are robust. In this work, we show that the robustness of neural networks…

Computer Vision and Pattern Recognition · Computer Science 2021-05-04 Zhenlin Xu , Deyi Liu , Junlin Yang , Colin Raffel , Marc Niethammer

Deep neural networks often work well when they are over-parameterized and trained with a massive amount of noise and regularization, such as weight decay and dropout. Although dropout is widely used as a regularization technique for fully…

Computer Vision and Pattern Recognition · Computer Science 2018-10-31 Golnaz Ghiasi , Tsung-Yi Lin , Quoc V. Le

Despite their renowned predictive power on i.i.d. data, convolutional neural networks are known to rely more on high-frequency patterns that humans deem superficial than on low-frequency patterns that agree better with intuitions about what…

Computer Vision and Pattern Recognition · Computer Science 2019-11-06 Haohan Wang , Songwei Ge , Eric P. Xing , Zachary C. Lipton

Deep convolutional neural networks have shown remarkable performance on various computer vision tasks, and yet, they are susceptible to picking up spurious correlations from the training signal. So called `shortcuts' can occur during…

Computer Vision and Pattern Recognition · Computer Science 2022-09-21 Mobarakol Islam , Ben Glocker

Modern meta-learning approaches for image classification rely on increasingly deep networks to achieve state-of-the-art performance, making batch normalization an essential component of meta-learning pipelines. However, the hierarchical…

Machine Learning · Statistics 2020-07-14 John Bronskill , Jonathan Gordon , James Requeima , Sebastian Nowozin , Richard E. Turner

Normalization techniques have become a basic component in modern convolutional neural networks (ConvNets). In particular, many recent works demonstrate that promoting the orthogonality of the weights helps train deep models and improve…

Computer Vision and Pattern Recognition · Computer Science 2022-01-05 Sheng Liu , Xiao Li , Yuexiang Zhai , Chong You , Zhihui Zhu , Carlos Fernandez-Granda , Qing Qu

Deep neural networks have achieved remarkable results across many language processing tasks, however these methods are highly sensitive to noise and adversarial attacks. We present a regularization based method for limiting network…

Computation and Language · Computer Science 2016-09-21 Yitong Li , Trevor Cohn , Timothy Baldwin

Batch normalization is a key component of most image classification models, but it has many undesirable properties stemming from its dependence on the batch size and interactions between examples. Although recent work has succeeded in…

Computer Vision and Pattern Recognition · Computer Science 2021-02-12 Andrew Brock , Soham De , Samuel L. Smith , Karen Simonyan

Training state-of-the-art, deep neural networks is computationally expensive. One way to reduce the training time is to normalize the activities of the neurons. A recently introduced technique called batch normalization uses the…

Machine Learning · Statistics 2016-07-22 Jimmy Lei Ba , Jamie Ryan Kiros , Geoffrey E. Hinton

Deep convolutional networks often append additive constant ("bias") terms to their convolution operations, enabling a richer repertoire of functional mappings. Biases are also used to facilitate training, by subtracting mean response over…

Image and Video Processing · Electrical Eng. & Systems 2020-02-11 Sreyas Mohan , Zahra Kadkhodaie , Eero P. Simoncelli , Carlos Fernandez-Granda

Batch Normalization (BatchNorm) is effective for improving the performance and accelerating the training of deep neural networks. However, it has also shown to be a cause of adversarial vulnerability, i.e., networks without it are more…

Machine Learning · Computer Science 2020-06-22 Muhammad Awais , Fahad Shamshad , Sung-Ho Bae

Local Hebbian learning is believed to be inferior in performance to end-to-end training using a backpropagation algorithm. We question this popular belief by designing a local algorithm that can learn convolutional filters at scale on large…

Computer Vision and Pattern Recognition · Computer Science 2019-08-27 Leopold Grinberg , John Hopfield , Dmitry Krotov

Regularization is commonly used for alleviating overfitting in machine learning. For convolutional neural networks (CNNs), regularization methods, such as DropBlock and Shake-Shake, have illustrated the improvement in the generalization…

Computer Vision and Pattern Recognition · Computer Science 2021-01-01 Yi Wang , Zhen-Peng Bian , Junhui Hou , Lap-Pui Chau

Convolutional neural networks (CNNs) have demonstrated remarkable success in vision-related tasks. However, their susceptibility to failing when inputs deviate from the training distribution is well-documented. Recent studies suggest that…

Computer Vision and Pattern Recognition · Computer Science 2023-07-14 Pradyumna Elavarthi , James Lee , Anca Ralescu

This paper focuses on regularizing the training of the convolutional neural network (CNN). We propose a new regularization approach named ``PatchShuffle`` that can be adopted in any classification-oriented CNN models. It is easy to…

Computer Vision and Pattern Recognition · Computer Science 2017-07-25 Guoliang Kang , Xuanyi Dong , Liang Zheng , Yi Yang

Dropout regularization has been widely used in deep learning but performs less effective for convolutional neural networks since the spatially correlated features allow dropped information to still flow through the networks. Some structured…

Computer Vision and Pattern Recognition · Computer Science 2020-10-22 Hui Zhu , Xiaofang Zhao

A recently-proposed technique called self-adaptive training augments modern neural networks by allowing them to adjust training labels on the fly, to avoid overfitting to samples that may be mislabeled or otherwise non-representative. By…

Machine Learning · Computer Science 2020-06-16 Daniel Chiu , Franklyn Wang , Scott Duke Kominers
‹ Prev 1 2 3 10 Next ›