Related papers: Easy Batch Normalization

BOWL: A Deceptively Simple Open World Learner

Traditional machine learning excels on static benchmarks, but the real world is dynamic and seldom as carefully curated as test sets. Practical applications may generally encounter undesired inputs, are required to deal with novel…

Machine Learning · Computer Science 2025-03-17 Roshni . R. Kamath , Rupert Mitchell , Subarnaduti Paul , Kristian Kersting , Martin Mundt

Batch Normalized Recurrent Neural Networks

Recurrent Neural Networks (RNNs) are powerful models for sequential data that have the potential to learn long-term dependencies. However, they are computationally expensive to train and difficult to parallelize. Recent work has shown that…

Machine Learning · Statistics 2015-10-07 César Laurent , Gabriel Pereyra , Philémon Brakel , Ying Zhang , Yoshua Bengio

Empirical Study of Easy and Hard Examples in CNN Training

Deep Neural Networks (DNNs) generalize well despite their massive size and capability of memorizing all examples. There is a hypothesis that DNNs start learning from simple patterns and the hypothesis is based on the existence of examples…

Computer Vision and Pattern Recognition · Computer Science 2022-06-14 Ikki Kishida , Hideki Nakayama

Exponential convergence rates for Batch Normalization: The power of length-direction decoupling in non-convex optimization

Normalization techniques such as Batch Normalization have been applied successfully for training deep neural networks. Yet, despite its apparent empirical benefits, the reasons behind the success of Batch Normalization are mostly…

Machine Learning · Statistics 2018-10-09 Jonas Kohler , Hadi Daneshmand , Aurelien Lucchi , Ming Zhou , Klaus Neymeyr , Thomas Hofmann

Learning from networked examples

Many machine learning algorithms are based on the assumption that training examples are drawn independently. However, this assumption does not hold anymore when learning from a networked sample because two or more training examples may…

Artificial Intelligence · Computer Science 2017-06-06 Yuyi Wang , Jan Ramon , Zheng-Chu Guo

Adversarial Examples on Object Recognition: A Comprehensive Survey

Deep neural networks are at the forefront of machine learning research. However, despite achieving impressive performance on complex tasks, they can be very sensitive: Small perturbations of inputs can be sufficient to induce incorrect…

Computer Vision and Pattern Recognition · Computer Science 2020-09-04 Alex Serban , Erik Poll , Joost Visser

Batchless Normalization: How to Normalize Activations Across Instances with Minimal Memory Requirements

In training neural networks, batch normalization has many benefits, not all of them entirely understood. But it also has some drawbacks. Foremost is arguably memory consumption, as computing the batch statistics requires all instances…

Machine Learning · Computer Science 2024-07-26 Benjamin Berger , Victor Uc Cetina

Accelerating Training of Deep Neural Networks with a Standardization Loss

A significant advance in accelerating neural network training has been the development of normalization methods, permitting the training of deep models both faster and with better accuracy. These advances come with practical challenges: for…

Machine Learning · Computer Science 2019-03-05 Jasmine Collins , Johannes Balle , Jonathon Shlens

Batch Normalization is a Cause of Adversarial Vulnerability

Batch normalization (batch norm) is often used in an attempt to stabilize and accelerate training in deep neural networks. In many cases it indeed decreases the number of parameter updates required to achieve low training error. However, it…

Machine Learning · Computer Science 2019-05-31 Angus Galloway , Anna Golubeva , Thomas Tanay , Medhat Moussa , Graham W. Taylor

Effectiveness of Adversarial Examples and Defenses for Malware Classification

Artificial neural networks have been successfully used for many different classification tasks including malware detection and distinguishing between malicious and non-malicious programs. Although artificial neural networks perform very…

Machine Learning · Computer Science 2019-09-12 Robert Podschwadt , Hassan Takabi

Simultaneous Adversarial Training - Learn from Others Mistakes

Adversarial examples are maliciously tweaked images that can easily fool machine learning techniques, such as neural networks, but they are normally not visually distinguishable for human beings. One of the main approaches to solve this…

Computer Vision and Pattern Recognition · Computer Science 2018-09-11 Zukang Liao

Adversarial Machine Learning at Scale

Adversarial examples are malicious inputs designed to fool machine learning models. They often transfer from one model to another, allowing attackers to mount black box attacks without knowledge of the target model's parameters. Adversarial…

Computer Vision and Pattern Recognition · Computer Science 2017-02-14 Alexey Kurakin , Ian Goodfellow , Samy Bengio

Image Transformation can make Neural Networks more robust against Adversarial Examples

Neural networks are being applied in many tasks related to IoT with encouraging results. For example, neural networks can precisely detect human, objects and animal via surveillance camera for security purpose. However, neural networks have…

Computer Vision and Pattern Recognition · Computer Science 2019-01-11 Dang Duy Thang , Toshihiro Matsui

Batch Renormalization: Towards Reducing Minibatch Dependence in Batch-Normalized Models

Batch Normalization is quite effective at accelerating and improving the training of deep models. However, its effectiveness diminishes when the training minibatches are small, or do not consist of independent samples. We hypothesize that…

Machine Learning · Computer Science 2017-03-31 Sergey Ioffe

mixup: Beyond Empirical Risk Minimization

Large deep neural networks are powerful, but exhibit undesirable behaviors such as memorization and sensitivity to adversarial examples. In this work, we propose mixup, a simple learning principle to alleviate these issues. In essence,…

Machine Learning · Computer Science 2018-05-01 Hongyi Zhang , Moustapha Cisse , Yann N. Dauphin , David Lopez-Paz

Difficult Examples Hurt Unsupervised Contrastive Learning: A Theoretical Perspective

Unsupervised contrastive learning has shown significant performance improvements in recent years, often approaching or even rivaling supervised learning in various tasks. However, its learning mechanism is fundamentally different from…

Machine Learning · Computer Science 2026-03-05 Yi-Ge Zhang , Jingyi Cui , Qiran Li , Yisen Wang

Towards Adversarial Training with Moderate Performance Improvement for Neural Network Classification

It has been demonstrated that deep neural networks are prone to noisy examples particular adversarial samples during inference process. The gap between robust deep learning systems in real world applications and vulnerable neural networks…

Machine Learning · Computer Science 2018-07-03 Xinhan Di , Pengqian Yu , Meng Tian

Explaining and Harnessing Adversarial Examples

Several machine learning models, including neural networks, consistently misclassify adversarial examples---inputs formed by applying small but intentionally worst-case perturbations to examples from the dataset, such that the perturbed…

Machine Learning · Statistics 2015-03-24 Ian J. Goodfellow , Jonathon Shlens , Christian Szegedy

Towards Understanding Regularization in Batch Normalization

Batch Normalization (BN) improves both convergence and generalization in training neural networks. This work understands these phenomena theoretically. We analyze BN by using a basic block of neural networks, consisting of a kernel layer, a…

Machine Learning · Computer Science 2019-04-25 Ping Luo , Xinjiang Wang , Wenqi Shao , Zhanglin Peng

Classification by Ensembles of Neural Networks

We introduce a new procedure for training of artificial neural networks by using the approximation of an objective function by arithmetic mean of an ensemble of selected randomly generated neural networks, and apply this procedure to the…

Neural and Evolutionary Computing · Computer Science 2012-02-21 S. V. Kozyrev