Related papers: Concurrent Adversarial Learning for Large-Batch Tr…

Large Batch Training of Convolutional Networks

A common way to speed up training of large convolutional networks is to add computational units. Training is then performed using data-parallel synchronous Stochastic Gradient Descent (SGD) with mini-batch divided between computational…

Computer Vision and Pattern Recognition · Computer Science 2017-09-15 Yang You , Igor Gitman , Boris Ginsburg

Large batch size training of neural networks with adversarial training and second-order information

The most straightforward method to accelerate Stochastic Gradient Descent (SGD) computation is to distribute the randomly selected batch of inputs over multiple processors. To keep the distributed processors fully utilized requires…

Machine Learning · Computer Science 2020-01-06 Zhewei Yao , Amir Gholami , Daiyaan Arfeen , Richard Liaw , Joseph Gonzalez , Kurt Keutzer , Michael Mahoney

AdaBatch: Adaptive Batch Sizes for Training Deep Neural Networks

Training deep neural networks with Stochastic Gradient Descent, or its variants, requires careful choice of both learning rate and batch size. While smaller batch sizes generally converge in fewer training epochs, larger batch sizes offer…

Machine Learning · Computer Science 2018-02-15 Aditya Devarakonda , Maxim Naumov , Michael Garland

Efficient and Effective Augmentation Strategy for Adversarial Training

Adversarial training of Deep Neural Networks is known to be significantly more data-hungry when compared to standard training. Furthermore, complex data augmentations such as AutoAugment, which have led to substantial gains in standard…

Machine Learning · Computer Science 2022-10-28 Sravanti Addepalli , Samyak Jain , R. Venkatesh Babu

Adversarial Coreset Selection for Efficient Robust Training

Neural networks are vulnerable to adversarial attacks: adding well-crafted, imperceptible perturbations to their input can modify their output. Adversarial training is one of the most effective approaches to training robust models against…

Machine Learning · Computer Science 2023-08-09 Hadi M. Dolatabadi , Sarah Erfani , Christopher Leckie

Don't Decay the Learning Rate, Increase the Batch Size

It is common practice to decay the learning rate. Here we show one can usually obtain the same learning curve on both training and test sets by instead increasing the batch size during training. This procedure is successful for stochastic…

Machine Learning · Computer Science 2018-02-27 Samuel L. Smith , Pieter-Jan Kindermans , Chris Ying , Quoc V. Le

AccelAT: A Framework for Accelerating the Adversarial Training of Deep Neural Networks through Accuracy Gradient

Adversarial training is exploited to develop a robust Deep Neural Network (DNN) model against the malicious altered data. These attacks may have catastrophic effects on DNN models but are indistinguishable for a human being. For example, an…

Machine Learning · Computer Science 2022-10-14 Farzad Nikfam , Alberto Marchisio , Maurizio Martina , Muhammad Shafique

Long-term Cross Adversarial Training: A Robust Meta-learning Method for Few-shot Classification Tasks

Meta-learning model can quickly adapt to new tasks using few-shot labeled data. However, despite achieving good generalization on few-shot classification tasks, it is still challenging to improve the adversarial robustness of the…

Machine Learning · Computer Science 2021-07-02 Fan Liu , Shuyu Zhao , Xuelong Dai , Bin Xiao

Utilizing Adversarial Targeted Attacks to Boost Adversarial Robustness

Adversarial attacks have been shown to be highly effective at degrading the performance of deep neural networks (DNNs). The most prominent defense is adversarial training, a method for learning a robust model. Nevertheless, adversarial…

Computer Vision and Pattern Recognition · Computer Science 2021-09-07 Uriya Pesso , Koby Bibas , Meir Feder

A Simple Fine-tuning Is All You Need: Towards Robust Deep Learning Via Adversarial Fine-tuning

Adversarial Training (AT) with Projected Gradient Descent (PGD) is an effective approach for improving the robustness of the deep neural networks. However, PGD AT has been shown to suffer from two main limitations: i) high computational…

Computer Vision and Pattern Recognition · Computer Science 2020-12-29 Ahmadreza Jeddi , Mohammad Javad Shafiee , Alexander Wong

Single-Step Adversarial Training for Semantic Segmentation

Even though deep neural networks succeed on many different tasks including semantic segmentation, they lack on robustness against adversarial examples. To counteract this exploit, often adversarial training is used. However, it is known…

Computer Vision and Pattern Recognition · Computer Science 2021-07-01 Daniel Wiens , Barbara Hammer

Bilateral Adversarial Training: Towards Fast Training of More Robust Models Against Adversarial Attacks

In this paper, we study fast training of adversarially robust models. From the analyses of the state-of-the-art defense method, i.e., the multi-step adversarial training, we hypothesize that the gradient magnitude links to the model…

Computer Vision and Pattern Recognition · Computer Science 2019-08-02 Jianyu Wang , Haichao Zhang

Distributed Adversarial Training to Robustify Deep Neural Networks at Scale

Current deep neural networks (DNNs) are vulnerable to adversarial attacks, where adversarial perturbations to the inputs can change or manipulate classification. To defend against such attacks, an effective and popular approach, known as…

Machine Learning · Computer Science 2022-09-08 Gaoyuan Zhang , Songtao Lu , Yihua Zhang , Xiangyi Chen , Pin-Yu Chen , Quanfu Fan , Lee Martie , Lior Horesh , Mingyi Hong , Sijia Liu

Revisiting Adversarial Training at Scale

The machine learning community has witnessed a drastic change in the training pipeline, pivoted by those ''foundation models'' with unprecedented scales. However, the field of adversarial training is lagging behind, predominantly centered…

Computer Vision and Pattern Recognition · Computer Science 2024-04-23 Zeyu Wang , Xianhang Li , Hongru Zhu , Cihang Xie

Adversarial Momentum-Contrastive Pre-Training

Recently proposed adversarial self-supervised learning methods usually require big batches and long training epochs to extract robust features, which will bring heavy computational overhead on platforms with limited resources. In order to…

Computer Vision and Pattern Recognition · Computer Science 2022-05-31 Cong Xu , Dan Li , Min Yang

Efficient Optimization Algorithms for Linear Adversarial Training

Adversarial training can be used to learn models that are robust against perturbations. For linear models, it can be formulated as a convex optimization problem. Compared to methods proposed in the context of deep learning, leveraging the…

Machine Learning · Statistics 2025-03-20 Antônio H. RIbeiro , Thomas B. Schön , Dave Zahariah , Francis Bach

Convergence and Margin of Adversarial Training on Separable Data

Adversarial training is a technique for training robust machine learning models. To encourage robustness, it iteratively computes adversarial examples for the model, and then re-trains on these examples via some update rule. This work…

Machine Learning · Computer Science 2019-05-23 Zachary Charles , Shashank Rajput , Stephen Wright , Dimitris Papailiopoulos

Adversarial Concurrent Training: Optimizing Robustness and Accuracy Trade-off of Deep Neural Networks

Adversarial training has been proven to be an effective technique for improving the adversarial robustness of models. However, there seems to be an inherent trade-off between optimizing the model for accuracy and robustness. To this end, we…

Computer Vision and Pattern Recognition · Computer Science 2020-08-20 Elahe Arani , Fahad Sarfraz , Bahram Zonooz

ScaLA: Accelerating Adaptation of Pre-Trained Transformer-Based Language Models via Efficient Large-Batch Adversarial Noise

In recent years, large pre-trained Transformer-based language models have led to dramatic improvements in many natural language understanding tasks. To train these models with increasing sizes, many neural network practitioners attempt to…

Machine Learning · Computer Science 2022-02-01 Minjia Zhang , Niranjan Uma Naresh , Yuxiong He

Analysis and Extensions of Adversarial Training for Video Classification

Adversarial training (AT) is a simple yet effective defense against adversarial attacks to image classification systems, which is based on augmenting the training set with attacks that maximize the loss. However, the effectiveness of AT as…

Computer Vision and Pattern Recognition · Computer Science 2022-06-17 Kaleab A. Kinfu , René Vidal