Related papers: CAT: Collaborative Adversarial Training

CAT: Customized Adversarial Training for Improved Robustness

Adversarial training has become one of the most effective methods for improving robustness of neural networks. However, it often suffers from poor generalization on both clean and perturbed data. In this paper, we propose a new algorithm,…

Machine Learning · Computer Science 2020-02-19 Minhao Cheng, Qi Lei, Pin-Yu Chen, Inderjit Dhillon, Cho-Jui Hsieh

Case-Aware Adversarial Training

Neural networks (NNs) have become one of the most popular types of models in various signal processing applications. However, NNs are extremely vulnerable to adversarial examples (AEs). To defend against AEs, adversarial training (AT) is believed to be…

Machine Learning · Computer Science 2023-08-31 Mingyuan Fan, Yang Liu, Cen Chen

Mutual Adversarial Training: Learning together is better than going alone

Recent studies have shown that robustness to adversarial attacks can be transferred across networks. In other words, we can make a weak model more robust with the help of a strong teacher model. We ask if instead of learning from a static…

Machine Learning · Computer Science 2023-02-13 Jiang Liu, Chun Pong Lau, Hossein Souri, Soheil Feizi, Rama Chellappa

WAT: Improve the Worst-class Robustness in Adversarial Training

Deep Neural Networks (DNNs) have been shown to be vulnerable to adversarial examples. Adversarial training (AT) is a popular and effective strategy to defend against adversarial attacks. Recent works (Benz et al., 2020; Xu et al., 2021; Tian…

Machine Learning · Computer Science 2023-02-09 Boqi Li, Weiwei Liu

Adversarial Concurrent Training: Optimizing Robustness and Accuracy Trade-off of Deep Neural Networks

Adversarial training has been proven to be an effective technique for improving the adversarial robustness of models. However, there seems to be an inherent trade-off between optimizing the model for accuracy and robustness. To this end, we…

Computer Vision and Pattern Recognition · Computer Science 2020-08-20 Elahe Arani, Fahad Sarfraz, Bahram Zonooz

Adversarial Training: A Survey

Adversarial training (AT) refers to integrating adversarial examples -- inputs altered with imperceptible perturbations that can significantly impact model predictions -- into the training process. Recent studies have demonstrated the…

Machine Learning · Computer Science 2024-10-22 Mengnan Zhao, Lihe Zhang, Jingwen Ye, Huchuan Lu, Baocai Yin, Xinchao Wang

Robustness through Cognitive Dissociation Mitigation in Contrastive Adversarial Training

In this paper, we introduce a novel neural network training framework that increases a model's robustness to adversarial attacks while maintaining high clean accuracy by combining contrastive learning (CL) with adversarial…

Machine Learning · Computer Science 2022-09-13 Adir Rahamim, Itay Naeh

Blind Adversarial Training: Balance Accuracy and Robustness

Adversarial training (AT) aims to improve the robustness of deep learning models by mixing clean data and adversarial examples (AEs). Most existing AT approaches can be grouped into restricted and unrestricted approaches. Restricted AT…

Machine Learning · Computer Science 2020-04-14 Haidong Xie, Xueshuang Xiang, Naijin Liu, Bin Dong

Strength-Adaptive Adversarial Training

Adversarial training (AT) has been proven to reliably improve a network's robustness against adversarial data. However, current AT with a pre-specified perturbation budget has limitations in learning a robust network. Firstly, applying a…

Machine Learning · Computer Science 2022-10-05 Chaojian Yu, Dawei Zhou, Li Shen, Jun Yu, Bo Han, Mingming Gong, Nannan Wang, Tongliang Liu

Adversarial Distributional Training for Robust Deep Learning

Adversarial training (AT) is among the most effective techniques to improve model robustness by augmenting training data with adversarial examples. However, most existing AT methods adopt a specific attack to craft adversarial examples,…

Machine Learning · Computer Science 2020-11-20 Yinpeng Dong, Zhijie Deng, Tianyu Pang, Hang Su, Jun Zhu

FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training

Deep neural networks are susceptible to adversarial attacks and common corruptions, which undermine their robustness. In order to enhance model resilience against such challenges, Adversarial Training (AT) has emerged as a prominent…

Machine Learning · Computer Science 2025-06-17 Tejaswini Medi, Steffen Jung, Margret Keuper

Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data

Adversarial training is a widely applied approach to training deep neural networks to be robust against adversarial perturbation. However, although adversarial training has achieved empirical success in practice, it remains unclear…

Machine Learning · Computer Science 2025-02-10 Binghui Li, Yuanzhi Li

Adversarial Training for Improving Model Robustness? Look at Both Prediction and Interpretation

Neural language models show vulnerability to adversarial examples which are semantically similar to their original counterparts with a few words replaced by their synonyms. A common way to improve model robustness is adversarial training…

Computation and Language · Computer Science 2022-03-25 Hanjie Chen, Yangfeng Ji

Curriculum Adversarial Training

Recently, deep learning has been applied to many security-sensitive applications, such as facial authentication. The existence of adversarial examples hinders such applications. The state-of-the-art result on defense shows that adversarial…

Machine Learning · Computer Science 2018-05-15 Qi-Zhi Cai, Min Du, Chang Liu, Dawn Song

Criticality Leveraged Adversarial Training (CLAT) for Boosted Performance via Parameter Efficiency

Adversarial training enhances neural network robustness but suffers from a tendency to overfit and increased generalization errors on clean data. This work introduces CLAT, an innovative approach that mitigates adversarial overfitting by…

Machine Learning · Computer Science 2024-12-25 Bhavna Gopal, Huanrui Yang, Jingyang Zhang, Mark Horton, Yiran Chen

Towards Alternative Techniques for Improving Adversarial Robustness: Analysis of Adversarial Training at a Spectrum of Perturbations

Adversarial training (AT) and its variants have spearheaded progress in improving neural network robustness to adversarial perturbations and common corruptions in the last few years. The algorithm design of AT and its variants is focused on…

Machine Learning · Computer Science 2022-06-15 Kaustubh Sridhar, Souradeep Dutta, Ramneet Kaur, James Weimer, Oleg Sokolsky, Insup Lee

Distributed Adversarial Training to Robustify Deep Neural Networks at Scale

Current deep neural networks (DNNs) are vulnerable to adversarial attacks, where adversarial perturbations to the inputs can change or manipulate classification. To defend against such attacks, an effective and popular approach, known as…

Machine Learning · Computer Science 2022-09-08 Gaoyuan Zhang, Songtao Lu, Yihua Zhang, Xiangyi Chen, Pin-Yu Chen, Quanfu Fan, Lee Martie, Lior Horesh, Mingyi Hong, Sijia Liu

Exploring Model Robustness with Adaptive Networks and Improved Adversarial Training

Adversarial training has proven to be effective in hardening networks against adversarial examples. However, the gained robustness is limited by network capacity and number of training samples. Consequently, to build more robust models, it…

Machine Learning · Computer Science 2020-06-02 Zheng Xu, Ali Shafahi, Tom Goldstein

Failure Cases Are Better Learned But Boundary Says Sorry: Facilitating Smooth Perception Change for Accuracy-Robustness Trade-Off in Adversarial Training

Adversarial Training (AT) is one of the most effective methods to train robust Deep Neural Networks (DNNs). However, AT creates an inherent trade-off between clean accuracy and adversarial robustness, which is commonly attributed to the…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Yanyun Wang, Li Liu

Efficient Adversarial Training with Transferable Adversarial Examples

Adversarial training is an effective defense method to protect classification models against adversarial attacks. However, one limitation of this approach is that it can require orders of magnitude more training time due to the high cost…

Machine Learning · Computer Science 2020-07-03 Haizhong Zheng, Ziqi Zhang, Juncheng Gu, Honglak Lee, Atul Prakash