Related papers: Self-Progressing Robust Training

Adapting to Evolving Adversaries with Regularized Continual Robust Training

Robust training methods typically defend against specific attack types, such as Lp attacks with fixed budgets, and rarely account for the fact that defenders may encounter new attacks over time. A natural solution is to adapt the defended…

Machine Learning · Computer Science 2025-02-07 Sihui Dai , Christian Cianfarani , Arjun Bhagoji , Vikash Sehwag , Prateek Mittal

Revisiting Semi-supervised Adversarial Robustness via Noise-aware Online Robust Distillation

The robust self-training (RST) framework has emerged as a prominent approach for semi-supervised adversarial training. To explore the possibility of tackling more complicated tasks with even lower labeling budgets, unlike prior approaches…

Machine Learning · Computer Science 2024-09-20 Tsung-Han Wu , Hung-Ting Su , Shang-Tse Chen , Winston H. Hsu

Regional Adversarial Training for Better Robust Generalization

Adversarial training (AT) has been demonstrated as one of the most promising defense methods against various adversarial attacks. To our knowledge, existing AT-based methods usually train with the locally most adversarial perturbed points…

Computer Vision and Pattern Recognition · Computer Science 2021-09-07 Chuanbiao Song , Yanbo Fan , Yichen Yang , Baoyuan Wu , Yiming Li , Zhifeng Li , Kun He

Learning Robustness at Test-Time from a Non-Robust Teacher

Nowadays, pretrained models are increasingly used as general-purpose backbones and adapted at test-time to downstream environments where target data are scarce and unlabeled. While this paradigm has proven effective for improving clean…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Stefano Bianchettin , Giulio Rossolini , Giorgio Buttazzo

Toward Adversarial Robustness via Semi-supervised Robust Training

Adversarial examples have been shown to be the severe threat to deep neural networks (DNNs). One of the most effective adversarial defense methods is adversarial training (AT) through minimizing the adversarial risk $R_{adv}$, which…

Machine Learning · Computer Science 2020-06-17 Yiming Li , Baoyuan Wu , Yan Feng , Yanbo Fan , Yong Jiang , Zhifeng Li , Shutao Xia

Adversarial Training in Low-Label Regimes with Margin-Based Interpolation

Adversarial training has emerged as an effective approach to train robust neural network models that are resistant to adversarial attacks, even in low-label regimes where labeled data is scarce. In this paper, we introduce a novel…

Machine Learning · Computer Science 2024-11-28 Tian Ye , Rajgopal Kannan , Viktor Prasanna

SRoUDA: Meta Self-training for Robust Unsupervised Domain Adaptation

As acquiring manual labels on data could be costly, unsupervised domain adaptation (UDA), which transfers knowledge learned from a rich-label dataset to the unlabeled target dataset, is gaining increasing popularity. While extensive studies…

Computer Vision and Pattern Recognition · Computer Science 2022-12-13 Wanqing Zhu , Jia-Li Yin , Bo-Hao Chen , Ximeng Liu

Robust Upper Bounds for Adversarial Training

Many state-of-the-art adversarial training methods for deep learning leverage upper bounds of the adversarial loss to provide security guarantees against adversarial attacks. Yet, these methods rely on convex relaxations to propagate lower…

Machine Learning · Computer Science 2023-04-07 Dimitris Bertsimas , Xavier Boix , Kimberly Villalobos Carballo , Dick den Hertog

Soften to Defend: Towards Adversarial Robustness via Self-Guided Label Refinement

Adversarial training (AT) is currently one of the most effective ways to obtain the robustness of deep neural networks against adversarial attacks. However, most AT methods suffer from robust overfitting, i.e., a significant generalization…

Machine Learning · Computer Science 2024-03-15 Daiwei Yu , Zhuorong Li , Lina Wei , Canghong Jin , Yun Zhang , Sixian Chan

Provably Robust Deep Learning via Adversarially Trained Smoothed Classifiers

Recent works have shown the effectiveness of randomized smoothing as a scalable technique for building neural network-based classifiers that are provably robust to $\ell_2$-norm adversarial perturbations. In this paper, we employ…

Machine Learning · Computer Science 2020-01-13 Hadi Salman , Greg Yang , Jerry Li , Pengchuan Zhang , Huan Zhang , Ilya Razenshteyn , Sebastien Bubeck

Using Pre-Training Can Improve Model Robustness and Uncertainty

He et al. (2018) have called into question the utility of pre-training by showing that training from scratch can often yield similar performance to pre-training. We show that although pre-training may not improve performance on traditional…

Machine Learning · Computer Science 2019-10-22 Dan Hendrycks , Kimin Lee , Mantas Mazeika

Achieving Model Robustness through Discrete Adversarial Training

Discrete adversarial attacks are symbolic perturbations to a language input that preserve the output label but lead to a prediction error. While such attacks have been extensively explored for the purpose of evaluating model robustness,…

Machine Learning · Computer Science 2021-11-02 Maor Ivgi , Jonathan Berant

Certified Adversarial Robustness with Additive Noise

The existence of adversarial data examples has drawn significant attention in the deep-learning community; such data are seemingly minimally perturbed relative to the original data, but lead to very different outputs from a deep-learning…

Machine Learning · Computer Science 2019-11-12 Bai Li , Changyou Chen , Wenlin Wang , Lawrence Carin

SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification

Despite their promising performance across various natural language processing (NLP) tasks, current NLP systems are vulnerable to textual adversarial attacks. To defend against these attacks, most existing methods apply adversarial training…

Computation and Language · Computer Science 2023-07-06 Junjie Wu , Dit-Yan Yeung

CAT: Customized Adversarial Training for Improved Robustness

Adversarial training has become one of the most effective methods for improving robustness of neural networks. However, it often suffers from poor generalization on both clean and perturbed data. In this paper, we propose a new algorithm,…

Machine Learning · Computer Science 2020-02-19 Minhao Cheng , Qi Lei , Pin-Yu Chen , Inderjit Dhillon , Cho-Jui Hsieh

Self-Supervised Contrastive Learning with Adversarial Perturbations for Defending Word Substitution-based Attacks

In this paper, we present an approach to improve the robustness of BERT language models against word substitution-based adversarial attacks by leveraging adversarial perturbations for self-supervised contrastive learning. We create a…

Computation and Language · Computer Science 2022-05-25 Zhao Meng , Yihan Dong , Mrinmaya Sachan , Roger Wattenhofer

RUSH: Robust Contrastive Learning via Randomized Smoothing

Recently, adversarial training has been incorporated in self-supervised contrastive pre-training to augment label efficiency with exciting adversarial robustness. However, the robustness came at a cost of expensive adversarial training. In…

Machine Learning · Computer Science 2022-11-01 Yijiang Pang , Boyang Liu , Jiayu Zhou

Improving Adversarial Robustness by Putting More Regularizations on Less Robust Samples

Adversarial training, which is to enhance robustness against adversarial attacks, has received much attention because it is easy to generate human-imperceptible perturbations of data to deceive a given deep neural network. In this paper, we…

Machine Learning · Statistics 2023-06-02 Dongyoon Yang , Insung Kong , Yongdai Kim

Efficient Robust Training via Backward Smoothing

Adversarial training is so far the most effective strategy in defending against adversarial examples. However, it suffers from high computational costs due to the iterative adversarial attacks in each training step. Recent studies show that…

Machine Learning · Computer Science 2022-01-03 Jinghui Chen , Yu Cheng , Zhe Gan , Quanquan Gu , Jingjing Liu

Adversarial Fine-tune with Dynamically Regulated Adversary

Adversarial training is an effective method to boost model robustness to malicious, adversarial attacks. However, such improvement in model robustness often leads to a significant sacrifice of standard performance on clean images. In many…

Machine Learning · Computer Science 2022-04-29 Pengyue Hou , Ming Zhou , Jie Han , Petr Musilek , Xingyu Li