Related papers: Generating Less Certain Adversarial Examples Impro…

Predicting Adversarial Examples with High Confidence

It has been suggested that adversarial examples cause deep learning models to make incorrect predictions with high confidence. In this work, we take the opposite stance: an overly confident model is more likely to be vulnerable to…

Machine Learning · Computer Science 2018-02-14 Angus Galloway, Graham W. Taylor, Medhat Moussa

Boundary Adversarial Examples Against Adversarial Overfitting

Standard adversarial training approaches suffer from robust overfitting where the robust accuracy decreases when models are adversarially trained for too long. The origin of this problem is still unclear and conflicting explanations have…

Machine Learning · Computer Science 2022-11-28 Muhammad Zaid Hameed, Beat Buesser

On the Impact of Hard Adversarial Instances on Overfitting in Adversarial Training

Adversarial training is a popular method to robustify models against adversarial attacks. However, it exhibits much more severe overfitting than training on clean inputs. In this work, we investigate this phenomenon from the perspective of…

Machine Learning · Computer Science 2024-12-18 Chen Liu, Zhichao Huang, Mathieu Salzmann, Tong Zhang, Sabine Süsstrunk

Robustness and Generalization via Generative Adversarial Training

While deep neural networks have achieved remarkable success in various computer vision tasks, they often fail to generalize to new domains and subtle variations of input images. Several defenses have been proposed to improve the robustness…

Computer Vision and Pattern Recognition · Computer Science 2021-09-08 Omid Poursaeed, Tianxing Jiang, Harry Yang, Serge Belongie, SerNam Lim

Enhancing Adversarial Robustness via Uncertainty-Aware Distributional Adversarial Training

Despite remarkable achievements in deep learning across various domains, its inherent vulnerability to adversarial examples still remains a critical concern for practical deployment. Adversarial training has emerged as one of the most…

Machine Learning · Computer Science 2024-11-06 Junhao Dong, Xinghua Qu, Z. Jane Wang, Yew-Soon Ong

Disentangling Adversarial Robustness and Generalization

Obtaining deep networks that are robust against adversarial examples and generalize well is an open problem. A recent hypothesis even states that both robust and accurate models are impossible, i.e., adversarial robustness and…

Computer Vision and Pattern Recognition · Computer Science 2019-04-11 David Stutz, Matthias Hein, Bernt Schiele

Towards Understanding Adversarial Examples Systematically: Exploring Data Size, Task and Model Factors

Most previous works usually explained adversarial examples from several specific perspectives, lacking relatively integral comprehension about this problem. In this paper, we present a systematic study on adversarial examples from three…

Machine Learning · Computer Science 2019-03-01 Ke Sun, Zhanxing Zhu, Zhouchen Lin

On the Robustness of Adversarial Training Against Uncertainty Attacks

In learning problems, the noise inherent to the task at hand hinders the possibility to infer without a certain degree of uncertainty. Quantifying this uncertainty, regardless of its wide use, assumes high relevance for security-sensitive…

Machine Learning · Computer Science 2025-05-28 Emanuele Ledda, Giovanni Scodeller, Daniele Angioni, Giorgio Piras, Antonio Emanuele Cinà, Giorgio Fumera, Battista Biggio, Fabio Roli

Are Accuracy and Robustness Correlated?

Machine learning models are vulnerable to adversarial examples formed by applying small carefully chosen perturbations to inputs that cause unexpected classification errors. In this paper, we perform experiments on various adversarial…

Computer Vision and Pattern Recognition · Computer Science 2017-08-08 Andras Rozsa, Manuel Günther, Terrance E. Boult

Adversarially Robust Generalization Requires More Data

Machine learning models are often susceptible to adversarial perturbations of their inputs. Even small perturbations can cause state-of-the-art classifiers with high "standard" accuracy to produce an incorrect prediction with high…

Machine Learning · Computer Science 2018-05-03 Ludwig Schmidt, Shibani Santurkar, Dimitris Tsipras, Kunal Talwar, Aleksander Mądry

Robust Models are less Over-Confident

Despite the success of convolutional neural networks (CNNs) in many academic benchmarks for computer vision tasks, their application in the real-world is still facing fundamental challenges. One of these open problems is the inherent lack…

Computer Vision and Pattern Recognition · Computer Science 2022-12-07 Julia Grabinski, Paul Gavrikov, Janis Keuper, Margret Keuper

Theoretical Foundations of Adversarially Robust Learning

Despite extraordinary progress, current machine learning systems have been shown to be brittle against adversarial examples: seemingly innocuous but carefully crafted perturbations of test examples that cause machine learning predictors to…

Machine Learning · Computer Science 2023-06-14 Omar Montasser

Adversarial robustness for latent models: Revisiting the robust-standard accuracies tradeoff

Over the past few years, several adversarial training methods have been proposed to improve the robustness of machine learning models against adversarial perturbations in the input. Despite remarkable progress in this regard, adversarial…

Machine Learning · Computer Science 2022-04-04 Adel Javanmard, Mohammad Mehrabi

Robustness May Be at Odds with Accuracy

We show that there may exist an inherent tension between the goal of adversarial robustness and that of standard generalization. Specifically, training robust models may not only be more resource-consuming, but also lead to a reduction of…

Machine Learning · Statistics 2019-09-10 Dimitris Tsipras, Shibani Santurkar, Logan Engstrom, Alexander Turner, Aleksander Madry

Improving Adversarial Robustness by Putting More Regularizations on Less Robust Samples

Adversarial training, which is to enhance robustness against adversarial attacks, has received much attention because it is easy to generate human-imperceptible perturbations of data to deceive a given deep neural network. In this paper, we…

Machine Learning · Statistics 2023-06-02 Dongyoon Yang, Insung Kong, Yongdai Kim

A3T: Accuracy Aware Adversarial Training

Adversarial training has been empirically shown to be more prone to overfitting than standard training. The exact underlying reasons still need to be fully understood. In this paper, we identify one cause of overfitting related to current…

Machine Learning · Computer Science 2022-11-30 Enes Altinisik, Safa Messaoud, Husrev Taha Sencar, Sanjay Chawla

Understanding and Combating Robust Overfitting via Input Loss Landscape Analysis and Regularization

Adversarial training is widely used to improve the robustness of deep neural networks to adversarial attack. However, adversarial training is prone to overfitting, and the cause is far from clear. This work sheds light on the mechanisms…

Machine Learning · Computer Science 2022-12-12 Lin Li, Michael Spratling

Relating Adversarially Robust Generalization to Flat Minima

Adversarial training (AT) has become the de-facto standard to obtain models robust against adversarial examples. However, AT exhibits severe robust overfitting: cross-entropy loss on adversarial examples, so-called robust loss, decreases…

Machine Learning · Computer Science 2021-10-07 David Stutz, Matthias Hein, Bernt Schiele

Adversarial Vertex Mixup: Toward Better Adversarially Robust Generalization

Adversarial examples cause neural networks to produce incorrect outputs with high confidence. Although adversarial training is one of the most effective forms of defense against adversarial examples, unfortunately, a large gap exists…

Computer Vision and Pattern Recognition · Computer Science 2020-07-28 Saehyung Lee, Hyungyu Lee, Sungroh Yoon

Impact of Adversarial Training on Robustness and Generalizability of Language Models

Adversarial training is widely acknowledged as the most effective defense against adversarial attacks. However, it is also well established that achieving both robustness and generalization in adversarially trained models involves a…

Computation and Language · Computer Science 2023-12-12 Enes Altinisik, Hassan Sajjad, Husrev Taha Sencar, Safa Messaoud, Sanjay Chawla