Related papers: When adversarial examples are excusable

Instance adaptive adversarial training: Improved accuracy tradeoffs in neural nets

Adversarial training is by far the most successful strategy for improving robustness of neural networks to adversarial attacks. Despite its success as a defense mechanism, adversarial training fails to generalize well to unperturbed test…

Machine Learning · Computer Science 2019-10-18 Yogesh Balaji , Tom Goldstein , Judy Hoffman

Adversarial Examples Are Not Easily Detected: Bypassing Ten Detection Methods

Neural networks are known to be vulnerable to adversarial examples: inputs that are close to natural inputs but classified incorrectly. In order to better understand the space of adversarial examples, we survey ten recent proposals that are…

Machine Learning · Computer Science 2017-11-02 Nicholas Carlini , David Wagner

Towards Imperceptible and Robust Adversarial Example Attacks against Neural Networks

Machine learning systems based on deep neural networks, being able to produce state-of-the-art results on various perception tasks, have gained mainstream adoption in many applications. However, they are shown to be vulnerable to…

Machine Learning · Computer Science 2018-01-16 Bo Luo , Yannan Liu , Lingxiao Wei , Qiang Xu

When and How to Fool Explainable Models (and Humans) with Adversarial Examples

Reliable deployment of machine learning models such as neural networks continues to be challenging due to several limitations. Some of the main shortcomings are the lack of interpretability and the lack of robustness against adversarial…

Machine Learning · Computer Science 2025-02-18 Jon Vadillo , Roberto Santana , Jose A. Lozano

Adversarial Examples on Object Recognition: A Comprehensive Survey

Deep neural networks are at the forefront of machine learning research. However, despite achieving impressive performance on complex tasks, they can be very sensitive: Small perturbations of inputs can be sufficient to induce incorrect…

Computer Vision and Pattern Recognition · Computer Science 2020-09-04 Alex Serban , Erik Poll , Joost Visser

Explaining and Harnessing Adversarial Examples

Several machine learning models, including neural networks, consistently misclassify adversarial examples---inputs formed by applying small but intentionally worst-case perturbations to examples from the dataset, such that the perturbed…

Machine Learning · Statistics 2015-03-24 Ian J. Goodfellow , Jonathon Shlens , Christian Szegedy

Simultaneous Adversarial Training - Learn from Others Mistakes

Adversarial examples are maliciously tweaked images that can easily fool machine learning techniques, such as neural networks, but they are normally not visually distinguishable for human beings. One of the main approaches to solve this…

Computer Vision and Pattern Recognition · Computer Science 2018-09-11 Zukang Liao

Towards Adversarial Training with Moderate Performance Improvement for Neural Network Classification

It has been demonstrated that deep neural networks are prone to noisy examples particular adversarial samples during inference process. The gap between robust deep learning systems in real world applications and vulnerable neural networks…

Machine Learning · Computer Science 2018-07-03 Xinhan Di , Pengqian Yu , Meng Tian

Adversarial Robustness Guarantees for Random Deep Neural Networks

The reliability of deep learning algorithms is fundamentally challenged by the existence of adversarial examples, which are incorrectly classified inputs that are extremely close to a correctly classified input. We explore the properties of…

Machine Learning · Statistics 2021-07-23 Giacomo De Palma , Bobak T. Kiani , Seth Lloyd

The Limitations of Model Uncertainty in Adversarial Settings

Machine learning models are vulnerable to adversarial examples: minor perturbations to input samples intended to deliberately cause misclassification. While an obvious security threat, adversarial examples yield as well insights about the…

Cryptography and Security · Computer Science 2019-11-19 Kathrin Grosse , David Pfaff , Michael Thomas Smith , Michael Backes

Are Accuracy and Robustness Correlated?

Machine learning models are vulnerable to adversarial examples formed by applying small carefully chosen perturbations to inputs that cause unexpected classification errors. In this paper, we perform experiments on various adversarial…

Computer Vision and Pattern Recognition · Computer Science 2017-08-08 Andras Rozsa , Manuel Günther , Terrance E. Boult

Are adversarial examples inevitable?

A wide range of defenses have been proposed to harden neural networks against adversarial attacks. However, a pattern has emerged in which the majority of adversarial defenses are quickly broken by new attacks. Given the lack of success at…

Machine Learning · Computer Science 2020-02-05 Ali Shafahi , W. Ronny Huang , Christoph Studer , Soheil Feizi , Tom Goldstein

Verifying the Causes of Adversarial Examples

The robustness of neural networks is challenged by adversarial examples that contain almost imperceptible perturbations to inputs, which mislead a classifier to incorrect outputs in high confidence. Limited by the extreme difficulty in…

Machine Learning · Computer Science 2020-10-20 Honglin Li , Yifei Fan , Frieder Ganz , Anthony Yezzi , Payam Barnaghi

Adversarial Examples that Fool both Computer Vision and Time-Limited Humans

Machine learning models are vulnerable to adversarial examples: small changes to images can cause computer vision models to make mistakes such as identifying a school bus as an ostrich. However, it is still an open question whether humans…

Machine Learning · Computer Science 2019-09-12 Gamaleldin F. Elsayed , Shreya Shankar , Brian Cheung , Nicolas Papernot , Alex Kurakin , Ian Goodfellow , Jascha Sohl-Dickstein

Fast Training of Deep Neural Networks Robust to Adversarial Perturbations

Deep neural networks are capable of training fast and generalizing well within many domains. Despite their promising performance, deep networks have shown sensitivities to perturbations of their inputs (e.g., adversarial examples) and their…

Machine Learning · Computer Science 2020-07-09 Justin Goodwin , Olivia Brown , Victoria Helus

Understanding the Decision Boundary of Deep Neural Networks: An Empirical Study

Despite achieving remarkable performance on many image classification tasks, state-of-the-art machine learning (ML) classifiers remain vulnerable to small input perturbations. Especially, the existence of adversarial examples raises…

Machine Learning · Computer Science 2020-02-06 David Mickisch , Felix Assion , Florens Greßner , Wiebke Günther , Mariele Motta

How Wrong Am I? - Studying Adversarial Examples and their Impact on Uncertainty in Gaussian Process Machine Learning Models

Machine learning models are vulnerable to Adversarial Examples: minor perturbations to input samples intended to deliberately cause misclassification. Current defenses against adversarial examples, especially for Deep Neural Networks (DNN),…

Cryptography and Security · Computer Science 2019-01-04 Kathrin Grosse , David Pfaff , Michael Thomas Smith , Michael Backes

A Boundary Tilting Persepective on the Phenomenon of Adversarial Examples

Deep neural networks have been shown to suffer from a surprising weakness: their classification outputs can be changed by small, non-random perturbations of their inputs. This adversarial example phenomenon has been explained as originating…

Machine Learning · Computer Science 2016-08-30 Thomas Tanay , Lewis Griffin

An Extended Study of Human-like Behavior under Adversarial Training

Neural networks have a number of shortcomings. Amongst the severest ones is the sensitivity to distribution shifts which allows models to be easily fooled into wrong predictions by small perturbations to inputs that are often imperceivable…

Computer Vision and Pattern Recognition · Computer Science 2023-03-23 Paul Gavrikov , Janis Keuper , Margret Keuper

Adversarial attacks hidden in plain sight

Convolutional neural networks have been used to achieve a string of successes during recent years, but their lack of interpretability remains a serious issue. Adversarial examples are designed to deliberately fool neural networks into…

Machine Learning · Computer Science 2020-04-28 Jan Philip Göpfert , André Artelt , Heiko Wersing , Barbara Hammer