Related papers: Adversarial Examples from Cryptographic Pseudo-Ran…

Adversarial examples from computational constraints

Why are classifiers in high dimension vulnerable to "adversarial" perturbations? We show that it is likely not due to information theoretic limitations, but rather it could be due to computational constraints. First we prove that, for a…

Machine Learning · Statistics 2018-05-28 Sébastien Bubeck , Eric Price , Ilya Razenshteyn

Adversarially Robust Learning Could Leverage Computational Hardness

Over recent years, devising classification algorithms that are robust to adversarial perturbations has emerged as a challenging problem. In particular, deep neural nets (DNNs) seem to be susceptible to small imperceptible changes over test…

Machine Learning · Computer Science 2019-12-20 Sanjam Garg , Somesh Jha , Saeed Mahloujifar , Mohammad Mahmoody

Theoretical Foundations of Adversarially Robust Learning

Despite extraordinary progress, current machine learning systems have been shown to be brittle against adversarial examples: seemingly innocuous but carefully crafted perturbations of test examples that cause machine learning predictors to…

Machine Learning · Computer Science 2023-06-14 Omar Montasser

Adversarial Examples and Metrics

Adversarial examples are a type of attack on machine learning (ML) systems which cause misclassification of inputs. Achieving robustness against adversarial examples is crucial to apply ML in the real world. While most prior work on…

Cryptography and Security · Computer Science 2020-07-16 Nico Döttling , Kathrin Grosse , Michael Backes , Ian Molloy

Are Accuracy and Robustness Correlated?

Machine learning models are vulnerable to adversarial examples formed by applying small carefully chosen perturbations to inputs that cause unexpected classification errors. In this paper, we perform experiments on various adversarial…

Computer Vision and Pattern Recognition · Computer Science 2017-08-08 Andras Rozsa , Manuel Günther , Terrance E. Boult

Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness

The susceptibility of modern machine learning classifiers to adversarial examples has motivated theoretical results suggesting that these might be unavoidable. However, these results can be too general to be applicable to natural data…

Machine Learning · Computer Science 2024-05-28 Ambar Pal , Jeremias Sulam , René Vidal

Detecting Adversarial Examples Is (Nearly) As Hard As Classifying Them

Making classifiers robust to adversarial examples is hard. Thus, many defenses tackle the seemingly easier task of detecting perturbed inputs. We show a barrier towards this goal. We prove a general hardness reduction between detection and…

Machine Learning · Computer Science 2022-06-17 Florian Tramèr

Adversarial Examples Are Not Bugs, They Are Features

Adversarial examples have attracted significant attention in machine learning, but the reasons for their existence and pervasiveness remain unclear. We demonstrate that adversarial examples can be directly attributed to the presence of…

Machine Learning · Statistics 2019-08-13 Andrew Ilyas , Shibani Santurkar , Dimitris Tsipras , Logan Engstrom , Brandon Tran , Aleksander Madry

Understanding and Quantifying Adversarial Examples Existence in Linear Classification

State-of-art deep neural networks (DNN) are vulnerable to attacks by adversarial examples: a carefully designed small perturbation to the input, that is imperceptible to human, can mislead DNN. To understand the root cause of adversarial…

Machine Learning · Statistics 2019-10-29 Xupeng Shi , A. Adam Ding

A Theoretical Framework for Robustness of (Deep) Classifiers against Adversarial Examples

Most machine learning classifiers, including deep neural networks, are vulnerable to adversarial examples. Such inputs are typically generated by adding small but purposeful modifications that lead to incorrect outputs while imperceptible…

Machine Learning · Computer Science 2017-09-28 Beilun Wang , Ji Gao , Yanjun Qi

Adversarial Examples Are a Natural Consequence of Test Error in Noise

Over the last few years, the phenomenon of adversarial examples --- maliciously constructed inputs that fool trained machine learning models --- has captured the attention of the research community, especially when the adversary is…

Machine Learning · Computer Science 2019-01-31 Nic Ford , Justin Gilmer , Nicolas Carlini , Dogus Cubuk

Are adversarial examples inevitable?

A wide range of defenses have been proposed to harden neural networks against adversarial attacks. However, a pattern has emerged in which the majority of adversarial defenses are quickly broken by new attacks. Given the lack of success at…

Machine Learning · Computer Science 2020-02-05 Ali Shafahi , W. Ronny Huang , Christoph Studer , Soheil Feizi , Tom Goldstein

When Should You Defend Your Classifier -- A Game-theoretical Analysis of Countermeasures against Adversarial Examples

Adversarial machine learning, i.e., increasing the robustness of machine learning algorithms against so-called adversarial examples, is now an established field. Yet, newly proposed methods are evaluated and compared under unrealistic…

Machine Learning · Computer Science 2021-09-28 Maximilian Samsinger , Florian Merkle , Pascal Schöttle , Tomas Pevny

Adversarial Examples on Object Recognition: A Comprehensive Survey

Deep neural networks are at the forefront of machine learning research. However, despite achieving impressive performance on complex tasks, they can be very sensitive: Small perturbations of inputs can be sufficient to induce incorrect…

Computer Vision and Pattern Recognition · Computer Science 2020-09-04 Alex Serban , Erik Poll , Joost Visser

Learning with a Strong Adversary

The robustness of neural networks to intended perturbations has recently attracted significant attention. In this paper, we propose a new method, \emph{learning with a strong adversary}, that learns robust classifiers from supervised data.…

Machine Learning · Computer Science 2016-01-19 Ruitong Huang , Bing Xu , Dale Schuurmans , Csaba Szepesvari

Towards Imperceptible and Robust Adversarial Example Attacks against Neural Networks

Machine learning systems based on deep neural networks, being able to produce state-of-the-art results on various perception tasks, have gained mainstream adoption in many applications. However, they are shown to be vulnerable to…

Machine Learning · Computer Science 2018-01-16 Bo Luo , Yannan Liu , Lingxiao Wei , Qiang Xu

On the Effect of Adversarial Training Against Invariance-based Adversarial Examples

Adversarial examples are carefully crafted attack points that are supposed to fool machine learning classifiers. In the last years, the field of adversarial machine learning, especially the study of perturbation-based adversarial examples,…

Machine Learning · Computer Science 2023-09-19 Roland Rauter , Martin Nocker , Florian Merkle , Pascal Schöttle

A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers

Text classifiers are vulnerable to adversarial examples -- correctly-classified examples that are deliberately transformed to be misclassified while satisfying acceptability constraints. The conventional approach to finding adversarial…

Computation and Language · Computer Science 2024-05-21 Tom Roth , Inigo Jauregi Unanue , Alsharif Abuadbba , Massimo Piccardi

Adversarial Robustness Guarantees for Random Deep Neural Networks

The reliability of deep learning algorithms is fundamentally challenged by the existence of adversarial examples, which are incorrectly classified inputs that are extremely close to a correctly classified input. We explore the properties of…

Machine Learning · Statistics 2021-07-23 Giacomo De Palma , Bobak T. Kiani , Seth Lloyd

Adversarial Examples - A Complete Characterisation of the Phenomenon

We provide a complete characterisation of the phenomenon of adversarial examples - inputs intentionally crafted to fool machine learning models. We aim to cover all the important concerns in this field of study: (1) the conjectures on the…

Computer Vision and Pattern Recognition · Computer Science 2019-02-19 Alexandru Constantin Serban , Erik Poll , Joost Visser