Related papers: Learning perturbation sets for robust machine lear…

Perturbation Analysis of Learning Algorithms: A Unifying Perspective on Generation of Adversarial Examples

Despite the tremendous success of deep neural networks in various learning problems, it has been observed that adding an intentionally designed adversarial perturbation to inputs of these architectures leads to erroneous classification with…

Machine Learning · Computer Science 2018-12-19 Emilio Rafael Balda, Arash Behboodi, Rudolf Mathar

Model-Based Robust Deep Learning: Generalizing to Natural, Out-of-Distribution Data

While deep learning has resulted in major breakthroughs in many application domains, the frameworks commonly used in deep learning remain fragile to artificially-crafted and imperceptible changes in the data. In response to this fragility,…

Machine Learning · Computer Science 2020-11-03 Alexander Robey, Hamed Hassani, George J. Pappas

Theoretical Understanding of Learning from Adversarial Perturbations

It is not fully understood why adversarial examples can deceive neural networks and transfer between different networks. To elucidate this, several studies have hypothesized that adversarial perturbations, while appearing as noises, contain…

Machine Learning · Computer Science 2024-02-19 Soichiro Kumano, Hiroshi Kera, Toshihiko Yamasaki

Metrics and methods for robustness evaluation of neural networks with generative models

Recent studies have shown that modern deep neural network classifiers are easy to fool, assuming that an adversary is able to slightly modify their inputs. Many papers have proposed adversarial attacks, defenses and methods to measure…

Machine Learning · Computer Science 2020-03-17 Igor Buzhinsky, Arseny Nerinovsky, Stavros Tripakis

Adversarially Robust Learning with Unknown Perturbation Sets

We study the problem of learning predictors that are robust to adversarial examples with respect to an unknown perturbation set, relying instead on interaction with an adversarial attacker or access to attack oracles, examining different…

Machine Learning · Computer Science 2021-02-04 Omar Montasser, Steve Hanneke, Nathan Srebro

Natural Perturbed Training for General Robustness of Neural Network Classifiers

We focus on the robustness of neural networks for classification. To permit a fair comparison between methods to achieve robustness, we first introduce a standard based on the mensuration of a classifier's degradation. Then, we propose…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Sadaf Gulshad, Arnold Smeulders

Perturbation: A simple and efficient adversarial tracer for representation learning in language models

Linguistic representation learning in deep neural language models (LMs) has been studied for decades, for both practical and theoretical reasons. However, finding representations in LMs remains an unsolved problem, in part due to a dilemma…

Computation and Language · Computer Science 2026-03-26 Joshua Rozner, Cory Shain

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

While existing work in robust deep learning has focused on small pixel-level norm-based perturbations, this may not account for perturbations encountered in several real-world settings. In many such cases although test data might not be…

Computer Vision and Pattern Recognition · Computer Science 2021-04-09 Tejas Gokhale, Rushil Anirudh, Bhavya Kailkhura, Jayaraman J. Thiagarajan, Chitta Baral, Yezhou Yang

Assessing Robustness of Machine Learning Models using Covariate Perturbations

As machine learning models become increasingly prevalent in critical decision-making models and systems in fields like finance, healthcare, etc., ensuring their robustness against adversarial attacks and changes in the input data is…

Machine Learning · Statistics 2024-08-05 Arun Prakash R, Anwesha Bhattacharyya, Joel Vaughan, Vijayan N. Nair

Evaluating Robustness to Context-Sensitive Feature Perturbations of Different Granularities

We cannot guarantee that training datasets are representative of the distribution of inputs that will be encountered during deployment. So we must have confidence that our models do not over-rely on this assumption. To this end, we…

Computer Vision and Pattern Recognition · Computer Science 2020-10-26 Isaac Dunn, Laura Hanu, Hadrien Pouget, Daniel Kroening, Tom Melham

Adversarially Robust Generalization Requires More Data

Machine learning models are often susceptible to adversarial perturbations of their inputs. Even small perturbations can cause state-of-the-art classifiers with high "standard" accuracy to produce an incorrect prediction with high…

Machine Learning · Computer Science 2018-05-03 Ludwig Schmidt, Shibani Santurkar, Dimitris Tsipras, Kunal Talwar, Aleksander Mądry

Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

In this work, we consider one challenging training time attack by modifying training data with bounded perturbation, hoping to manipulate the behavior (both targeted or non-targeted) of any corresponding trained classifier during test time…

Machine Learning · Computer Science 2019-05-23 Ji Feng, Qi-Zhi Cai, Zhi-Hua Zhou

Defending Against Universal Perturbations With Shared Adversarial Training

Classifiers such as deep neural networks have been shown to be vulnerable against adversarial perturbations on problems with high-dimensional input space. While adversarial training improves the robustness of image classifiers against such…

Computer Vision and Pattern Recognition · Computer Science 2019-08-14 Chaithanya Kumar Mummadi, Thomas Brox, Jan Hendrik Metzen

Wide Two-Layer Networks can Learn from Adversarial Perturbations

Adversarial examples have raised several open questions, such as why they can deceive classifiers and transfer between different models. A prevailing hypothesis to explain these phenomena suggests that adversarial perturbations appear as…

Machine Learning · Computer Science 2025-01-22 Soichiro Kumano, Hiroshi Kera, Toshihiko Yamasaki

Robustness-via-Synthesis: Robust Training with Generative Adversarial Perturbations

Upon the discovery of adversarial attacks, robust models have become obligatory for deep learning-based systems. Adversarial training with first-order attacks has been one of the most effective defenses against adversarial perturbations to…

Computer Vision and Pattern Recognition · Computer Science 2021-08-24 Inci M. Baytas, Debayan Deb

Adversarial Machine Learning Phases of Matter

We study the robustness of machine learning approaches to adversarial perturbations, with a focus on supervised learning scenarios. We find that typical phase classifiers based on deep neural networks are extremely vulnerable to adversarial…

Disordered Systems and Neural Networks · Physics 2024-01-26 Si Jiang, Sirui Lu, Dong-Ling Deng

Can Perceptual Guidance Lead to Semantically Explainable Adversarial Perturbations?

It is well known that carefully crafted imperceptible perturbations can cause state-of-the-art deep learning classification models to misclassify. Understanding and analyzing these adversarial perturbations play a crucial role in the design…

Image and Video Processing · Electrical Eng. & Systems 2023-08-08 P Charantej Reddy, Aditya Siripuram, Sumohana S. Channappayya

Adversarial Training: embedding adversarial perturbations into the parameter space of a neural network to build a robust system

Adversarial training, in which a network is trained on both adversarial and clean examples, is one of the most trusted defense methods against adversarial attacks. However, there are three major practical difficulties in implementing and…

Machine Learning · Computer Science 2019-10-11 Shixian Wen, Laurent Itti

Theoretical Foundations of Adversarially Robust Learning

Despite extraordinary progress, current machine learning systems have been shown to be brittle against adversarial examples: seemingly innocuous but carefully crafted perturbations of test examples that cause machine learning predictors to…

Machine Learning · Computer Science 2023-06-14 Omar Montasser

GAP++: Learning to generate target-conditioned adversarial examples

Adversarial examples are perturbed inputs which can cause a serious threat for machine learning models. Finding these perturbations is such a hard task that we can only use the iterative methods to traverse. For computational efficiency,…

Computer Vision and Pattern Recognition · Computer Science 2020-06-11 Xiaofeng Mao, Yuefeng Chen, Yuhong Li, Yuan He, Hui Xue