Related papers: Algebraic Adversarial Attacks on Explainability Mo…

Algebraic Adversarial Attacks on Integrated Gradients

Adversarial attacks on explainability models have drastic consequences when explanations are used to understand the reasoning of neural networks in safety critical systems. Path methods are one such class of attribution methods susceptible…

Machine Learning · Computer Science 2025-02-28 Lachlan Simpson , Federico Costanza , Kyle Millar , Adriel Cheng , Cheng-Chew Lim , Hong Gunn Chew

A Geometry-Inspired Attack for Generating Natural Language Adversarial Examples

Generating adversarial examples for natural language is hard, as natural language consists of discrete symbols, and examples are often of variable lengths. In this paper, we propose a geometry-inspired attack for generating natural language…

Computation and Language · Computer Science 2020-10-06 Zhao Meng , Roger Wattenhofer

Generating Band-Limited Adversarial Surfaces Using Neural Networks

Generating adversarial examples is the art of creating a noise that is added to an input signal of a classifying neural network, and thus changing the network's classification, while keeping the noise as tenuous as possible. While the…

Computer Vision and Pattern Recognition · Computer Science 2021-12-08 Roee Ben-Shlomo , Yevgeniy Men , Ido Imanuel

Towards Imperceptible and Robust Adversarial Example Attacks against Neural Networks

Machine learning systems based on deep neural networks, being able to produce state-of-the-art results on various perception tasks, have gained mainstream adoption in many applications. However, they are shown to be vulnerable to…

Machine Learning · Computer Science 2018-01-16 Bo Luo , Yannan Liu , Lingxiao Wei , Qiang Xu

Adversarial Examples in Deep Learning: Characterization and Divergence

The burgeoning success of deep learning has raised the security and privacy concerns as more and more tasks are accompanied with sensitive data. Adversarial attacks in deep learning have emerged as one of the dominating security threat to a…

Machine Learning · Computer Science 2019-01-01 Wenqi Wei , Ling Liu , Margaret Loper , Stacey Truex , Lei Yu , Mehmet Emre Gursoy , Yanzhao Wu

When and How to Fool Explainable Models (and Humans) with Adversarial Examples

Reliable deployment of machine learning models such as neural networks continues to be challenging due to several limitations. Some of the main shortcomings are the lack of interpretability and the lack of robustness against adversarial…

Machine Learning · Computer Science 2025-02-18 Jon Vadillo , Roberto Santana , Jose A. Lozano

Adversarial Examples: Attacks and Defenses for Deep Learning

With rapid progress and significant successes in a wide spectrum of applications, deep learning is being applied in many safety-critical environments. However, deep neural networks have been recently found vulnerable to well-designed input…

Machine Learning · Computer Science 2018-07-10 Xiaoyong Yuan , Pan He , Qile Zhu , Xiaolin Li

Generating Adversarial Examples With Conditional Generative Adversarial Net

Recently, deep neural networks have significant progress and successful application in various fields, but they are found vulnerable to attack instances, e.g., adversarial examples. State-of-art attack methods can generate attack images by…

Machine Learning · Computer Science 2019-03-19 Ping Yu , Kaitao Song , Jianfeng Lu

Customizing an Adversarial Example Generator with Class-Conditional GANs

Adversarial examples are intentionally crafted data with the purpose of deceiving neural networks into misclassification. When we talk about strategies to create such examples, we usually refer to perturbation-based methods that fabricate…

Computer Vision and Pattern Recognition · Computer Science 2018-06-28 Shih-hong Tsai

Adversarial attacks hidden in plain sight

Convolutional neural networks have been used to achieve a string of successes during recent years, but their lack of interpretability remains a serious issue. Adversarial examples are designed to deliberately fool neural networks into…

Machine Learning · Computer Science 2020-04-28 Jan Philip Göpfert , André Artelt , Heiko Wersing , Barbara Hammer

Generating Unrestricted Adversarial Examples via Three Parameters

Deep neural networks have been shown to be vulnerable to adversarial examples deliberately constructed to misclassify victim models. As most adversarial examples have restricted their perturbations to $L_{p}$-norm, existing defense methods…

Computer Vision and Pattern Recognition · Computer Science 2021-03-16 Hanieh Naderi , Leili Goli , Shohreh Kasaei

Analyzing the Impact of Adversarial Examples on Explainable Machine Learning

Adversarial attacks are a type of attack on machine learning models where an attacker deliberately modifies the inputs to cause the model to make incorrect predictions. Adversarial attacks can have serious consequences, particularly in…

Machine Learning · Computer Science 2025-09-15 Prathyusha Devabhakthini , Sasmita Parida , Raj Mani Shukla , Suvendu Chandan Nayak , Tapadhir Das

Explainable Adversarial Attacks in Deep Neural Networks Using Activation Profiles

As neural networks become the tool of choice to solve an increasing variety of problems in our society, adversarial attacks become critical. The possibility of generating data instances deliberately designed to fool a network's analysis can…

Machine Learning · Computer Science 2021-03-19 Gabriel D. Cantareira , Rodrigo F. Mello , Fernando V. Paulovich

Adversarial Examples in Modern Machine Learning: A Review

Recent research has found that many families of machine learning models are vulnerable to adversarial examples: inputs that are specifically designed to cause the target model to produce erroneous outputs. In this survey, we focus on…

Machine Learning · Computer Science 2019-11-19 Rey Reza Wiyatno , Anqi Xu , Ousmane Dia , Archy de Berker

Game Theory for Adversarial Attacks and Defenses

Adversarial attacks can generate adversarial inputs by applying small but intentionally worst-case perturbations to samples from the dataset, which leads to even state-of-the-art deep neural networks outputting incorrect answers with high…

Machine Learning · Computer Science 2024-01-08 Shorya Sharma

Are adversarial examples inevitable?

A wide range of defenses have been proposed to harden neural networks against adversarial attacks. However, a pattern has emerged in which the majority of adversarial defenses are quickly broken by new attacks. Given the lack of success at…

Machine Learning · Computer Science 2020-02-05 Ali Shafahi , W. Ronny Huang , Christoph Studer , Soheil Feizi , Tom Goldstein

Towards Deep Learning Models Resistant to Adversarial Attacks

Recent work has demonstrated that deep neural networks are vulnerable to adversarial examples---inputs that are almost indistinguishable from natural data and yet classified incorrectly by the network. In fact, some of the latest findings…

Machine Learning · Statistics 2019-09-06 Aleksander Madry , Aleksandar Makelov , Ludwig Schmidt , Dimitris Tsipras , Adrian Vladu

On Neural Network approximation of ideal adversarial attack and convergence of adversarial training

Adversarial attacks are usually expressed in terms of a gradient-based operation on the input data and model, this results in heavy computations every time an attack is generated. In this work, we solidify the idea of representing…

Machine Learning · Computer Science 2023-08-01 Rajdeep Haldar , Qifan Song

Towards Explaining Adversarial Examples Phenomenon in Artificial Neural Networks

In this paper, we study the adversarial examples existence and adversarial training from the standpoint of convergence and provide evidence that pointwise convergence in ANNs can explain these observations. The main contribution of our…

Machine Learning · Computer Science 2022-05-27 Ramin Barati , Reza Safabakhsh , Mohammad Rahmati

Perturbations are not Enough: Generating Adversarial Examples with Spatial Distortions

Deep neural network image classifiers are reported to be susceptible to adversarial evasion attacks, which use carefully crafted images created to mislead a classifier. Recently, various kinds of adversarial attack methods have been…

Machine Learning · Computer Science 2019-10-04 He Zhao , Trung Le , Paul Montague , Olivier De Vel , Tamas Abraham , Dinh Phung