Related papers: Adversarial Attack Type I: Cheat Classifiers by Si…

Type I Attack for Generative Models

Generative models are popular tools with a wide range of applications. Nevertheless, they are as vulnerable to adversarial samples as classifiers. The existing attack methods mainly focus on generating adversarial examples by adding…

Computer Vision and Pattern Recognition · Computer Science 2020-03-05 Chengjin Sun, Sizhe Chen, Jia Cai, Xiaolin Huang

Semantic Adversarial Attacks: Parametric Transformations That Fool Deep Classifiers

Deep neural networks have been shown to exhibit an intriguing vulnerability to adversarial input images corrupted with imperceptible perturbations. However, the majority of adversarial attacks assume global, fine-grained control over the…

Computer Vision and Pattern Recognition · Computer Science 2019-08-19 Ameya Joshi, Amitangshu Mukherjee, Soumik Sarkar, Chinmay Hegde

On the reversibility of adversarial attacks

Adversarial attacks modify images with perturbations that change the prediction of classifiers. These modified images, known as adversarial examples, expose the vulnerabilities of deep neural network classifiers. In this paper, we…

Machine Learning · Computer Science 2022-06-03 Chau Yi Li, Ricardo Sánchez-Matilla, Ali Shahin Shamsabadi, Riccardo Mazzon, Andrea Cavallaro

On Adversarial Examples and Stealth Attacks in Artificial Intelligence Systems

In this work we present a formal theoretical framework for assessing and analyzing two classes of malevolent action towards generic Artificial Intelligence (AI) systems. Our results apply to general multi-class classifiers that map from an…

Machine Learning · Computer Science 2021-01-01 Ivan Y. Tyukin, Desmond J. Higham, Alexander N. Gorban

Concept-based Adversarial Attacks: Tricking Humans and Classifiers Alike

We propose to generate adversarial samples by modifying activations of upper layers encoding semantically meaningful concepts. The original sample is shifted towards a target sample, yielding an adversarial sample, by using the modified…

Machine Learning · Computer Science 2022-03-22 Johannes Schneider, Giovanni Apruzzese

How adversarial attacks can disrupt seemingly stable accurate classifiers

Adversarial attacks dramatically change the output of an otherwise accurate learning system using a seemingly inconsequential modification to a piece of input data. Paradoxically, empirical evidence indicates that even systems which are…

Machine Learning · Computer Science 2024-09-13 Oliver J. Sutton, Qinghua Zhou, Ivan Y. Tyukin, Alexander N. Gorban, Alexander Bastounis, Desmond J. Higham

Adversarial Attacks on Neural Network Policies

Machine learning classifiers are known to be vulnerable to inputs maliciously constructed by adversaries to force misclassification. Such adversarial examples have been extensively studied in the context of computer vision applications. In…

Machine Learning · Computer Science 2017-02-09 Sandy Huang, Nicolas Papernot, Ian Goodfellow, Yan Duan, Pieter Abbeel

Differentiable Language Model Adversarial Attacks on Categorical Sequence Classifiers

An adversarial attack paradigm explores various scenarios for the vulnerability of deep learning models: minor changes of the input can force a model failure. Most of the state-of-the-art frameworks focus on adversarial attacks for images…

Machine Learning · Computer Science 2020-06-22 I. Fursov, A. Zaytsev, N. Kluchnikov, A. Kravchenko, E. Burnaev

On the Transferability of Adversarial Attacks against Neural Text Classifier

Deep neural networks are vulnerable to adversarial attacks, where a small perturbation to an input alters the model prediction. In many cases, malicious inputs intentionally crafted for one model can fool another model. In this paper, we…

Machine Learning · Computer Science 2021-09-23 Liping Yuan, Xiaoqing Zheng, Yi Zhou, Cho-Jui Hsieh, Kai-wei Chang

Identifying Adversarial Attacks on Text Classifiers

The landscape of adversarial attacks against text classifiers continues to grow, with new attacks developed every year and many of them available in standard toolkits, such as TextAttack and OpenAttack. In response, there is a growing body…

Computation and Language · Computer Science 2022-01-24 Zhouhang Xie, Jonathan Brophy, Adam Noack, Wencong You, Kalyani Asthana, Carter Perkins, Sabrina Reis, Sameer Singh, Daniel Lowd

Perturbations are not Enough: Generating Adversarial Examples with Spatial Distortions

Deep neural network image classifiers are reported to be susceptible to adversarial evasion attacks, which use carefully crafted images created to mislead a classifier. Recently, various kinds of adversarial attack methods have been…

Machine Learning · Computer Science 2019-10-04 He Zhao, Trung Le, Paul Montague, Olivier De Vel, Tamas Abraham, Dinh Phung

Adversarial Attacks and Dimensionality in Text Classifiers

Adversarial attacks on machine learning algorithms have been a key deterrent to the adoption of AI in many real-world use cases. They significantly undermine the ability of high-performance neural networks by forcing misclassifications.…

Machine Learning · Computer Science 2024-04-04 Nandish Chattopadhyay, Atreya Goswami, Anupam Chattopadhyay

Adversarial attacks hidden in plain sight

Convolutional neural networks have been used to achieve a string of successes during recent years, but their lack of interpretability remains a serious issue. Adversarial examples are designed to deliberately fool neural networks into…

Machine Learning · Computer Science 2020-04-28 Jan Philip Göpfert, André Artelt, Heiko Wersing, Barbara Hammer

Adversarial examples in the physical world

Most existing machine learning classifiers are highly vulnerable to adversarial examples. An adversarial example is a sample of input data which has been modified very slightly in a way that is intended to cause a machine learning…

Computer Vision and Pattern Recognition · Computer Science 2017-02-14 Alexey Kurakin, Ian Goodfellow, Samy Bengio

A New Kind of Adversarial Example

Almost all adversarial attacks are formulated to add an imperceptible perturbation to an image in order to fool a model. Here, we consider the opposite: adversarial examples that can fool a human but not a model. A large enough and…

Computer Vision and Pattern Recognition · Computer Science 2022-08-26 Ali Borji

A Practical Adversarial Attack against Sequence-based Deep Learning Malware Classifiers

Sequence-based deep learning models (e.g., RNNs) can detect malware by analyzing its behavioral sequences. Meanwhile, these models are susceptible to adversarial attacks. Attackers can create adversarial samples that alter the sequence…

Cryptography and Security · Computer Science 2025-09-16 Kai Tan, Dongyang Zhan, Lin Ye, Hongli Zhang, Binxing Fang

Adversarial Attacks on Convolutional Neural Networks in Facial Recognition Domain

Numerous recent studies have demonstrated how Deep Neural Network (DNN) classifiers can be fooled by adversarial examples, in which an attacker adds perturbations to an original sample, causing the classifier to misclassify the sample.…

Machine Learning · Computer Science 2021-02-09 Yigit Alparslan, Ken Alparslan, Jeremy Keim-Shenk, Shweta Khade, Rachel Greenstadt

Understanding and Enhancing the Transferability of Adversarial Examples

State-of-the-art deep neural networks are known to be vulnerable to adversarial examples, formed by applying small but malicious perturbations to the original inputs. Moreover, the perturbations can transfer across models:…

Machine Learning · Statistics 2018-02-28 Lei Wu, Zhanxing Zhu, Cheng Tai, Weinan E

Combating Adversaries with Anti-Adversaries

Deep neural networks are vulnerable to small input perturbations known as adversarial attacks. Inspired by the fact that these adversaries are constructed by iteratively minimizing the confidence of a network for the true class label, we…

Machine Learning · Computer Science 2021-12-17 Motasem Alfarra, Juan C. Pérez, Ali Thabet, Adel Bibi, Philip H. S. Torr, Bernard Ghanem

Adversarial Attacks on Hidden Tasks in Multi-Task Learning

Deep learning models are susceptible to adversarial attacks, where slight perturbations to input data lead to misclassification. Adversarial attacks become increasingly effective with access to information about the targeted classifier. In…

Machine Learning · Computer Science 2024-05-29 Yu Zhe, Rei Nagaike, Daiki Nishiyama, Kazuto Fukuchi, Jun Sakuma