Related papers: Functional Adversarial Attacks

Towards Feature Space Adversarial Attack

We propose a new adversarial attack to Deep Neural Networks for image classification. Different from most existing attacks that directly perturb input pixels, our attack focuses on perturbing abstract features, more specifically, features…

Machine Learning · Computer Science 2020-12-17 Qiuling Xu, Guanhong Tao, Siyuan Cheng, Xiangyu Zhang

Semantic Adversarial Examples

Deep neural networks are known to be vulnerable to adversarial examples, i.e., images that are maliciously perturbed to fool the model. Generating adversarial examples has been mostly limited to finding small perturbations that maximize the…

Computer Vision and Pattern Recognition · Computer Science 2018-04-03 Hossein Hosseini, Radha Poovendran

Adversarial Image Color Transformations in Explicit Color Filter Space

Deep Neural Networks have been shown to be vulnerable to adversarial images. Conventional attacks strive for indistinguishable adversarial images with strictly restricted perturbations. Recently, researchers have moved to explore…

Computer Vision and Pattern Recognition · Computer Science 2023-06-19 Zhengyu Zhao, Zhuoran Liu, Martha Larson

Semantic Adversarial Attacks: Parametric Transformations That Fool Deep Classifiers

Deep neural networks have been shown to exhibit an intriguing vulnerability to adversarial input images corrupted with imperceptible perturbations. However, the majority of adversarial attacks assume global, fine-grained control over the…

Computer Vision and Pattern Recognition · Computer Science 2019-08-19 Ameya Joshi, Amitangshu Mukherjee, Soumik Sarkar, Chinmay Hegde

ColorFool: Semantic Adversarial Colorization

Adversarial attacks that generate small L_p-norm perturbations to mislead classifiers have limited success in black-box settings and with unseen classifiers. These attacks are also not robust to defenses that use denoising filters and to…

Computer Vision and Pattern Recognition · Computer Science 2020-04-14 Ali Shahin Shamsabadi, Ricardo Sanchez-Matilla, Andrea Cavallaro

Robust Feature-Level Adversaries are Interpretability Tools

The literature on adversarial attacks in computer vision typically focuses on pixel-level perturbations. These tend to be very difficult to interpret. Recent work that manipulates the latent representations of image generators to create…

Machine Learning · Computer Science 2023-09-12 Stephen Casper, Max Nadeau, Dylan Hadfield-Menell, Gabriel Kreiman

Adversarial Reprogramming of Neural Networks

Deep neural networks are susceptible to adversarial attacks. In computer vision, well-crafted perturbations to images can cause neural networks to make mistakes such as confusing a cat with a computer. Previous adversarial attacks…

Machine Learning · Computer Science 2019-09-12 Gamaleldin F. Elsayed, Ian Goodfellow, Jascha Sohl-Dickstein

Unrestricted Adversarial Examples via Semantic Manipulation

Machine learning models, especially deep neural networks (DNNs), have been shown to be vulnerable against adversarial examples which are carefully crafted samples with a small magnitude of the perturbation. Such adversarial perturbations…

Computer Vision and Pattern Recognition · Computer Science 2020-03-23 Anand Bhattad, Min Jin Chong, Kaizhao Liang, Bo Li, D. A. Forsyth

Generative Adversarial Perturbations

In this paper, we propose novel generative models for creating adversarial examples, slightly perturbed images resembling natural images but maliciously crafted to fool pre-trained models. We present trainable deep neural networks for…

Computer Vision and Pattern Recognition · Computer Science 2018-07-09 Omid Poursaeed, Isay Katsman, Bicheng Gao, Serge Belongie

Generating Semantic Adversarial Examples via Feature Manipulation

The vulnerability of deep neural networks to adversarial attacks has been widely demonstrated (e.g., adversarial example attacks). Traditional attacks perform unstructured pixel-wise perturbation to fool the classifier. An alternative…

Machine Learning · Computer Science 2022-05-23 Shuo Wang, Surya Nepal, Carsten Rudolph, Marthie Grobler, Shangyu Chen, Tianle Chen

Quantifying Perceptual Distortion of Adversarial Examples

Recent work has shown that additive threat models, which only permit the addition of bounded noise to the pixels of an image, are insufficient for fully capturing the space of imperceivable adversarial examples. For example, small rotations…

Machine Learning · Statistics 2019-02-25 Matt Jordan, Naren Manoj, Surbhi Goel, Alexandros G. Dimakis

Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions

Adversarial examples raise questions about whether neural network models are sensitive to the same visual features as humans. In this paper, we first detect adversarial examples or otherwise corrupted images based on a class-conditional…

Machine Learning · Computer Science 2020-02-19 Yao Qin, Nicholas Frosst, Sara Sabour, Colin Raffel, Garrison Cottrell, Geoffrey Hinton

A New Kind of Adversarial Example

Almost all adversarial attacks are formulated to add an imperceptible perturbation to an image in order to fool a model. Here, we consider the opposite which is adversarial examples that can fool a human but not a model. A large enough and…

Computer Vision and Pattern Recognition · Computer Science 2022-08-26 Ali Borji

Reversible Attack based on Local Visual Adversarial Perturbation

Adding perturbations to images can mislead classification models to produce incorrect results. Recently, researchers exploited adversarial perturbations to protect image privacy from retrieval by intelligent models. However, adding…

Computer Vision and Pattern Recognition · Computer Science 2023-01-03 Li Chen, Shaowei Zhu, Zhaoxia Yin

Content-based Unrestricted Adversarial Attack

Unrestricted adversarial attacks typically manipulate the semantic content of an image (e.g., color or texture) to create adversarial examples that are both effective and photorealistic, demonstrating their ability to deceive human…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Zhaoyu Chen, Bo Li, Shuang Wu, Kaixun Jiang, Shouhong Ding, Wenqiang Zhang

ExploreADV: Towards exploratory attack for Neural Networks

Although deep learning has made remarkable progress in processing various types of data such as images, text and speech, they are known to be susceptible to adversarial perturbations: perturbations specifically designed and added to the…

Cryptography and Security · Computer Science 2023-01-04 Tianzuo Luo, Yuyi Zhong, Siaucheng Khoo

Defense Against Adversarial Attacks using Convolutional Auto-Encoders

Deep learning models, while achieving state-of-the-art performance on many tasks, are susceptible to adversarial attacks that exploit inherent vulnerabilities in their architectures. Adversarial attacks manipulate the input data with…

Computer Vision and Pattern Recognition · Computer Science 2023-12-07 Shreyasi Mandal

Feature-Filter: Detecting Adversarial Examples through Filtering off Recessive Features

Deep neural networks (DNNs) are under threat from adversarial example attacks. The adversary can easily change the outputs of DNNs by adding small well-designed perturbations to inputs. Adversarial example detection is a fundamental work…

Machine Learning · Computer Science 2021-11-30 Hui Liu, Bo Zhao, Minzhi Ji, Yuefeng Peng, Jiabao Guo, Peng Liu

Adversarial Confusion Attack: Disrupting Multimodal Large Language Models

We introduce the Adversarial Confusion Attack, a new class of threats against multimodal large language models (MLLMs). Unlike jailbreaks or targeted misclassification, the goal is to induce systematic disruption that makes the model…

Computation and Language · Computer Science 2025-12-02 Jakub Hoscilowicz, Artur Janicki

Cross-Domain Transferability of Adversarial Perturbations

Adversarial examples reveal the blind spots of deep neural networks (DNNs) and represent a major concern for security-critical applications. The transferability of adversarial examples makes real-world attacks possible in black-box…

Computer Vision and Pattern Recognition · Computer Science 2019-10-16 Muzammal Naseer, Salman H. Khan, Harris Khan, Fahad Shahbaz Khan, Fatih Porikli