Related papers: Deflecting Adversarial Attacks

Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions

Adversarial examples raise questions about whether neural network models are sensitive to the same visual features as humans. In this paper, we first detect adversarial examples or otherwise corrupted images based on a class-conditional…

Machine Learning · Computer Science 2020-02-19 Yao Qin , Nicholas Frosst , Sara Sabour , Colin Raffel , Garrison Cottrell , Geoffrey Hinton

Adversarial attacks hidden in plain sight

Convolutional neural networks have been used to achieve a string of successes during recent years, but their lack of interpretability remains a serious issue. Adversarial examples are designed to deliberately fool neural networks into…

Machine Learning · Computer Science 2020-04-28 Jan Philip Göpfert , André Artelt , Heiko Wersing , Barbara Hammer

DARCCC: Detecting Adversaries by Reconstruction from Class Conditional Capsules

We present a simple technique that allows capsule models to detect adversarial images. In addition to being trained to classify images, the capsule model is trained to reconstruct the images from the pose parameters and identity of the…

Machine Learning · Computer Science 2018-11-19 Nicholas Frosst , Sara Sabour , Geoffrey Hinton

Concept-based Adversarial Attacks: Tricking Humans and Classifiers Alike

We propose to generate adversarial samples by modifying activations of upper layers encoding semantically meaningful concepts. The original sample is shifted towards a target sample, yielding an adversarial sample, by using the modified…

Machine Learning · Computer Science 2022-03-22 Johannes Schneider , Giovanni Apruzzese

Adversary Aware Continual Learning

Class incremental learning approaches are useful as they help the model to learn new information (classes) sequentially, while also retaining the previously acquired information (classes). However, it has been shown that such approaches are…

Machine Learning · Computer Science 2023-05-01 Muhammad Umer , Robi Polikar

Adversarial Examples Are Not Easily Detected: Bypassing Ten Detection Methods

Neural networks are known to be vulnerable to adversarial examples: inputs that are close to natural inputs but classified incorrectly. In order to better understand the space of adversarial examples, we survey ten recent proposals that are…

Machine Learning · Computer Science 2017-11-02 Nicholas Carlini , David Wagner

Contributor-Aware Defenses Against Adversarial Backdoor Attacks

Deep neural networks for image classification are well-known to be vulnerable to adversarial attacks. One such attack that has garnered recent attention is the adversarial backdoor attack, which has demonstrated the capability to perform…

Cryptography and Security · Computer Science 2022-06-09 Glenn Dawson , Muhammad Umer , Robi Polikar

Adversarial Examples Detection in Deep Networks with Convolutional Filter Statistics

Deep learning has greatly improved visual recognition in recent years. However, recent research has shown that there exist many adversarial examples that can negatively impact the performance of such an architecture. This paper focuses on…

Computer Vision and Pattern Recognition · Computer Science 2017-10-30 Xin Li , Fuxin Li

A New Defense Against Adversarial Images: Turning a Weakness into a Strength

Natural images are virtually surrounded by low-density misclassified regions that can be efficiently discovered by gradient-guided search --- enabling the generation of adversarial images. While many techniques for detecting these attacks…

Machine Learning · Computer Science 2019-12-05 Tao Yu , Shengyuan Hu , Chuan Guo , Wei-Lun Chao , Kilian Q. Weinberger

A New Kind of Adversarial Example

Almost all adversarial attacks are formulated to add an imperceptible perturbation to an image in order to fool a model. Here, we consider the opposite which is adversarial examples that can fool a human but not a model. A large enough and…

Computer Vision and Pattern Recognition · Computer Science 2022-08-26 Ali Borji

Defense Against Adversarial Attacks using Convolutional Auto-Encoders

Deep learning models, while achieving state-of-the-art performance on many tasks, are susceptible to adversarial attacks that exploit inherent vulnerabilities in their architectures. Adversarial attacks manipulate the input data with…

Computer Vision and Pattern Recognition · Computer Science 2023-12-07 Shreyasi Mandal

Trace and Detect Adversarial Attacks on CNNs using Feature Response Maps

The existence of adversarial attacks on convolutional neural networks (CNN) questions the fitness of such models for serious applications. The attacks manipulate an input image such that misclassification is evoked while still looking…

Computer Vision and Pattern Recognition · Computer Science 2022-08-25 Mohammadreza Amirian , Friedhelm Schwenker , Thilo Stadelmann

Adversarial Defense Through Network Profiling Based Path Extraction

Recently, researchers have started decomposing deep neural network models according to their semantics or functions. Recent work has shown the effectiveness of decomposed functional blocks for defending adversarial attacks, which add small…

Machine Learning · Computer Science 2019-05-10 Yuxian Qiu , Jingwen Leng , Cong Guo , Quan Chen , Chao Li , Minyi Guo , Yuhao Zhu

Adversarial Examples Detection beyond Image Space

Deep neural networks have been proved that they are vulnerable to adversarial examples, which are generated by adding human-imperceptible perturbations to images. To defend these adversarial examples, various detection based methods have…

Computer Vision and Pattern Recognition · Computer Science 2021-02-24 Kejiang Chen , Yuefeng Chen , Hang Zhou , Chuan Qin , Xiaofeng Mao , Weiming Zhang , Nenghai Yu

Adversarial Attacks on Face Detectors using Neural Net based Constrained Optimization

Adversarial attacks involve adding, small, often imperceptible, perturbations to inputs with the goal of getting a machine learning model to misclassifying them. While many different adversarial attack strategies have been proposed on image…

Computer Vision and Pattern Recognition · Computer Science 2018-06-01 Avishek Joey Bose , Parham Aarabi

Exploring the role of Input and Output Layers of a Deep Neural Network in Adversarial Defense

Deep neural networks are learning models having achieved state of the art performance in many fields like prediction, computer vision, language processing and so on. However, it has been shown that certain inputs exist which would not trick…

Machine Learning · Computer Science 2020-06-03 Jay N. Paranjape , Rahul Kumar Dubey , Vijendran V Gopalan

Defense Against Adversarial Attacks Using Feature Scattering-based Adversarial Training

We introduce a feature scattering-based adversarial training approach for improving model robustness against adversarial attacks. Conventional adversarial training approaches leverage a supervised scheme (either targeted or non-targeted) in…

Computer Vision and Pattern Recognition · Computer Science 2019-11-25 Haichao Zhang , Jianyu Wang

Deflecting Adversarial Attacks with Pixel Deflection

CNNs are poised to become integral parts of many critical systems. Despite their robustness to natural variations, image pixel values can be manipulated, via small, carefully crafted, imperceptible perturbations, to cause a model to…

Computer Vision and Pattern Recognition · Computer Science 2018-04-03 Aaditya Prakash , Nick Moran , Solomon Garber , Antonella DiLillo , James Storer

Detecting Adversarial Examples via Key-based Network

Though deep neural networks have achieved state-of-the-art performance in visual classification, recent studies have shown that they are all vulnerable to the attack of adversarial examples. Small and often imperceptible perturbations to…

Machine Learning · Computer Science 2018-06-05 Pinlong Zhao , Zhouyu Fu , Ou wu , Qinghua Hu , Jun Wang

ReabsNet: Detecting and Revising Adversarial Examples

Though deep neural network has hit a huge success in recent studies and applica- tions, it still remains vulnerable to adversarial perturbations which are imperceptible to humans. To address this problem, we propose a novel network called…

Machine Learning · Computer Science 2017-12-25 Jiefeng Chen , Zihang Meng , Changtian Sun , Wei Tang , Yinglun Zhu