Related papers: Efficient and Robust Classification for Sparse Att…

Robust Classification Under $\ell_0$ Attack for the Gaussian Mixture Model

It is well-known that machine learning models are vulnerable to small but cleverly-designed adversarial perturbations that can cause misclassification. While there has been major progress in designing attacks and defenses for various…

Machine Learning · Computer Science 2021-04-07 Payam Delgosha , Hamed Hassani , Ramtin Pedarsani

Robust Adversarial Learning via Sparsifying Front Ends

It is by now well-known that small adversarial perturbations can induce classification errors in deep neural networks. In this paper, we take a bottom-up signal processing perspective to this problem and show that a systematic exploitation…

Machine Learning · Statistics 2021-05-26 Soorya Gopalakrishnan , Zhinus Marzi , Metehan Cekic , Upamanyu Madhow , Ramtin Pedarsani

Sparsity-based Defense against Adversarial Attacks on Linear Classifiers

Deep neural networks represent the state of the art in machine learning in a growing number of fields, including vision, speech and natural language processing. However, recent work raises important questions about the robustness of such…

Machine Learning · Statistics 2018-06-20 Zhinus Marzi , Soorya Gopalakrishnan , Upamanyu Madhow , Ramtin Pedarsani

Certified Robustness against Sparse Adversarial Perturbations via Data Localization

Recent work in adversarial robustness suggests that natural data distributions are localized, i.e., they place high probability in small volume regions of the input space, and that this property can be utilized for designing classifiers…

Machine Learning · Computer Science 2024-05-24 Ambar Pal , René Vidal , Jeremias Sulam

Binary Classification Under $\ell_0$ Attacks for General Noise Distribution

Adversarial examples have recently drawn considerable attention in the field of machine learning due to the fact that small perturbations in the data can result in major performance degradation. This phenomenon is usually modeled by a…

Machine Learning · Computer Science 2022-03-10 Payam Delgosha , Hamed Hassani , Ramtin Pedarsani

Sparse and Imperceivable Adversarial Attacks

Neural networks have been proven to be vulnerable to a variety of adversarial attacks. From a safety perspective, highly sparse adversarial attacks are particularly dangerous. On the other hand the pixelwise perturbations of sparse attacks…

Machine Learning · Computer Science 2019-09-12 Francesco Croce , Matthias Hein

Combating Adversarial Attacks Using Sparse Representations

It is by now well-known that small adversarial perturbations can induce classification errors in deep neural networks (DNNs). In this paper, we make the case that sparse representations of the input data are a crucial tool for combating…

Machine Learning · Statistics 2018-07-16 Soorya Gopalakrishnan , Zhinus Marzi , Upamanyu Madhow , Ramtin Pedarsani

Generalization Properties of Adversarial Training for $\ell_0$-Bounded Adversarial Attacks

We have widely observed that neural networks are vulnerable to small additive perturbations to the input causing misclassification. In this paper, we focus on the $\ell_0$-bounded adversarial attacks, and aim to theoretically characterize…

Machine Learning · Computer Science 2024-02-07 Payam Delgosha , Hamed Hassani , Ramtin Pedarsani

Evaluating Robustness of Neural Networks with Mixed Integer Programming

Neural networks have demonstrated considerable success on a wide variety of real-world problems. However, networks trained only to optimize for training accuracy can often be fooled by adversarial examples - slightly perturbed inputs that…

Machine Learning · Computer Science 2019-02-19 Vincent Tjeng , Kai Xiao , Russ Tedrake

Adv-BNN: Improved Adversarial Defense through Robust Bayesian Neural Network

We present a new algorithm to train a robust neural network against adversarial attacks. Our algorithm is motivated by the following two ideas. First, although recent work has demonstrated that fusing randomness can improve the robustness…

Machine Learning · Computer Science 2019-05-07 Xuanqing Liu , Yao Li , Chongruo Wu , Cho-Jui Hsieh

Robustness Certificates for Sparse Adversarial Attacks by Randomized Ablation

Recently, techniques have been developed to provably guarantee the robustness of a classifier to adversarial perturbations of bounded L_1 and L_2 magnitudes by using randomized smoothing: the robust classification is a consensus of base…

Machine Learning · Computer Science 2019-11-22 Alexander Levine , Soheil Feizi

Improving the Certified Robustness of Neural Networks via Consistency Regularization

A range of defense methods have been proposed to improve the robustness of neural networks on adversarial examples, among which provable defense methods have been demonstrated to be effective to train neural networks that are certifiably…

Machine Learning · Computer Science 2021-01-21 Mengting Xu , Tao Zhang , Zhongnian Li , Daoqiang Zhang

Towards Interpretable Adversarial Examples via Sparse Adversarial Attack

Sparse attacks are to optimize the magnitude of adversarial perturbations for fooling deep neural networks (DNNs) involving only a few perturbed pixels (i.e., under the l0 constraint), suitable for interpreting the vulnerability of DNNs.…

Machine Learning · Computer Science 2025-06-24 Fudong Lin , Jiadong Lou , Hao Wang , Brian Jalaian , Xu Yuan

Achieving Verified Robustness to Symbol Substitutions via Interval Bound Propagation

Neural networks are part of many contemporary NLP systems, yet their empirical successes come at the price of vulnerability to adversarial attacks. Previous work has used adversarial training and data augmentation to partially mitigate such…

Computation and Language · Computer Science 2019-12-23 Po-Sen Huang , Robert Stanforth , Johannes Welbl , Chris Dyer , Dani Yogatama , Sven Gowal , Krishnamurthy Dvijotham , Pushmeet Kohli

Adversarial Risk Bounds for Neural Networks through Sparsity based Compression

Neural networks have been shown to be vulnerable against minor adversarial perturbations of their inputs, especially for high dimensional data under $\ell_\infty$ attacks. To combat this problem, techniques like adversarial training have…

Machine Learning · Computer Science 2019-06-04 Emilio Rafael Balda , Arash Behboodi , Niklas Koep , Rudolf Mathar

Adversarial Explanations for Understanding Image Classification Decisions and Improved Neural Network Robustness

For sensitive problems, such as medical imaging or fraud detection, Neural Network (NN) adoption has been slow due to concerns about their reliability, leading to a number of algorithms for explaining their decisions. NNs have also been…

Machine Learning · Computer Science 2019-11-06 Walt Woods , Jack Chen , Christof Teuscher

On The Relationship Between Universal Adversarial Attacks And Sparse Representations

The prominent success of neural networks, mainly in computer vision tasks, is increasingly shadowed by their sensitivity to small, barely perceivable adversarial perturbations in image input. In this work, we aim at explaining this…

Computer Vision and Pattern Recognition · Computer Science 2023-11-15 Dana Weitzner , Raja Giryes

DeepSafe: A Data-driven Approach for Checking Adversarial Robustness in Neural Networks

Deep neural networks have become widely used, obtaining remarkable results in domains such as computer vision, speech recognition, natural language processing, audio recognition, social network filtering, machine translation, and…

Neural and Evolutionary Computing · Computer Science 2020-02-03 Divya Gopinath , Guy Katz , Corina S. Pasareanu , Clark Barrett

Convergence of Adversarial Training in Overparametrized Neural Networks

Neural networks are vulnerable to adversarial examples, i.e. inputs that are imperceptibly perturbed from natural data and yet incorrectly classified by the network. Adversarial training, a heuristic form of robust optimization that…

Machine Learning · Computer Science 2019-11-12 Ruiqi Gao , Tianle Cai , Haochuan Li , Liwei Wang , Cho-Jui Hsieh , Jason D. Lee

Exploiting vulnerabilities of deep neural networks for privacy protection

Adversarial perturbations can be added to images to protect their content from unwanted inferences. These perturbations may, however, be ineffective against classifiers that were not {seen} during the generation of the perturbation, or…

Computer Vision and Pattern Recognition · Computer Science 2020-10-01 Ricardo Sanchez-Matilla , Chau Yi Li , Ali Shahin Shamsabadi , Riccardo Mazzon , Andrea Cavallaro