Related papers: Provably Adversarially Robust Nearest Prototype Cl…

Understanding and Improving Adversarial Robustness of Neural Probabilistic Circuits

Neural Probabilistic Circuits (NPCs), a new class of concept bottleneck models, comprise an attribute recognition model and a probabilistic circuit for reasoning. By integrating the outputs from these two modules, NPCs produce compositional…

Machine Learning · Computer Science 2025-09-26 Weixin Chen , Han Zhao

Evaluating the Robustness of Nearest Neighbor Classifiers: A Primal-Dual Perspective

We study the problem of computing the minimum adversarial perturbation of the Nearest Neighbor (NN) classifiers. Previous attempts either conduct attacks on continuous approximations of NN models or search for the perturbation by some…

Machine Learning · Computer Science 2019-06-11 Lu Wang , Xuanqing Liu , Jinfeng Yi , Zhi-Hua Zhou , Cho-Jui Hsieh

Certified Robustness against Sparse Adversarial Perturbations via Data Localization

Recent work in adversarial robustness suggests that natural data distributions are localized, i.e., they place high probability in small volume regions of the input space, and that this property can be utilized for designing classifiers…

Machine Learning · Computer Science 2024-05-24 Ambar Pal , René Vidal , Jeremias Sulam

On the Minimal Adversarial Perturbation for Deep Neural Networks with Provable Estimation Error

Although Deep Neural Networks (DNNs) have shown incredible performance in perceptive and control tasks, several trustworthy issues are still open. One of the most discussed topics is the existence of adversarial perturbations, which has…

Machine Learning · Computer Science 2023-01-10 Fabio Brau , Giulio Rossolini , Alessandro Biondi , Giorgio Buttazzo

Towards Certified Probabilistic Robustness with High Accuracy

Adversarial examples pose a security threat to many critical systems built on neural networks (such as face recognition systems, and self-driving cars). While many methods have been proposed to build robust models, how to build certifiably…

Machine Learning · Computer Science 2023-09-06 Ruihan Zhang , Peixin Zhang , Jun Sun

Adversarial Robustness Guarantees for Classification with Gaussian Processes

We investigate adversarial robustness of Gaussian Process Classification (GPC) models. Given a compact subset of the input space $T\subseteq \mathbb{R}^d$ enclosing a test point $x^*$ and a GPC trained on a dataset $\mathcal{D}$, we aim to…

Machine Learning · Statistics 2020-03-12 Arno Blaas , Andrea Patane , Luca Laurenti , Luca Cardelli , Marta Kwiatkowska , Stephen Roberts

PROVEN: Certifying Robustness of Neural Networks with a Probabilistic Approach

With deep neural networks providing state-of-the-art machine learning models for numerous machine learning tasks, quantifying the robustness of these models has become an important area of research. However, most of the research literature…

Machine Learning · Computer Science 2019-01-08 Tsui-Wei Weng , Pin-Yu Chen , Lam M. Nguyen , Mark S. Squillante , Ivan Oseledets , Luca Daniel

Is Certifying $\ell_p$ Robustness Still Worthwhile?

Over the years, researchers have developed myriad attacks that exploit the ubiquity of adversarial examples, as well as defenses that aim to guard against the security vulnerabilities posed by such attacks. Of particular interest to this…

Machine Learning · Computer Science 2023-10-17 Ravi Mangal , Klas Leino , Zifan Wang , Kai Hu , Weicheng Yu , Corina Pasareanu , Anupam Datta , Matt Fredrikson

Robustness Certificates Against Adversarial Examples for ReLU Networks

While neural networks have achieved high performance in different learning tasks, their accuracy drops significantly in the presence of small adversarial perturbations to inputs. Defenses based on regularization and adversarial training are…

Machine Learning · Computer Science 2019-02-07 Sahil Singla , Soheil Feizi

Adversarial Robustness Guarantees for Random Deep Neural Networks

The reliability of deep learning algorithms is fundamentally challenged by the existence of adversarial examples, which are incorrectly classified inputs that are extremely close to a correctly classified input. We explore the properties of…

Machine Learning · Statistics 2021-07-23 Giacomo De Palma , Bobak T. Kiani , Seth Lloyd

Second-Order Provable Defenses against Adversarial Attacks

A robustness certificate is the minimum distance of a given input to the decision boundary of the classifier (or its lower bound). For {\it any} input perturbations with a magnitude smaller than the certificate value, the classification…

Machine Learning · Computer Science 2020-06-02 Sahil Singla , Soheil Feizi

Certifiable Robustness for Nearest Neighbor Classifiers

ML models are typically trained using large datasets of high quality. However, training datasets often contain inconsistent or incomplete data. To tackle this issue, one solution is to develop algorithms that can check whether a prediction…

Machine Learning · Computer Science 2022-01-19 Austen Z. Fan , Paraschos Koutris

Certified Adversarial Robustness with Additive Noise

The existence of adversarial data examples has drawn significant attention in the deep-learning community; such data are seemingly minimally perturbed relative to the original data, but lead to very different outputs from a deep-learning…

Machine Learning · Computer Science 2019-11-12 Bai Li , Changyou Chen , Wenlin Wang , Lawrence Carin

Robustness Certificates for Sparse Adversarial Attacks by Randomized Ablation

Recently, techniques have been developed to provably guarantee the robustness of a classifier to adversarial perturbations of bounded L_1 and L_2 magnitudes by using randomized smoothing: the robust classification is a consensus of base…

Machine Learning · Computer Science 2019-11-22 Alexander Levine , Soheil Feizi

Certified Adversarial Robustness Within Multiple Perturbation Bounds

Randomized smoothing (RS) is a well known certified defense against adversarial attacks, which creates a smoothed classifier by predicting the most likely class under random noise perturbations of inputs during inference. While initial work…

Machine Learning · Computer Science 2023-04-21 Soumalya Nandi , Sravanti Addepalli , Harsh Rangwani , R. Venkatesh Babu

On adversarial training and the 1 Nearest Neighbor classifier

The ability to fool deep learning classifiers with tiny perturbations of the input has lead to the development of adversarial training in which the loss with respect to adversarial examples is minimized in addition to the training examples.…

Machine Learning · Computer Science 2024-07-30 Amir Hagai , Yair Weiss

Optimizing Information Loss Towards Robust Neural Networks

Neural Networks (NNs) are vulnerable to adversarial examples. Such inputs differ only slightly from their benign counterparts yet provoke misclassifications of the attacked NNs. The required perturbations to craft the examples are often…

Cryptography and Security · Computer Science 2020-09-30 Philip Sperl , Konstantin Böttinger

Defending with Errors: Approximate Computing for Robustness of Deep Neural Networks

Machine-learning architectures, such as Convolutional Neural Networks (CNNs) are vulnerable to adversarial attacks: inputs crafted carefully to force the system output to a wrong label. Since machine-learning is being deployed in…

Cryptography and Security · Computer Science 2022-11-03 Amira Guesmi , Ihsen Alouani , Khaled N. Khasawneh , Mouna Baklouti , Tarek Frikha , Mohamed Abid , Nael Abu-Ghazaleh

Evaluating Robustness of Neural Networks with Mixed Integer Programming

Neural networks have demonstrated considerable success on a wide variety of real-world problems. However, networks trained only to optimize for training accuracy can often be fooled by adversarial examples - slightly perturbed inputs that…

Machine Learning · Computer Science 2019-02-19 Vincent Tjeng , Kai Xiao , Russ Tedrake

Robust-by-Design Classification via Unitary-Gradient Neural Networks

The use of neural networks in safety-critical systems requires safe and robust models, due to the existence of adversarial attacks. Knowing the minimal adversarial perturbation of any input x, or, equivalently, knowing the distance of x…

Machine Learning · Computer Science 2022-09-12 Fabio Brau , Giulio Rossolini , Alessandro Biondi , Giorgio Buttazzo