Related papers: Adversarial Surrogate Risk Bounds for Binary Class…

The Consistency of Adversarial Training for Binary Classification

Robustness to adversarial perturbations is of paramount concern in modern machine learning. One of the state-of-the-art methods for training robust classifiers is adversarial training, which involves minimizing a supremum-based surrogate…

Machine Learning · Computer Science 2023-05-18 Natalie S. Frank, Jonathan Niles-Weed

The Adversarial Consistency of Surrogate Risks for Binary Classification

We study the consistency of surrogate risks for robust binary classification. It is common to learn robust classifiers by adversarial training, which seeks to minimize the expected $0$-$1$ loss when each example can be maliciously corrupted…

Machine Learning · Computer Science 2025-10-09 Natalie Frank, Jonathan Niles-Weed

Adversarial Consistency and the Uniqueness of the Adversarial Bayes Classifier

Minimizing an adversarial surrogate risk is a common technique for learning robust classifiers. Prior work showed that convex surrogate losses are not statistically consistent in the adversarial context -- or in other words, a minimizing…

Machine Learning · Computer Science 2025-09-29 Natalie S. Frank

Existence and Minimax Theorems for Adversarial Surrogate Risks in Binary Classification

Adversarial training is one of the most popular methods for training models robust to adversarial attacks; however, it is not well understood from a theoretical perspective. We prove existence, regularity, and minimax theorems for…

Machine Learning · Computer Science 2025-10-09 Natalie S. Frank, Jonathan Niles-Weed

Asymptotic Behavior of Adversarial Training in Binary Classification

It has been consistently reported that many machine learning models are susceptible to adversarial attacks, i.e., small additive adversarial perturbations applied to data points can cause misclassification. Adversarial training using…

Machine Learning · Statistics 2021-07-15 Hossein Taheri, Ramtin Pedarsani, Christos Thrampoulidis

Towards Consistency in Adversarial Classification

In this paper, we study the problem of consistency in the context of adversarial examples. Specifically, we tackle the following question: can surrogate losses still be used as a proxy for minimizing the $0/1$ loss in the presence of an…

Machine Learning · Computer Science 2022-05-23 Laurent Meunier, Raphaël Ettedgui, Rafael Pinot, Yann Chevaleyre, Jamal Atif

Convergence of Adversarial Training in Overparametrized Neural Networks

Neural networks are vulnerable to adversarial examples, i.e. inputs that are imperceptibly perturbed from natural data and yet incorrectly classified by the network. Adversarial training, a heuristic form of robust optimization that…

Machine Learning · Computer Science 2019-11-12 Ruiqi Gao, Tianle Cai, Haochuan Li, Liwei Wang, Cho-Jui Hsieh, Jason D. Lee

Adversarially Robust Estimate and Risk Analysis in Linear Regression

Adversarially robust learning aims to design algorithms that are robust to small adversarial perturbations on input variables. Beyond the existing studies on the predictive performance to adversarial samples, our goal is to understand…

Machine Learning · Statistics 2020-12-21 Yue Xing, Ruizhi Zhang, Guang Cheng

Towards Calibrated Losses for Adversarial Robust Reject Option Classification

Robustness towards adversarial attacks is a vital property for classifiers in several applications such as autonomous driving, medical diagnosis, etc. Also, in such scenarios, where the cost of misclassification is very high, knowing when…

Machine Learning · Computer Science 2024-10-15 Vrund Shah, Tejas Chaudhari, Naresh Manwani

Adversarial Coreset Selection for Efficient Robust Training

Neural networks are vulnerable to adversarial attacks: adding well-crafted, imperceptible perturbations to their input can modify their output. Adversarial training is one of the most effective approaches to training robust models against…

Machine Learning · Computer Science 2023-08-09 Hadi M. Dolatabadi, Sarah Erfani, Christopher Leckie

Adversarial Risk and the Dangers of Evaluating Against Weak Attacks

This paper investigates recently proposed approaches for defending against adversarial examples and evaluating adversarial robustness. We motivate 'adversarial risk' as an objective for achieving models robust to worst-case inputs. We then…

Machine Learning · Computer Science 2018-06-13 Jonathan Uesato, Brendan O'Donoghue, Aaron van den Oord, Pushmeet Kohli

ATRO: Adversarial Training with a Rejection Option

This paper proposes a classification framework with a rejection option to mitigate the performance deterioration caused by adversarial examples. While recent machine learning algorithms achieve high prediction performance, they are…

Machine Learning · Computer Science 2020-10-27 Masahiro Kato, Zhenghang Cui, Yoshihiro Fukuhara

Calibrated Surrogate Losses for Adversarially Robust Classification

Adversarially robust classification seeks a classifier that is insensitive to adversarial perturbations of test patterns. This problem is often formulated via a minimax objective, where the target loss is the worst-case value of the 0-1…

Machine Learning · Statistics 2021-05-14 Han Bao, Clayton Scott, Masashi Sugiyama

Instance adaptive adversarial training: Improved accuracy tradeoffs in neural nets

Adversarial training is by far the most successful strategy for improving robustness of neural networks to adversarial attacks. Despite its success as a defense mechanism, adversarial training fails to generalize well to unperturbed test…

Machine Learning · Computer Science 2019-10-18 Yogesh Balaji, Tom Goldstein, Judy Hoffman

Robustifying Binary Classification to Adversarial Perturbation

Despite the enormous success of machine learning models in various applications, most of these models lack resilience to (even small) perturbations in their input data. Hence, new methods to robustify machine learning models seem very…

Machine Learning · Computer Science 2020-10-30 Fariborz Salehi, Babak Hassibi

Consistent Robust Adversarial Prediction for General Multiclass Classification

We propose a robust adversarial prediction framework for general multiclass classification. Our method seeks predictive distributions that robustly optimize non-convex and non-continuous multiclass loss metrics against the worst-case…

Machine Learning · Statistics 2019-11-21 Rizal Fathony, Kaiser Asif, Anqi Liu, Mohammad Ali Bashiri, Wei Xing, Sima Behpour, Xinhua Zhang, Brian D. Ziebart

Opportunities and Challenges in Deep Learning Adversarial Robustness: A Survey

As we seek to deploy machine learning models beyond virtual and controlled domains, it is critical to analyze not only a model's accuracy, or the fact that it works most of the time, but also whether such a model is truly robust and reliable. This paper…

Machine Learning · Computer Science 2020-07-07 Samuel Henrique Silva, Peyman Najafirad

Adversarial Risk via Optimal Transport and Optimal Couplings

Modern machine learning algorithms perform poorly on adversarially manipulated data. Adversarial risk quantifies the error of classifiers in adversarial settings; adversarial classifiers minimize adversarial risk. In this paper, we analyze…

Machine Learning · Computer Science 2020-12-24 Muni Sreenivas Pydi, Varun Jog

On the Error Resistance of Hinge Loss Minimization

Commonly used classification algorithms in machine learning, such as support vector machines, minimize a convex surrogate loss on training examples. In practice, these algorithms are surprisingly robust to errors in the training data. In…

Machine Learning · Computer Science 2020-12-03 Kunal Talwar

Theoretical Foundations of Adversarially Robust Learning

Despite extraordinary progress, current machine learning systems have been shown to be brittle against adversarial examples: seemingly innocuous but carefully crafted perturbations of test examples that cause machine learning predictors to…

Machine Learning · Computer Science 2023-06-14 Omar Montasser