The Adversarial Consistency of Surrogate Risks for Binary Classification

Natalie Frank; Jonathan Niles-Weed

The Adversarial Consistency of Surrogate Risks for Binary Classification

Machine Learning 2025-10-09 v3 Statistics Theory Statistics Theory

Authors: Natalie Frank , Jonathan Niles-Weed

Abstract

We study the consistency of surrogate risks for robust binary classification. It is common to learn robust classifiers by adversarial training, which seeks to minimize the expected $0$ - $1$ loss when each example can be maliciously corrupted within a small ball. We give a simple and complete characterization of the set of surrogate loss functions that are \emph{consistent}, i.e., that can replace the $0$ - $1$ loss without affecting the minimizing sequences of the original adversarial risk, for any data distribution. We also prove a quantitative version of adversarial consistency for the $\rho$ -margin loss. Our results reveal that the class of adversarially consistent surrogates is substantially smaller than in the standard setting, where many common surrogates are known to be consistent.

Keywords

adversarial robustness classification probability theory

Cite

@article{arxiv.2305.09956,
  title  = {The Adversarial Consistency of Surrogate Risks for Binary Classification},
  author = {Natalie Frank and Jonathan Niles-Weed},
  journal= {arXiv preprint arXiv:2305.09956},
  year   = {2025}
}

Comments

17 pages, published in NeurIps 2023. version 3: added acknowledgements, no other changes. version 2: reorganized Section 4 and added proofs of the approximate complimentary slackness theorems. arXiv admin note: text overlap with arXiv:2206.09099

The Adversarial Consistency of Surrogate Risks for Binary Classification

Abstract

Keywords

Cite

Comments

Related papers