English

Detecting adversarial attacks on random samples

Probability 2024-10-28 v2

Abstract

This paper studies the problem of detecting adversarial perturbations in a sequence of observations. Given a data sample X1,,XnX_1, \ldots, X_n drawn from a standard normal distribution, an adversary, after observing the sample, can perturb each observation by a fixed magnitude or leave it unchanged. We explore the relationship between the perturbation magnitude, the sparsity of the perturbation, and the detectability of the adversary's actions, establishing precise thresholds for when detection becomes impossible.

Keywords

Cite

@article{arxiv.2408.06166,
  title  = {Detecting adversarial attacks on random samples},
  author = {Gleb Smirnov},
  journal= {arXiv preprint arXiv:2408.06166},
  year   = {2024}
}

Comments

title changed; introduction expanded; new results about spherical attacks