English

Universal adversarial examples in speech command classification

Machine Learning 2021-02-16 v4 Audio and Speech Processing Machine Learning

Abstract

Adversarial examples are inputs intentionally perturbed with the aim of forcing a machine learning model to produce a wrong prediction, while the changes are not easily detectable by a human. Although this topic has been intensively studied in the image domain, classification tasks in the audio domain have received less attention. In this paper we address the existence of universal perturbations for speech command classification. We provide evidence that universal attacks can be generated for speech command classification tasks, which are able to generalize across different models to a significant extent. Additionally, a novel analytical framework is proposed for the evaluation of universal perturbations under different levels of universality, demonstrating that the feasibility of generating effective perturbations decreases as the universality level increases. Finally, we propose a more detailed and rigorous framework to measure the amount of distortion introduced by the perturbations, demonstrating that the methods employed by convention are not realistic in audio-based problems.

Keywords

Cite

@article{arxiv.1911.10182,
  title  = {Universal adversarial examples in speech command classification},
  author = {Jon Vadillo and Roberto Santana},
  journal= {arXiv preprint arXiv:1911.10182},
  year   = {2021}
}

Comments

14 pages, 2 figures, 4 tables; Revised external links

R2 v1 2026-06-23T12:24:49.187Z