English

Transferable Sparse Adversarial Attack

Computer Vision and Pattern Recognition 2021-06-01 v1

Abstract

Deep neural networks have shown their vulnerability to adversarial attacks. In this paper, we focus on sparse adversarial attack based on the 0\ell_0 norm constraint, which can succeed by only modifying a few pixels of an image. Despite a high attack success rate, prior sparse attack methods achieve a low transferability under the black-box protocol due to overfitting the target model. Therefore, we introduce a generator architecture to alleviate the overfitting issue and thus efficiently craft transferable sparse adversarial examples. Specifically, the generator decouples the sparse perturbation into amplitude and position components. We carefully design a random quantization operator to optimize these two components jointly in an end-to-end way. The experiment shows that our method has improved the transferability by a large margin under a similar sparsity setting compared with state-of-the-art methods. Moreover, our method achieves superior inference speed, 700×\times faster than other optimization-based methods. The code is available at https://github.com/shaguopohuaizhe/TSAA.

Keywords

Cite

@article{arxiv.2105.14727,
  title  = {Transferable Sparse Adversarial Attack},
  author = {Ziwen He and Wei Wang and Jing Dong and Tieniu Tan},
  journal= {arXiv preprint arXiv:2105.14727},
  year   = {2021}
}