English

Learning from Attacks: Attacking Variational Autoencoder for Improving Image Classification

Machine Learning 2022-04-12 v1 Computer Vision and Pattern Recognition Image and Video Processing

Abstract

Adversarial attacks are often considered as threats to the robustness of Deep Neural Networks (DNNs). Various defending techniques have been developed to mitigate the potential negative impact of adversarial attacks against task predictions. This work analyzes adversarial attacks from a different perspective. Namely, adversarial examples contain implicit information that is useful to the predictions i.e., image classification, and treat the adversarial attacks against DNNs for data self-expression as extracted abstract representations that are capable of facilitating specific learning tasks. We propose an algorithmic framework that leverages the advantages of the DNNs for data self-expression and task-specific predictions, to improve image classification. The framework jointly learns a DNN for attacking Variational Autoencoder (VAE) networks and a DNN for classification, coined as Attacking VAE for Improve Classification (AVIC). The experiment results show that AVIC can achieve higher accuracy on standard datasets compared to the training with clean examples and the traditional adversarial training.

Keywords

Cite

@article{arxiv.2203.07027,
  title  = {Learning from Attacks: Attacking Variational Autoencoder for Improving Image Classification},
  author = {Jianzhang Zheng and Fan Yang and Hao Shen and Xuan Tang and Mingsong Chen and Liang Song and Xian Wei},
  journal= {arXiv preprint arXiv:2203.07027},
  year   = {2022}
}
R2 v1 2026-06-24T10:12:13.861Z