DiffDefense: Defending against Adversarial Attacks via Diffusion Models

Hondamunige Prasanna Silva; Lorenzo Seidenari; Alberto Del Bimbo

doi:10.1007/978-3-031-43153-1_36

DiffDefense: Defending against Adversarial Attacks via Diffusion Models

Machine Learning 2023-09-08 v1 Cryptography and Security Computer Vision and Pattern Recognition

Authors: Hondamunige Prasanna Silva , Lorenzo Seidenari , Alberto Del Bimbo

View on arXiv ↗ PDF ↗ DOI ↗

Abstract

This paper presents a novel reconstruction method that leverages Diffusion Models to protect machine learning classifiers against adversarial attacks, all without requiring any modifications to the classifiers themselves. The susceptibility of machine learning models to minor input perturbations renders them vulnerable to adversarial attacks. While diffusion-based methods are typically disregarded for adversarial defense due to their slow reverse process, this paper demonstrates that our proposed method offers robustness against adversarial threats while preserving clean accuracy, speed, and plug-and-play compatibility. Code at: https://github.com/HondamunigePrasannaSilva/DiffDefence.

Keywords

privacy attacks on machine learning adversarial examples adversarial attacks

Cite

@article{arxiv.2309.03702,
  title  = {DiffDefense: Defending against Adversarial Attacks via Diffusion Models},
  author = {Hondamunige Prasanna Silva and Lorenzo Seidenari and Alberto Del Bimbo},
  journal= {arXiv preprint arXiv:2309.03702},
  year   = {2023}
}

Comments

Paper published at ICIAP23

DiffDefense: Defending against Adversarial Attacks via Diffusion Models

Abstract

Keywords

Cite

Comments

Related papers