English

DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust Classifiers

Computer Vision and Pattern Recognition 2024-05-30 v2 Machine Learning

Abstract

We introduce DiffAug, a simple and efficient diffusion-based augmentation technique to train image classifiers for the crucial yet challenging goal of improved classifier robustness. Applying DiffAug to a given example consists of one forward-diffusion step followed by one reverse-diffusion step. Using both ResNet-50 and Vision Transformer architectures, we comprehensively evaluate classifiers trained with DiffAug and demonstrate the surprising effectiveness of single-step reverse diffusion in improving robustness to covariate shifts, certified adversarial accuracy and out of distribution detection. When we combine DiffAug with other augmentations such as AugMix and DeepAugment we demonstrate further improved robustness. Finally, building on this approach, we also improve classifier-guided diffusion wherein we observe improvements in: (i) classifier-generalization, (ii) gradient quality (i.e., improved perceptual alignment) and (iii) image generation performance. We thus introduce a computationally efficient technique for training with improved robustness that does not require any additional data, and effectively complements existing augmentation approaches.

Keywords

Cite

@article{arxiv.2306.09192,
  title  = {DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust Classifiers},
  author = {Chandramouli Sastry and Sri Harsha Dumpala and Sageev Oore},
  journal= {arXiv preprint arXiv:2306.09192},
  year   = {2024}
}

Comments

Shorter version of this work was accepted in the CVPR 2024 Workshop on Synthetic Data for Computer Vision