Adversarially Robust Training through Structured Gradient Regularization
Machine Learning
2018-05-23 v1 Machine Learning
Abstract
We propose a novel data-dependent structured gradient regularizer to increase the robustness of neural networks vis-a-vis adversarial perturbations. Our regularizer can be derived as a controlled approximation from first principles, leveraging the fundamental link between training with noise and regularization. It adds very little computational overhead during learning and is simple to implement generically in standard deep learning frameworks. Our experiments provide strong evidence that structured gradient regularization can act as an effective first line of defense against attacks based on low-level signal corruption.
Cite
@article{arxiv.1805.08736,
title = {Adversarially Robust Training through Structured Gradient Regularization},
author = {Kevin Roth and Aurelien Lucchi and Sebastian Nowozin and Thomas Hofmann},
journal= {arXiv preprint arXiv:1805.08736},
year = {2018}
}