Sever: A Robust Meta-Algorithm for Stochastic Optimization
Abstract
In high dimensions, most machine learning methods are brittle to even a small fraction of structured outliers. To address this, we introduce a new meta-algorithm that can take in a base learner such as least squares or stochastic gradient descent, and harden the learner to be resistant to outliers. Our method, Sever, possesses strong theoretical guarantees yet is also highly scalable -- beyond running the base learner itself, it only requires computing the top singular vector of a certain matrix. We apply Sever on a drug design dataset and a spam classification dataset, and find that in both cases it has substantially greater robustness than several baselines. On the spam dataset, with corruptions, we achieved test error, compared to for the baselines, and error on the uncorrupted dataset. Similarly, on the drug design dataset, with corruptions, we achieved mean-squared error test error, compared to - for the baselines, and error on the uncorrupted dataset.
Cite
@article{arxiv.1803.02815,
title = {Sever: A Robust Meta-Algorithm for Stochastic Optimization},
author = {Ilias Diakonikolas and Gautam Kamath and Daniel M. Kane and Jerry Li and Jacob Steinhardt and Alistair Stewart},
journal= {arXiv preprint arXiv:1803.02815},
year = {2019}
}
Comments
To appear in ICML 2019