English

Statistical inference using SGD

Machine Learning 2017-11-21 v2 Artificial Intelligence Optimization and Control Statistics Theory Machine Learning Statistics Theory

Abstract

We present a novel method for frequentist statistical inference in MM-estimation problems, based on stochastic gradient descent (SGD) with a fixed step size: we demonstrate that the average of such SGD sequences can be used for statistical inference, after proper scaling. An intuitive analysis using the Ornstein-Uhlenbeck process suggests that such averages are asymptotically normal. From a practical perspective, our SGD-based inference procedure is a first order method, and is well-suited for large scale problems. To show its merits, we apply it to both synthetic and real datasets, and demonstrate that its accuracy is comparable to classical statistical methods, while requiring potentially far less computation.

Keywords

Cite

@article{arxiv.1705.07477,
  title  = {Statistical inference using SGD},
  author = {Tianyang Li and Liu Liu and Anastasios Kyrillidis and Constantine Caramanis},
  journal= {arXiv preprint arXiv:1705.07477},
  year   = {2017}
}

Comments

To appear in AAAI 2018

R2 v1 2026-06-22T19:53:58.286Z