Robust High Dimensional Expectation Maximization Algorithm via Trimmed Hard Thresholding

Di Wang; Xiangyu Guo; Shi Li; Jinhui Xu

doi:10.1007/s10994-020-05926-z


Machine Learning 2020-10-20 v1 Machine Learning



Abstract

In this paper, we study the problem of estimating latent variable models from arbitrarily corrupted samples in the high-dimensional setting (i.e., $d \gg n$), where the underlying parameter is assumed to be sparse. Specifically, we propose a method called Trimmed (Gradient) Expectation Maximization, which adds a gradient-trimming step to the Expectation step (E-step) and a hard-thresholding step to the Maximization step (M-step). We show that, under mild assumptions and with an appropriate initialization, the algorithm is robust to corruption and converges geometrically to the (near) optimal statistical rate when the fraction of corrupted samples $\epsilon$ is bounded by $\tilde{O}(\frac{1}{\sqrt{n}})$. Moreover, we apply our general framework to three canonical models: mixture of Gaussians, mixture of regressions, and linear regression with missing covariates. Our theory is supported by thorough numerical results.
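To make the two added steps concrete, the following is a minimal sketch of one iteration, assuming a symmetric two-component Gaussian mixture (samples $y = z\beta^* + \text{noise}$ with $z \in \{\pm 1\}$). The function names, the coordinate-wise trimmed mean, and the parameters `step`, `trim_frac`, and `s` are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def trimmed_mean(grads, trim_frac):
    """Coordinate-wise trimmed mean of per-sample gradients (shape n x d).

    In every coordinate, the floor(trim_frac * n) largest and smallest
    values are discarded before averaging (assumes trim_frac < 0.5).
    """
    n = grads.shape[0]
    k = int(np.floor(trim_frac * n))  # entries dropped from each tail
    sorted_grads = np.sort(grads, axis=0)
    return sorted_grads[k:n - k].mean(axis=0)


def hard_threshold(v, s):
    """Keep the s largest-magnitude entries of v and zero out the rest."""
    out = np.zeros_like(v)
    keep = np.argsort(np.abs(v))[v.size - s:]
    out[keep] = v[keep]
    return out


def trimmed_em_step(beta, Y, sigma, step, trim_frac, s):
    """One trimmed gradient EM iteration (illustrative sketch only)."""
    # E-step: 2 * w_i - 1 = tanh(<beta, y_i> / sigma^2), where w_i is the
    # posterior probability that sample i comes from the +beta component.
    signed_weight = np.tanh((Y @ beta) / sigma**2)
    # Per-sample gradient of the Q-function at beta; the 1 / sigma^2
    # factor is absorbed into the step size.
    grads = signed_weight[:, None] * Y - beta
    # Robust aggregation replaces the plain sample average.
    g = trimmed_mean(grads, trim_frac)
    # M-step: gradient ascent followed by projection onto s-sparse vectors.
    return hard_threshold(beta + step * g, s)
```

Calling `trimmed_em_step` repeatedly from a suitable initial `beta` gives the overall iteration. The trimming fraction would be chosen at least as large as the assumed corruption fraction $\epsilon$, and `s` somewhat larger than the true sparsity level.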

Keywords

mixture models and EM algorithm; sampling algorithms; sparse regression and regularization

Cite

@article{arxiv.2010.09576,
  title  = {Robust High Dimensional Expectation Maximization Algorithm via Trimmed Hard Thresholding},
  author = {Di Wang and Xiangyu Guo and Shi Li and Jinhui Xu},
  journal= {arXiv preprint arXiv:2010.09576},
  year   = {2020}
}

Comments

Accepted at Machine Learning
