English

Scalable Algorithms for Learning High-Dimensional Linear Mixed Models

Machine Learning 2019-03-08 v1 Machine Learning

Abstract

Linear mixed models (LMMs) are used extensively to model dependecies of observations in linear regression and are used extensively in many application areas. Parameter estimation for LMMs can be computationally prohibitive on big data. State-of-the-art learning algorithms require computational complexity which depends at least linearly on the dimension pp of the covariates, and often use heuristics that do not offer theoretical guarantees. We present scalable algorithms for learning high-dimensional LMMs with sublinear computational complexity dependence on pp. Key to our approach are novel dual estimators which use only kernel functions of the data, and fast computational techniques based on the subsampled randomized Hadamard transform. We provide theoretical guarantees for our learning algorithms, demonstrating the robustness of parameter estimation. Finally, we complement the theory with experiments on large synthetic and real data.

Keywords

Cite

@article{arxiv.1803.04431,
  title  = {Scalable Algorithms for Learning High-Dimensional Linear Mixed Models},
  author = {Zilong Tan and Kimberly Roche and Xiang Zhou and Sayan Mukherjee},
  journal= {arXiv preprint arXiv:1803.04431},
  year   = {2019}
}