English

Sparse Linear Regression With Missing Data

Machine Learning 2015-03-31 v1 Machine Learning Methodology

Abstract

This paper proposes a fast and accurate method for sparse regression in the presence of missing data. The underlying statistical model encapsulates the low-dimensional structure of the incomplete data matrix and the sparsity of the regression coefficients, and the proposed algorithm jointly learns the low-dimensional structure of the data and a linear regressor with sparse coefficients. The proposed stochastic optimization method, Sparse Linear Regression with Missing Data (SLRM), performs an alternating minimization procedure and scales well with the problem size. Large deviation inequalities shed light on the impact of the various problem-dependent parameters on the expected squared loss of the learned regressor. Extensive simulations on both synthetic and real datasets show that SLRM performs better than competing algorithms in a variety of contexts.

Keywords

Cite

@article{arxiv.1503.08348,
  title  = {Sparse Linear Regression With Missing Data},
  author = {Ravi Ganti and Rebecca M. Willett},
  journal= {arXiv preprint arXiv:1503.08348},
  year   = {2015}
}

Comments

14 pages, 7 figures

R2 v1 2026-06-22T09:04:38.125Z