Approximate Sparse Linear Regression
Abstract
In the Sparse Linear Regression (SLR) problem, given a matrix and a -dimensional query , the goal is to compute a -sparse -dimensional vector such that the error is minimized. This problem is equivalent to the following geometric problem: given a set of points and a query point in dimensions, find the closest -dimensional subspace to , that is spanned by a subset of points in . In this paper, we present data-structures/algorithms and conditional lower bounds for several variants of this problem (such as finding the closest induced dimensional flat/simplex instead of a subspace). In particular, we present approximation algorithms for the online variants of the above problems with query time , which are of interest in the "low sparsity regime" where is small, e.g., or . For , this matches, up to polylogarithmic factors, the lower bound that relies on the affinely degenerate conjecture (i.e., deciding if points in contains points contained in a hyperplane takes time). Moreover, our algorithms involve formulating and solving several geometric subproblems, which we believe to be of independent interest.
Cite
@article{arxiv.1609.08739,
title = {Approximate Sparse Linear Regression},
author = {Sariel Har-Peled and Piotr Indyk and Sepideh Mahabadi},
journal= {arXiv preprint arXiv:1609.08739},
year = {2018}
}