Sparse Regression via Range Counting
Abstract
The sparse regression problem, also known as best subset selection problem, can be cast as follows: Given a set of points in , a point , and an integer , find an affine combination of at most points of that is nearest to . We describe a -time randomized -approximation algorithm for this problem with and constant. This is the first algorithm for this problem running in time . Its running time is similar to the query time of a data structure recently proposed by Har-Peled, Indyk, and Mahabadi (ICALP'18), while not requiring any preprocessing. Up to polylogarithmic factors, it matches a conditional lower bound relying on a conjecture about affine degeneracy testing. In the special case where , we also provide a simple -time deterministic exact algorithm, for any . Finally, we show how to adapt the approximation algorithm for the sparse linear regression and sparse convex regression problems with the same running time, up to polylogarithmic factors.
Cite
@article{arxiv.1908.00351,
title = {Sparse Regression via Range Counting},
author = {Jean Cardinal and Aurélien Ooms},
journal= {arXiv preprint arXiv:1908.00351},
year = {2020}
}