English

Efficient List-Decodable Regression using Batches

Machine Learning 2022-11-24 v1 Information Theory math.IT Machine Learning

Abstract

We begin the study of list-decodable linear regression using batches. In this setting only an α(0,1]\alpha \in (0,1] fraction of the batches are genuine. Each genuine batch contains n\ge n i.i.d. samples from a common unknown distribution and the remaining batches may contain arbitrary or even adversarial samples. We derive a polynomial time algorithm that for any nΩ~(1/α)n\ge \tilde \Omega(1/\alpha) returns a list of size O(1/α2)\mathcal O(1/\alpha^2) such that one of the items in the list is close to the true regression parameter. The algorithm requires only O~(d/α2)\tilde{\mathcal{O}}(d/\alpha^2) genuine batches and works under fairly general assumptions on the distribution. The results demonstrate the utility of batch structure, which allows for the first polynomial time algorithm for list-decodable regression, which may be impossible for the non-batch setting, as suggested by a recent SQ lower bound \cite{diakonikolas2021statistical} for the non-batch setting.

Keywords

Cite

@article{arxiv.2211.12743,
  title  = {Efficient List-Decodable Regression using Batches},
  author = {Abhimanyu Das and Ayush Jain and Weihao Kong and Rajat Sen},
  journal= {arXiv preprint arXiv:2211.12743},
  year   = {2022}
}

Comments

First draft