English

Learning Multivariate Log-concave Distributions

Machine Learning 2017-06-07 v2 Information Theory math.IT Statistics Theory Statistics Theory

Abstract

We study the problem of estimating multivariate log-concave probability density functions. We prove the first sample complexity upper bound for learning log-concave densities on Rd\mathbb{R}^d, for all d1d \geq 1. Prior to our work, no upper bound on the sample complexity of this learning problem was known for the case of d>3d>3. In more detail, we give an estimator that, for any d1d \ge 1 and ϵ>0\epsilon>0, draws O~d((1/ϵ)(d+5)/2)\tilde{O}_d \left( (1/\epsilon)^{(d+5)/2} \right) samples from an unknown target log-concave density on Rd\mathbb{R}^d, and outputs a hypothesis that (with high probability) is ϵ\epsilon-close to the target, in total variation distance. Our upper bound on the sample complexity comes close to the known lower bound of Ωd((1/ϵ)(d+1)/2)\Omega_d \left( (1/\epsilon)^{(d+1)/2} \right) for this problem.

Keywords

Cite

@article{arxiv.1605.08188,
  title  = {Learning Multivariate Log-concave Distributions},
  author = {Ilias Diakonikolas and Daniel M. Kane and Alistair Stewart},
  journal= {arXiv preprint arXiv:1605.08188},
  year   = {2017}
}

Comments

To appear in COLT 2017