Algorithmic statistics revisited

Nikolay Vereshchagin; Alexander Shen

Algorithmic statistics revisited

Information Theory 2015-04-28 v2 math.IT

Authors: Nikolay Vereshchagin , Alexander Shen

Abstract

The mission of statistics is to provide adequate statistical hypotheses (models) for observed data. But what is an "adequate" model? To answer this question, one needs to use the notions of algorithmic information theory. It turns out that for every data string $x$ one can naturally define "stochasticity profile", a curve that represents a trade-off between complexity of a model and its adequacy. This curve has four different equivalent definitions in terms of (1)~randomness deficiency, (2)~minimal description length, (3)~position in the lists of simple strings and (4)~Kolmogorov complexity with decompression time bounded by busy beaver function. We present a survey of the corresponding definitions and results relating them to each other.

Keywords

algorithm selection optimization algorithm information theory

Cite

@article{arxiv.1504.04950,
  title  = {Algorithmic statistics revisited},
  author = {Nikolay Vereshchagin and Alexander Shen},
  journal= {arXiv preprint arXiv:1504.04950},
  year   = {2015}
}

Algorithmic statistics revisited

Abstract

Keywords

Cite

Related papers