Predictive Hypothesis Identification

Machine Learning; Statistics Theory. 2009-12-30, v1

Abstract

While statistics focusses on hypothesis testing and on estimating (properties of) the true sampling distribution, in machine learning the performance of learning algorithms on future data is the primary issue. In this paper we bridge the gap with a general principle (PHI) that identifies hypotheses with best predictive performance. This includes predictive point and interval estimation, simple and composite hypothesis testing, (mixture) model selection, and others as special cases. For concrete instantiations we will recover well-known methods, variations thereof, and new ones. PHI nicely justifies, reconciles, and blends (a reparametrization invariant variation of) MAP, ML, MDL, and moment estimation. One particular feature of PHI is that it can genuinely deal with nested hypotheses.

Cite

@article{arxiv.0809.1270,
  title  = {Predictive Hypothesis Identification},
  author = {Marcus Hutter},
  journal= {arXiv preprint arXiv:0809.1270},
  year   = {2009}
}

Comments

16 pages
