English

Cross-validation Confidence Intervals for Test Error

Machine Learning 2020-11-03 v2 Machine Learning Statistics Theory Statistics Theory

Abstract

This work develops central limit theorems for cross-validation and consistent estimators of its asymptotic variance under weak stability conditions on the learning algorithm. Together, these results provide practical, asymptotically-exact confidence intervals for kk-fold test error and valid, powerful hypothesis tests of whether one learning algorithm has smaller kk-fold test error than another. These results are also the first of their kind for the popular choice of leave-one-out cross-validation. In our real-data experiments with diverse learning algorithms, the resulting intervals and tests outperform the most popular alternative methods from the literature.

Keywords

Cite

@article{arxiv.2007.12671,
  title  = {Cross-validation Confidence Intervals for Test Error},
  author = {Pierre Bayle and Alexandre Bayle and Lucas Janson and Lester Mackey},
  journal= {arXiv preprint arXiv:2007.12671},
  year   = {2020}
}

Comments

34th Conference on Neural Information Processing Systems (NeurIPS 2020); 40 pages, 15 figures