English

Freeze-Thaw Bayesian Optimization

Machine Learning 2014-06-17 v1 Machine Learning

Abstract

In this paper we develop a dynamic form of Bayesian optimization for machine learning models with the goal of rapidly finding good hyperparameter settings. Our method uses the partial information gained during the training of a machine learning model in order to decide whether to pause training and start a new model, or resume the training of a previously-considered model. We specifically tailor our method to machine learning problems by developing a novel positive-definite covariance kernel to capture a variety of training curves. Furthermore, we develop a Gaussian process prior that scales gracefully with additional temporal observations. Finally, we provide an information-theoretic framework to automate the decision process. Experiments on several common machine learning models show that our approach is extremely effective in practice.

Keywords

Cite

@article{arxiv.1406.3896,
  title  = {Freeze-Thaw Bayesian Optimization},
  author = {Kevin Swersky and Jasper Snoek and Ryan Prescott Adams},
  journal= {arXiv preprint arXiv:1406.3896},
  year   = {2014}
}
R2 v1 2026-06-22T04:39:01.486Z