Freeze-Thaw Bayesian Optimization

Kevin Swersky; Jasper Snoek; Ryan Prescott Adams

Freeze-Thaw Bayesian Optimization

Machine Learning 2014-06-17 v1 Machine Learning

Authors: Kevin Swersky , Jasper Snoek , Ryan Prescott Adams

Abstract

In this paper we develop a dynamic form of Bayesian optimization for machine learning models with the goal of rapidly finding good hyperparameter settings. Our method uses the partial information gained during the training of a machine learning model in order to decide whether to pause training and start a new model, or resume the training of a previously-considered model. We specifically tailor our method to machine learning problems by developing a novel positive-definite covariance kernel to capture a variety of training curves. Furthermore, we develop a Gaussian process prior that scales gracefully with additional temporal observations. Finally, we provide an information-theoretic framework to automate the decision process. Experiments on several common machine learning models show that our approach is extremely effective in practice.

Keywords

bayesian optimization bayesian variable selection bayesian inference

Cite

@article{arxiv.1406.3896,
  title  = {Freeze-Thaw Bayesian Optimization},
  author = {Kevin Swersky and Jasper Snoek and Ryan Prescott Adams},
  journal= {arXiv preprint arXiv:1406.3896},
  year   = {2014}
}

Freeze-Thaw Bayesian Optimization

Abstract

Keywords

Cite

Related papers