Improving Hyperparameter Optimization by Planning Ahead

Hadi S. Jomaa; Jonas Falkner; Lars Schmidt-Thieme

Improving Hyperparameter Optimization by Planning Ahead

Machine Learning 2021-10-18 v1

Authors: Hadi S. Jomaa , Jonas Falkner , Lars Schmidt-Thieme

Abstract

Hyperparameter optimization (HPO) is generally treated as a bi-level optimization problem that involves fitting a (probabilistic) surrogate model to a set of observed hyperparameter responses, e.g. validation loss, and consequently maximizing an acquisition function using a surrogate model to identify good hyperparameter candidates for evaluation. The choice of a surrogate and/or acquisition function can be further improved via knowledge transfer across related tasks. In this paper, we propose a novel transfer learning approach, defined within the context of model-based reinforcement learning, where we represent the surrogate as an ensemble of probabilistic models that allows trajectory sampling. We further propose a new variant of model predictive control which employs a simple look-ahead strategy as a policy that optimizes a sequence of actions, representing hyperparameter candidates to expedite HPO. Our experiments on three meta-datasets comparing to state-of-the-art HPO algorithms including a model-free reinforcement learning approach show that the proposed method can outperform all baselines by exploiting a simple planning-based policy.

Keywords

hyperparameter optimization algorithm selection multi-objective optimization

Cite

@article{arxiv.2110.08028,
  title  = {Improving Hyperparameter Optimization by Planning Ahead},
  author = {Hadi S. Jomaa and Jonas Falkner and Lars Schmidt-Thieme},
  journal= {arXiv preprint arXiv:2110.08028},
  year   = {2021}
}

Improving Hyperparameter Optimization by Planning Ahead

Abstract

Keywords

Cite

Related papers