Open Loop Hyperparameter Optimization and Determinantal Point Processes
Abstract
Driven by the need for parallelizable hyperparameter optimization methods, this paper studies \emph{open loop} search methods: sequences that are predetermined and can be generated before a single configuration is evaluated. Examples include grid search, uniform random search, low discrepancy sequences, and other sampling distributions. In particular, we propose the use of -determinantal point processes in hyperparameter optimization via random search. Compared to conventional uniform random search where hyperparameter settings are sampled independently, a -DPP promotes diversity. We describe an approach that transforms hyperparameter search spaces for efficient use with a -DPP. In addition, we introduce a novel Metropolis-Hastings algorithm which can sample from -DPPs defined over any space from which uniform samples can be drawn, including spaces with a mixture of discrete and continuous dimensions or tree structure. Our experiments show significant benefits in realistic scenarios with a limited budget for training supervised learners, whether in serial or parallel.
Cite
@article{arxiv.1706.01566,
title = {Open Loop Hyperparameter Optimization and Determinantal Point Processes},
author = {Jesse Dodge and Kevin Jamieson and Noah A. Smith},
journal= {arXiv preprint arXiv:1706.01566},
year = {2019}
}