English

Hyperparameter Learning for Conditional Kernel Mean Embeddings with Rademacher Complexity Bounds

Machine Learning 2018-11-09 v3 Machine Learning

Abstract

Conditional kernel mean embeddings are nonparametric models that encode conditional expectations in a reproducing kernel Hilbert space. While they provide a flexible and powerful framework for probabilistic inference, their performance is highly dependent on the choice of kernel and regularization hyperparameters. Nevertheless, current hyperparameter tuning methods predominantly rely on expensive cross validation or heuristics that is not optimized for the inference task. For conditional kernel mean embeddings with categorical targets and arbitrary inputs, we propose a hyperparameter learning framework based on Rademacher complexity bounds to prevent overfitting by balancing data fit against model complexity. Our approach only requires batch updates, allowing scalable kernel hyperparameter tuning without invoking kernel approximations. Experiments demonstrate that our learning framework outperforms competing methods, and can be further extended to incorporate and learn deep neural network weights to improve generalization.

Keywords

Cite

@article{arxiv.1809.00175,
  title  = {Hyperparameter Learning for Conditional Kernel Mean Embeddings with Rademacher Complexity Bounds},
  author = {Kelvin Hsu and Richard Nock and Fabio Ramos},
  journal= {arXiv preprint arXiv:1809.00175},
  year   = {2018}
}

Comments

Best Student Machine Learning Paper Award Winner at ECML-PKDD 2018 (European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases)