Deep Kernel Learning

Andrew Gordon Wilson; Zhiting Hu; Ruslan Salakhutdinov; Eric P. Xing

Deep Kernel Learning

Machine Learning 2015-11-09 v1 Artificial Intelligence Methodology Machine Learning

Authors: Andrew Gordon Wilson , Zhiting Hu , Ruslan Salakhutdinov , Eric P. Xing

Abstract

We introduce scalable deep kernels, which combine the structural properties of deep learning architectures with the non-parametric flexibility of kernel methods. Specifically, we transform the inputs of a spectral mixture base kernel with a deep architecture, using local kernel interpolation, inducing points, and structure exploiting (Kronecker and Toeplitz) algebra for a scalable kernel representation. These closed-form kernels can be used as drop-in replacements for standard kernels, with benefits in expressive power and scalability. We jointly learn the properties of these kernels through the marginal likelihood of a Gaussian process. Inference and learning cost $O(n)$ for $n$ training points, and predictions cost $O(1)$ per test point. On a large and diverse collection of applications, including a dataset with 2 million examples, we show improved performance over scalable Gaussian processes with flexible kernel learning models, and stand-alone deep architectures.

Keywords

kernel learning deep neural networks deep learning

Cite

@article{arxiv.1511.02222,
  title  = {Deep Kernel Learning},
  author = {Andrew Gordon Wilson and Zhiting Hu and Ruslan Salakhutdinov and Eric P. Xing},
  journal= {arXiv preprint arXiv:1511.02222},
  year   = {2015}
}

Comments

19 pages, 6 figures

Deep Kernel Learning

Abstract

Keywords

Cite

Comments

Related papers