Convolutional Gaussian Processes

Mark van der Wilk; Carl Edward Rasmussen; James Hensman

Machine Learning 2017-09-07 v1 Machine Learning

Abstract

We present a practical way of introducing convolutional structure into Gaussian processes, making them more suited to high-dimensional inputs like images. The main contribution of our work is the construction of an inter-domain inducing point approximation that is well-tailored to the convolutional kernel. This allows us to gain the generalisation benefit of a convolutional kernel, together with fast but accurate posterior inference. We investigate several variations of the convolutional kernel, and apply it to MNIST and CIFAR-10, which have both been known to be challenging for Gaussian processes. We also show how the marginal likelihood can be used to find an optimal weighting between convolutional and RBF kernels to further improve performance. We hope that this illustration of the usefulness of a marginal likelihood will help automate discovering architectures in larger models.
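The abstract does not spell out the kernel, so the following is only a minimal sketch of one common reading: a convolutional kernel built by summing a patch-level kernel over every pair of patches from two images, then mixed with an ordinary RBF kernel on the full images. All function names, the 2x2 patch size, and the hand-picked weights are assumptions made for illustration. In the paper the weighting is chosen via the marginal likelihood, and the inter-domain inducing point approximation is not shown here.

```python
import math

def extract_patches(image, size):
    """Return every size x size patch of a 2-D image (list of rows), flattened."""
    h, w = len(image), len(image[0])
    return [
        [image[r + i][c + j] for i in range(size) for j in range(size)]
        for r in range(h - size + 1)
        for c in range(w - size + 1)
    ]

def rbf(a, b, lengthscale=1.0):
    """Squared-exponential (RBF) kernel between two equal-length vectors."""
    sq_dist = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-0.5 * sq_dist / lengthscale ** 2)

def conv_kernel(img_a, img_b, size=2):
    """Assumed construction: sum the patch kernel over all pairs of patches."""
    patches_a = extract_patches(img_a, size)
    patches_b = extract_patches(img_b, size)
    return sum(rbf(p, q) for p in patches_a for q in patches_b)

def combined_kernel(img_a, img_b, weight_conv, weight_rbf, size=2):
    """Weighted sum of the convolutional kernel and an RBF kernel on whole images.

    The weights are fixed by hand here; they stand in for hyperparameters
    that would normally be learned.
    """
    flat_a = [v for row in img_a for v in row]
    flat_b = [v for row in img_b for v in row]
    return (weight_conv * conv_kernel(img_a, img_b, size)
            + weight_rbf * rbf(flat_a, flat_b))

# Two tiny 3x3 "images" used only to exercise the functions.
blank = [[0.0] * 3 for _ in range(3)]
dot = [[0.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 0.0]]
k_mixed = combined_kernel(blank, dot, weight_conv=0.5, weight_rbf=2.0)
```

Because the patch kernel is shared across all image locations, the sum responds to a feature wherever it appears, which is one way to read the generalisation benefit the abstract mentions. The naive double sum costs one kernel evaluation per pair of patches, which illustrates why an efficient approximation is needed at realistic image sizes.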

Keywords

gaussian processes, gaussian graphical model estimation, kernel methods

Cite

@article{arxiv.1709.01894,
  title  = {Convolutional Gaussian Processes},
  author = {Mark van der Wilk and Carl Edward Rasmussen and James Hensman},
  journal= {arXiv preprint arXiv:1709.01894},
  year   = {2017}
}

Comments

To appear in Advances in Neural Information Processing Systems 30 (NIPS 2017)
