English

Convolutional Gaussian Processes

Machine Learning 2017-09-07 v1 Machine Learning

Abstract

We present a practical way of introducing convolutional structure into Gaussian processes, making them more suited to high-dimensional inputs like images. The main contribution of our work is the construction of an inter-domain inducing point approximation that is well-tailored to the convolutional kernel. This allows us to gain the generalisation benefit of a convolutional kernel, together with fast but accurate posterior inference. We investigate several variations of the convolutional kernel, and apply it to MNIST and CIFAR-10, which have both been known to be challenging for Gaussian processes. We also show how the marginal likelihood can be used to find an optimal weighting between convolutional and RBF kernels to further improve performance. We hope that this illustration of the usefulness of a marginal likelihood will help automate discovering architectures in larger models.

Keywords

Cite

@article{arxiv.1709.01894,
  title  = {Convolutional Gaussian Processes},
  author = {Mark van der Wilk and Carl Edward Rasmussen and James Hensman},
  journal= {arXiv preprint arXiv:1709.01894},
  year   = {2017}
}

Comments

To appear in Advances in Neural Information Processing Systems 30 (NIPS 2017)