The Kanerva Machine: A Generative Distributed Memory

Yan Wu; Greg Wayne; Alex Graves; Timothy Lillicrap

The Kanerva Machine: A Generative Distributed Memory

Machine Learning 2018-06-19 v3 Artificial Intelligence Machine Learning Neural and Evolutionary Computing

Authors: Yan Wu , Greg Wayne , Alex Graves , Timothy Lillicrap

Abstract

We present an end-to-end trained memory system that quickly adapts to new data and generates samples like them. Inspired by Kanerva's sparse distributed memory, it has a robust distributed reading and writing mechanism. The memory is analytically tractable, which enables optimal on-line compression via a Bayesian update-rule. We formulate it as a hierarchical conditional generative model, where memory provides a rich data-dependent prior distribution. Consequently, the top-down memory and bottom-up perception are combined to produce the code representing an observation. Empirically, we demonstrate that the adaptive memory significantly improves generative models trained on both the Omniglot and CIFAR datasets. Compared with the Differentiable Neural Computer (DNC) and its variants, our memory model has greater capacity and is significantly easier to train.

Keywords

generative adversarial networks deep learning multi-task learning

Cite

@article{arxiv.1804.01756,
  title  = {The Kanerva Machine: A Generative Distributed Memory},
  author = {Yan Wu and Greg Wayne and Alex Graves and Timothy Lillicrap},
  journal= {arXiv preprint arXiv:1804.01756},
  year   = {2018}
}

Comments

Published as a conference paper at ICLR 2018 (corrected typos in revision)

The Kanerva Machine: A Generative Distributed Memory

Abstract

Keywords

Cite

Comments

Related papers