Hierarchical Quantized Autoencoders

Will Williams; Sam Ringer; Tom Ash; John Hughes; David MacLeod; Jamie Dougherty

Hierarchical Quantized Autoencoders

Machine Learning 2020-10-19 v3 Computer Vision and Pattern Recognition Neural and Evolutionary Computing Machine Learning

Authors: Will Williams , Sam Ringer , Tom Ash , John Hughes , David MacLeod , Jamie Dougherty

View on arXiv ↗ PDF ↗

Abstract

Despite progress in training neural networks for lossy image compression, current approaches fail to maintain both perceptual quality and abstract features at very low bitrates. Encouraged by recent success in learning discrete representations with Vector Quantized Variational Autoencoders (VQ-VAEs), we motivate the use of a hierarchy of VQ-VAEs to attain high factors of compression. We show that the combination of stochastic quantization and hierarchical latent structure aids likelihood-based image compression. This leads us to introduce a novel objective for training hierarchical VQ-VAEs. Our resulting scheme produces a Markovian series of latent variables that reconstruct images of high-perceptual quality which retain semantically meaningful features. We provide qualitative and quantitative evaluations on the CelebA and MNIST datasets.

Keywords

variational autoencoder image compression representation learning

Cite

@article{arxiv.2002.08111,
  title  = {Hierarchical Quantized Autoencoders},
  author = {Will Williams and Sam Ringer and Tom Ash and John Hughes and David MacLeod and Jamie Dougherty},
  journal= {arXiv preprint arXiv:2002.08111},
  year   = {2020}
}

Hierarchical Quantized Autoencoders

Abstract

Keywords

Cite

Related papers