Variational Hyper-Encoding Networks

Phuoc Nguyen; Truyen Tran; Sunil Gupta; Santu Rana; Hieu-Chi Dam; Svetha Venkatesh

Variational Hyper-Encoding Networks

Machine Learning 2022-05-16 v2 Machine Learning

Authors: Phuoc Nguyen , Truyen Tran , Sunil Gupta , Santu Rana , Hieu-Chi Dam , Svetha Venkatesh

Abstract

We propose a framework called HyperVAE for encoding distributions of distributions. When a target distribution is modeled by a VAE, its neural network parameters \theta is drawn from a distribution p(\theta) which is modeled by a hyper-level VAE. We propose a variational inference using Gaussian mixture models to implicitly encode the parameters \theta into a low dimensional Gaussian distribution. Given a target distribution, we predict the posterior distribution of the latent code, then use a matrix-network decoder to generate a posterior distribution q(\theta). HyperVAE can encode the parameters \theta in full in contrast to common hyper-networks practices, which generate only the scale and bias vectors as target-network parameters. Thus HyperVAE preserves much more information about the model for each task in the latent space. We discuss HyperVAE using the minimum description length (MDL) principle and show that it helps HyperVAE to generalize. We evaluate HyperVAE in density estimation tasks, outlier detection and discovery of novel design classes, demonstrating its efficacy.

Keywords

variational autoencoders bayesian inference density estimation and sampling

Cite

@article{arxiv.2005.08482,
  title  = {Variational Hyper-Encoding Networks},
  author = {Phuoc Nguyen and Truyen Tran and Sunil Gupta and Santu Rana and Hieu-Chi Dam and Svetha Venkatesh},
  journal= {arXiv preprint arXiv:2005.08482},
  year   = {2022}
}

Comments

Accepted ECML-2021

Variational Hyper-Encoding Networks

Abstract

Keywords

Cite

Comments

Related papers