Multivariate Variational Autoencoder

Mehmet Can Yavuz

Multivariate Variational Autoencoder

Machine Learning 2025-12-02 v2 Computer Vision and Pattern Recognition

Authors: Mehmet Can Yavuz

Abstract

Learning latent representations that are simultaneously expressive, geometrically well-structured, and reliably calibrated remains a central challenge for Variational Autoencoders (VAEs). Standard VAEs typically assume a diagonal Gaussian posterior, which simplifies optimization but rules out correlated uncertainty and often yields entangled or redundant latent dimensions. We introduce the Multivariate Variational Autoencoder (MVAE), a tractable full-covariance extension of the VAE that augments the encoder with sample-specific diagonal scales and a global coupling matrix. This induces a multivariate Gaussian posterior of the form $N(\mu_\phi(x), C \operatorname{diag}(\sigma_\phi^2(x)) C^\top)$ , enabling correlated latent factors while preserving a closed-form KL divergence and a simple reparameterization path. Beyond likelihood, we propose a multi-criterion evaluation protocol that jointly assesses reconstruction quality (MSE, ELBO), downstream discrimination (linear probes), probabilistic calibration (NLL, Brier, ECE), and unsupervised structure (NMI, ARI). Across Larochelle-style MNIST variants, Fashion-MNIST, and CIFAR-10/100, MVAE consistently matches or outperforms diagonal-covariance VAEs of comparable capacity, with particularly notable gains in calibration and clustering metrics at both low and high latent dimensions. Qualitative analyses further show smoother, more semantically coherent latent traversals and sharper reconstructions. All code, dataset splits, and evaluation utilities are released to facilitate reproducible comparison and future extensions of multivariate posterior models.

Keywords

variational autoencoder masked autoencoder neural operator

Cite

@article{arxiv.2511.07472,
  title  = {Multivariate Variational Autoencoder},
  author = {Mehmet Can Yavuz},
  journal= {arXiv preprint arXiv:2511.07472},
  year   = {2025}
}

Multivariate Variational Autoencoder

Abstract

Keywords

Cite

Related papers