Related papers: PCA-VAE: Differentiable Subspace Quantization with…

VP-VAE: Rethinking Vector Quantization via Adaptive Vector Perturbation

Vector Quantized Variational Autoencoders (VQ-VAEs) are fundamental to modern generative modeling, yet they often suffer from training instability and "codebook collapse" due to the inherent coupling of representation learning and discrete…

Machine Learning · Computer Science 2026-02-20 Linwei Zhai, Han Ding, Mingzhi Lin, Cui Zhao, Fei Wang, Ge Wang, Wang Zhi, Wei Xi

Noise Contrastive Variational Autoencoders

We take steps towards understanding the "posterior collapse (PC)" difficulty in variational autoencoders (VAEs), i.e. a degenerate optimum in which the latent codes become independent of their corresponding inputs. We rely on calculus of…

Machine Learning · Computer Science 2019-08-01 Octavian-Eugen Ganea, Yashas Annadani, Gary Bécigneul

SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization

One noted issue of vector-quantized variational autoencoder (VQ-VAE) is that the learned discrete representation uses only a fraction of the full capacity of the codebook, also known as codebook collapse. We hypothesize that the training…

Machine Learning · Computer Science 2022-06-10 Yuhta Takida, Takashi Shibuya, WeiHsiang Liao, Chieh-Hsin Lai, Junki Ohmura, Toshimitsu Uesaka, Naoki Murata, Shusuke Takahashi, Toshiyuki Kumakura, Yuki Mitsufuji

Learning Product Codebooks using Vector Quantized Autoencoders for Image Retrieval

Vector-Quantized Variational Autoencoders (VQ-VAE)[1] provide an unsupervised model for learning discrete representations by combining vector quantization and autoencoders. In this paper, we study the use of VQ-VAE for representation…

Image and Video Processing · Electrical Eng. & Systems 2019-03-05 Hanwei Wu, Markus Flierl

Variational Autoencoders Pursue PCA Directions (by Accident)

The Variational Autoencoder (VAE) is a powerful architecture capable of representation learning and generative modeling. When it comes to learning interpretable (disentangled) representations, VAE and its variants show unparalleled…

Machine Learning · Computer Science 2019-04-17 Michal Rolinek, Dominik Zietlow, Georg Martius

Quantitative Understanding of VAE as a Non-linearly Scaled Isometric Embedding

Variational autoencoder (VAE) estimates the posterior parameters (mean and variance) of latent variables corresponding to each input data. While it is used for many tasks, the transparency of the model is still an underlying issue. This…

Machine Learning · Statistics 2023-02-23 Akira Nakagawa, Keizo Kato, Taiji Suzuki

LL-VQ-VAE: Learnable Lattice Vector-Quantization For Efficient Representations

In this paper we introduce learnable lattice vector quantization and demonstrate its effectiveness for learning discrete representations. Our method, termed LL-VQ-VAE, replaces the vector quantization layer in VQ-VAE with lattice-based…

Machine Learning · Computer Science 2023-10-17 Ahmed Khalil, Robert Piechocki, Raul Santos-Rodriguez

HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes

Vector quantization (VQ) is a technique to deterministically learn features with discrete codebook representations. It is commonly performed with a variational autoencoding model, VQ-VAE, which can be further extended to hierarchical…

Machine Learning · Computer Science 2024-03-29 Yuhta Takida, Yukara Ikemiya, Takashi Shibuya, Kazuki Shimada, Woosung Choi, Chieh-Hsin Lai, Naoki Murata, Toshimitsu Uesaka, Kengo Uchida, Wei-Hsiang Liao, Yuki Mitsufuji

Variance Constrained Autoencoding

Recent state-of-the-art autoencoder based generative models have an encoder-decoder structure and learn a latent representation with a pre-defined distribution that can be sampled from. Implementing the encoder networks of these models in a…

Machine Learning · Computer Science 2020-05-11 D. T. Braithwaite, M. O'Connor, W. B. Kleijn

Online Clustered Codebook

Vector Quantisation (VQ) is experiencing a comeback in machine learning, where it is increasingly used in representation learning. However, optimizing the codevectors in existing VQ-VAE is not entirely trivial. A problem is codebook…

Computer Vision and Pattern Recognition · Computer Science 2023-07-31 Chuanxia Zheng, Andrea Vedaldi

Probabilistic Autoencoder

Principal Component Analysis (PCA) minimizes the reconstruction error given a class of linear models of fixed component dimensionality. Probabilistic PCA adds a probabilistic structure by learning the probability distribution of the PCA…

Machine Learning · Computer Science 2022-09-20 Vanessa Böhm, Uroš Seljak

Neural Discrete Representation Learning

Learning useful representations without supervision remains a key challenge in machine learning. In this paper, we propose a simple yet powerful generative model that learns such discrete representations. Our model, the Vector…

Machine Learning · Computer Science 2018-05-31 Aaron van den Oord, Oriol Vinyals, Koray Kavukcuoglu

Robust Training of Vector Quantized Bottleneck Models

In this paper we demonstrate methods for reliable and efficient training of discrete representation using Vector-Quantized Variational Auto-Encoder models (VQ-VAEs). Discrete latent variable models have been shown to learn nontrivial…

Machine Learning · Computer Science 2024-09-13 Adrian Łańcucki, Jan Chorowski, Guillaume Sanchez, Ricard Marxer, Nanxin Chen, Hans J. G. A. Dolfing, Sameer Khurana, Tanel Alumäe, Antoine Laurent

Disentangling Variational Autoencoders

A variational autoencoder (VAE) is a probabilistic machine learning framework for posterior inference that projects an input set of high-dimensional data to a lower-dimensional, latent space. The latent space learned with a VAE offers…

Machine Learning · Computer Science 2022-11-16 Rafael Pastrana

Learning Stable Representations with Full Encoder

While the beta-VAE family is aiming to find disentangled representations and acquire human-interpretable generative factors, like what an ICA (from the linear domain) does, we propose Full Encoder, a novel unified autoencoder framework as a…

Machine Learning · Computer Science 2021-07-14 Zhouzheng Li, Kun Feng

A Stable Variational Autoencoder for Text Modelling

Variational Autoencoder (VAE) is a powerful method for learning representations of high-dimensional data. However, VAEs can suffer from an issue known as latent variable collapse (or KL loss vanishing), where the posterior collapses to the…

Computation and Language · Computer Science 2019-11-14 Ruizhe Li, Xiao Li, Chenghua Lin, Matthew Collinson, Rui Mao

Beyond Stationarity: Rethinking Codebook Collapse in Vector Quantization

Vector Quantization (VQ) underpins many modern generative frameworks such as VQ-VAE, VQ-GAN, and latent diffusion models. Yet, it suffers from the persistent problem of codebook collapse, where a large fraction of code vectors remains…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Hao Lu, Onur C. Koyun, Yongxin Guo, Zhengjie Zhu, Abbas Alili, Metin Nafi Gurcan

Restructuring Vector Quantization with the Rotation Trick

Vector Quantized Variational AutoEncoders (VQ-VAEs) are designed to compress a continuous input to a discrete latent space and reconstruct it with minimal distortion. They operate by maintaining a set of vectors -- often referred to as the…

Machine Learning · Computer Science 2025-03-18 Christopher Fifty, Ronald G. Junkins, Dennis Duan, Aniketh Iyengar, Jerry W. Liu, Ehsan Amid, Sebastian Thrun, Christopher Ré

Beta-VAE has 2 Behaviors: PCA or ICA?

Beta-VAE is a very classical model for disentangled representation learning; the use of an expanding bottleneck that allows information into the decoder gradually is key to representation disentanglement as well as high-quality…

Machine Learning · Computer Science 2023-03-28 Zhouzheng Li, Hao Liu

ControlVAE: Controllable Variational Autoencoder

Variational Autoencoders (VAE) and their variants have been widely used in a variety of applications, such as dialog generation, image generation and disentangled representation learning. However, the existing VAE models have some…

Machine Learning · Computer Science 2020-06-23 Huajie Shao, Shuochao Yao, Dachun Sun, Aston Zhang, Shengzhong Liu, Dongxin Liu, Jun Wang, Tarek Abdelzaher