Related papers: Beyond Stationarity: Rethinking Codebook Collapse …

Online Clustered Codebook

Vector Quantisation (VQ) is experiencing a comeback in machine learning, where it is increasingly used in representation learning. However, optimizing the codevectors in existing VQ-VAE is not entirely trivial. A problem is codebook…

Computer Vision and Pattern Recognition · Computer Science 2023-07-31 Chuanxia Zheng, Andrea Vedaldi

Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

Vector Quantization (VQ) is essential for discretizing continuous representations in unsupervised learning but suffers from representation collapse, causing low codebook utilization and limiting scalability. Existing solutions often rely on…

Machine Learning · Computer Science 2025-10-06 Yongxin Zhu, Bocheng Li, Yifei Xin, Zhihua Xia, Linli Xu

Scalable Image Tokenization with Index Backpropagation Quantization

Existing vector quantization (VQ) methods struggle with scalability, largely attributed to the instability of the codebook that undergoes partial updates during training. The codebook is prone to collapse as utilization decreases, due to…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Fengyuan Shi, Zhuoyan Luo, Yixiao Ge, Yujiu Yang, Ying Shan, Limin Wang

Graph is a Natural Regularization: Revisiting Vector Quantization for Graph Representation Learning

Vector Quantization (VQ) has recently emerged as a promising approach for learning discrete representations of graph-structured data. However, a fundamental challenge, i.e., codebook collapse, remains underexplored in the graph domain,…

Machine Learning · Computer Science 2025-09-29 Zian Zhai, Fan Li, Xingyu Tan, Xiaoyang Wang, Wenjie Zhang

NVTC: Nonlinear Vector Transform Coding

In theory, vector quantization (VQ) is always better than scalar quantization (SQ) in terms of rate-distortion (R-D) performance. Recent state-of-the-art methods for neural image compression are mainly based on nonlinear transform coding…

Computer Vision and Pattern Recognition · Computer Science 2023-05-26 Runsen Feng, Zongyu Guo, Weiping Li, Zhibo Chen

Restructuring Vector Quantization with the Rotation Trick

Vector Quantized Variational AutoEncoders (VQ-VAEs) are designed to compress a continuous input to a discrete latent space and reconstruct it with minimal distortion. They operate by maintaining a set of vectors -- often referred to as the…

Machine Learning · Computer Science 2025-03-18 Christopher Fifty, Ronald G. Junkins, Dennis Duan, Aniketh Iyengar, Jerry W. Liu, Ehsan Amid, Sebastian Thrun, Christopher Ré

VP-VAE: Rethinking Vector Quantization via Adaptive Vector Perturbation

Vector Quantized Variational Autoencoders (VQ-VAEs) are fundamental to modern generative modeling, yet they often suffer from training instability and "codebook collapse" due to the inherent coupling of representation learning and discrete…

Machine Learning · Computer Science 2026-02-20 Linwei Zhai, Han Ding, Mingzhi Lin, Cui Zhao, Fei Wang, Ge Wang, Wang Zhi, Wei Xi

NDVQ: Robust Neural Audio Codec with Normal Distribution-Based Vector Quantization

Built upon vector quantization (VQ), discrete audio codec models have achieved great success in audio compression and auto-regressive audio generation. However, existing models face substantial challenges in perceptual quality and signal…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-20 Zhikang Niu, Sanyuan Chen, Long Zhou, Ziyang Ma, Xie Chen, Shujie Liu

LG-VQ: Language-Guided Codebook Learning

Vector quantization (VQ) is a key technique in high-resolution and high-fidelity image synthesis, which aims to learn a codebook to encode an image with a sequence of discrete codes and then generate an image in an auto-regression manner.…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Guotao Liang, Baoquan Zhang, Yaowei Wang, Xutao Li, Yunming Ye, Huaibin Wang, Chuyao Luo, Kola Ye, linfeng Luo

VAEVQ: Enhancing Discrete Visual Tokenization through Variational Modeling

Vector quantization (VQ) transforms continuous image features into discrete representations, providing compressed, tokenized inputs for generative models. However, VQ-based frameworks suffer from several issues, such as non-smooth latent…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Sicheng Yang, Xing Hu, Qiang Wu, Dawei Yang

Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization

Vector quantization is a technique in machine learning that discretizes continuous representations into a set of discrete vectors. It is widely employed in tokenizing data representations for large language models, diffusion models, and…

Machine Learning · Computer Science 2026-03-19 Wenhao Zhao, Qiran Zou, Rushi Shah, Yudi Wu, Zhouhan Lin, Dianbo Liu

Scalable Training for Vector-Quantized Networks with 100% Codebook Utilization

Vector quantization (VQ) is a key component in discrete tokenizers for image generation, but its training is often unstable due to straight-through estimation bias, one-step-behind updates, and sparse codebook gradients, which lead to…

Computer Vision and Pattern Recognition · Computer Science 2025-09-15 Yifan Chang, Jie Qin, Limeng Qiao, Xiaofeng Wang, Zheng Zhu, Lin Ma, Xingang Wang

SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization

One noted issue of vector-quantized variational autoencoder (VQ-VAE) is that the learned discrete representation uses only a fraction of the full capacity of the codebook, also known as codebook collapse. We hypothesize that the training…

Machine Learning · Computer Science 2022-06-10 Yuhta Takida, Takashi Shibuya, WeiHsiang Liao, Chieh-Hsin Lai, Junki Ohmura, Toshimitsu Uesaka, Naoki Murata, Shusuke Takahashi, Toshiyuki Kumakura, Yuki Mitsufuji

Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization

Existing vector quantization (VQ) based autoregressive models follow a two-stage generation paradigm that first learns a codebook to encode images as discrete codes, and then completes generation based on the learned codebook. However, they…

Computer Vision and Pattern Recognition · Computer Science 2023-05-22 Mengqi Huang, Zhendong Mao, Zhuowei Chen, Yongdong Zhang

PCA-VAE: Differentiable Subspace Quantization without Codebook Collapse

Vector-quantized autoencoders deliver high-fidelity latents but suffer inherent flaws: the quantizer is non-differentiable, requires straight-through hacks, and is prone to collapse. We address these issues at the root by replacing VQ with…

Machine Learning · Computer Science 2026-02-24 Hao Lu, Onur C. Koyun, Yongxin Guo, Zhengjie Zhu, Abbas Alili, Metin Nafi Gurcan

SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting

Vector Quantization (VQ) has emerged as a prominent weight compression technique, showcasing substantially lower quantization errors than uniform quantization across diverse models, particularly in extreme compression scenarios. However,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Shuaiting Li, Juncan Deng, Chenxuan Wang, Kedong Xu, Rongtao Deng, Hong Gu, Haibin Shen, Kejie Huang

Generalized Radius and Integrated Codebook Transforms for Differentiable Vector Quantization

Vector quantization (VQ) underpins modern generative and representation models by turning continuous latents into discrete tokens. Yet hard nearest-neighbor assignments are non-differentiable and are typically optimized with heuristic…

Machine Learning · Computer Science 2026-02-03 Haochen You, Heng Zhang, Hongyang He, Yuqi Li, Baojing Liu

Representation Collapsing Problems in Vector Quantization

Vector quantization is a technique in machine learning that discretizes continuous representations into a set of discrete vectors. It is widely employed in tokenizing data representations for large language models, diffusion models, and…

Machine Learning · Computer Science 2024-11-26 Wenhao Zhao, Qiran Zou, Rushi Shah, Dianbo Liu

Variable Bitrate Residual Vector Quantization for Audio Coding

Recent state-of-the-art neural audio compression models have progressively adopted residual vector quantization (RVQ). Despite this success, these models employ a fixed number of codebooks per frame, which can be suboptimal in terms of…

Sound · Computer Science 2025-04-29 Yunkee Chae, Woosung Choi, Yuhta Takida, Junghyun Koo, Yukara Ikemiya, Zhi Zhong, Kin Wai Cheuk, Marco A. Martínez-Ramírez, Kyogu Lee, Wei-Hsiang Liao, Yuki Mitsufuji

Residual Quantization with Implicit Neural Codebooks

Vector quantization is a fundamental operation for data compression and vector search. To obtain high accuracy, multi-codebook methods represent each vector using codewords across several codebooks. Residual quantization (RQ) is one such…

Machine Learning · Computer Science 2024-05-22 Iris A. M. Huijben, Matthijs Douze, Matthew Muckley, Ruud J. G. van Sloun, Jakob Verbeek