Related papers: Regularized Vector Quantization for Tokenized Imag…

Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization

Vector quantization is a technique in machine learning that discretizes continuous representations into a set of discrete vectors. It is widely employed in tokenizing data representations for large language models, diffusion models, and…

Machine Learning · Computer Science 2026-03-19 Wenhao Zhao , Qiran Zou , Rushi Shah , Yudi Wu , Zhouhan Lin , Dianbo Liu

Representation Collapsing Problems in Vector Quantization

Vector quantization is a technique in machine learning that discretizes continuous representations into a set of discrete vectors. It is widely employed in tokenizing data representations for large language models, diffusion models, and…

Machine Learning · Computer Science 2024-11-26 Wenhao Zhao , Qiran Zou , Rushi Shah , Dianbo Liu

Mitigating Premature Discretization with Progressive Quantization for Robust Vector Tokenization

Vector Quantization (VQ) has become the cornerstone of tokenization for many multimodal Large Language Models and diffusion synthesis. However, existing VQ paradigms suffer from a fundamental conflict: they enforce discretization before the…

Machine Learning · Computer Science 2026-03-25 Wenhao Zhao , Qiran Zou , Zhouhan Lin , Dianbo Liu

Overcoming Distribution Mismatch in Quantizing Image Super-Resolution Networks

Although quantization has emerged as a promising approach to reducing computational complexity across various high-level vision tasks, it inevitably leads to accuracy loss in image super-resolution (SR) networks. This is due to the…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Cheeun Hong , Kyoung Mu Lee

Enhancing Vector Quantization with Distributional Matching: A Theoretical and Empirical Study

The success of autoregressive models largely depends on the effectiveness of vector quantization, a technique that discretizes continuous features by mapping them to the nearest code vectors within a learnable codebook. Two critical issues…

Computer Vision and Pattern Recognition · Computer Science 2025-06-19 Xianghong Fang , Litao Guo , Hengchao Chen , Yuxuan Zhang , XiaofanXia , Dingjie Song , Yexin Liu , Hao Wang , Harry Yang , Yuan Yuan , Qiang Sun

Pushing Toward the Simplex Vertices: A Simple Remedy for Code Collapse in Smoothed Vector Quantization

Vector quantization, which discretizes a continuous vector space into a finite set of representative vectors (a codebook), has been widely adopted in modern machine learning. Despite its effectiveness, vector quantization poses a…

Machine Learning · Computer Science 2026-01-30 Takashi Morita

A multi-layer image representation using Regularized Residual Quantization: application to compression and denoising

A learning-based framework for representation of domain-specific images is proposed where joint compression and denoising can be done using a VQ-based multi-layer network. While it learns to compress the images from a training set, the…

Computer Vision and Pattern Recognition · Computer Science 2017-07-10 Sohrab Ferdowsi , Slava Voloshynovskiy , Dimche Kostadinov

MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation

Although two-stage Vector Quantized (VQ) generative models allow for synthesizing high-fidelity and high-resolution images, their quantization operator encodes similar patches within an image into the same index, resulting in a repeated…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Chuanxia Zheng , Long Tung Vuong , Jianfei Cai , Dinh Phung

VAEVQ: Enhancing Discrete Visual Tokenization through Variational Modeling

Vector quantization (VQ) transforms continuous image features into discrete representations, providing compressed, tokenized inputs for generative models. However, VQ-based frameworks suffer from several issues, such as non-smooth latent…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Sicheng Yang , Xing Hu , Qiang Wu , Dawei Yang

Factorized Visual Tokenization and Generation

Visual tokenizers are fundamental to image generation. They convert visual data into discrete tokens, enabling transformer-based models to excel at image generation. Despite their success, VQ-based tokenizers like VQGAN face significant…

Computer Vision and Pattern Recognition · Computer Science 2024-11-28 Zechen Bai , Jianxiong Gao , Ziteng Gao , Pichao Wang , Zheng Zhang , Tong He , Mike Zheng Shou

REAR: Rethinking Visual Autoregressive Models via Generator-Tokenizer Consistency Regularization

Visual autoregressive (AR) generation offers a promising path toward unifying vision and language models, yet its performance remains suboptimal against diffusion models. Prior work often attributes this gap to tokenizer limitations and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-07 Qiyuan He , Yicong Li , Haotian Ye , Jinghao Wang , Xinyao Liao , Pheng-Ann Heng , Stefano Ermon , James Zou , Angela Yao

Compressed Sensing MRI Reconstruction Regularized by VAEs with Structured Image Covariance

Objective: This paper investigates how generative models, trained on ground-truth images, can be used \changes{as} priors for inverse problems, penalizing reconstructions far from images the generator can produce. The aim is that learned…

Image and Video Processing · Electrical Eng. & Systems 2023-08-07 Margaret Duff , Ivor J. A. Simpson , Matthias J. Ehrhardt , Neill D. F. Campbell

Texture Vector-Quantization and Reconstruction Aware Prediction for Generative Super-Resolution

Vector-quantized based models have recently demonstrated strong potential for visual prior modeling. However, existing VQ-based methods simply encode visual features with nearest codebook items and train index predictor with code-level…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Qifan Li , Jiale Zou , Jinhua Zhang , Wei Long , Xingyu Zhou , Shuhang Gu

Graph is a Natural Regularization: Revisiting Vector Quantization for Graph Representation Learning

Vector Quantization (VQ) has recently emerged as a promising approach for learning discrete representations of graph-structured data. However, a fundamental challenge, i.e., codebook collapse, remains underexplored in the graph domain,…

Machine Learning · Computer Science 2025-09-29 Zian Zhai , Fan Li , Xingyu Tan , Xiaoyang Wang , Wenjie Zhang

Vector Quantisation for Robust Segmentation

The reliability of segmentation models in the medical domain depends on the model's robustness to perturbations in the input space. Robustness is a particular challenge in medical imaging exhibiting various sources of image noise,…

Image and Video Processing · Electrical Eng. & Systems 2022-07-06 Ainkaran Santhirasekaram , Avinash Kori , Mathias Winkler , Andrea Rockall , Ben Glocker

Iteratively Refined Image Reconstruction with Learned Attentive Regularizers

We propose a regularization scheme for image reconstruction that leverages the power of deep learning while hinging on classic sparsity-promoting models. Many deep-learning-based models are hard to interpret and cumbersome to analyze…

Image and Video Processing · Electrical Eng. & Systems 2024-07-10 Mehrsa Pourya , Sebastian Neumayer , Michael Unser

Regularized Residual Quantization: a multi-layer sparse dictionary learning approach

The Residual Quantization (RQ) framework is revisited where the quantization distortion is being successively reduced in multi-layers. Inspired by the reverse-water-filling paradigm in rate-distortion theory, an efficient regularization on…

Machine Learning · Computer Science 2017-05-02 Sohrab Ferdowsi , Slava Voloshynovskiy , Dimche Kostadinov

Masked Vector Quantization

Generative models with discrete latent representations have recently demonstrated an impressive ability to learn complex high-dimensional data distributions. However, their performance relies on a long sequence of tokens per instance and a…

Machine Learning · Computer Science 2024-03-26 David D. Nguyen , David Leibowitz , Surya Nepal , Salil S. Kanhere

Vector Quantized Image-to-Image Translation

Current image-to-image translation methods formulate the task with conditional generation models, leading to learning only the recolorization or regional changes as being constrained by the rich structural information provided by the…

Computer Vision and Pattern Recognition · Computer Science 2022-07-28 Yu-Jie Chen , Shin-I Cheng , Wei-Chen Chiu , Hung-Yu Tseng , Hsin-Ying Lee

Auto-regressive Image Synthesis with Integrated Quantization

Deep generative models have achieved conspicuous progress in realistic image synthesis with multifarious conditional inputs, while generating diverse yet high-fidelity images remains a grand challenge in conditional image generation. This…

Computer Vision and Pattern Recognition · Computer Science 2022-07-25 Fangneng Zhan , Yingchen Yu , Rongliang Wu , Jiahui Zhang , Kaiwen Cui , Changgong Zhang , Shijian Lu