English
Related papers

Related papers: Regularized Vector Quantization for Tokenized Imag…

200 papers

Vector quantization is a technique in machine learning that discretizes continuous representations into a set of discrete vectors. It is widely employed in tokenizing data representations for large language models, diffusion models, and…

Machine Learning · Computer Science 2026-03-19 Wenhao Zhao , Qiran Zou , Rushi Shah , Yudi Wu , Zhouhan Lin , Dianbo Liu

Vector quantization is a technique in machine learning that discretizes continuous representations into a set of discrete vectors. It is widely employed in tokenizing data representations for large language models, diffusion models, and…

Machine Learning · Computer Science 2024-11-26 Wenhao Zhao , Qiran Zou , Rushi Shah , Dianbo Liu

Vector Quantization (VQ) has become the cornerstone of tokenization for many multimodal Large Language Models and diffusion synthesis. However, existing VQ paradigms suffer from a fundamental conflict: they enforce discretization before the…

Machine Learning · Computer Science 2026-03-25 Wenhao Zhao , Qiran Zou , Zhouhan Lin , Dianbo Liu

Although quantization has emerged as a promising approach to reducing computational complexity across various high-level vision tasks, it inevitably leads to accuracy loss in image super-resolution (SR) networks. This is due to the…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Cheeun Hong , Kyoung Mu Lee

The success of autoregressive models largely depends on the effectiveness of vector quantization, a technique that discretizes continuous features by mapping them to the nearest code vectors within a learnable codebook. Two critical issues…

Computer Vision and Pattern Recognition · Computer Science 2025-06-19 Xianghong Fang , Litao Guo , Hengchao Chen , Yuxuan Zhang , XiaofanXia , Dingjie Song , Yexin Liu , Hao Wang , Harry Yang , Yuan Yuan , Qiang Sun

Vector quantization, which discretizes a continuous vector space into a finite set of representative vectors (a codebook), has been widely adopted in modern machine learning. Despite its effectiveness, vector quantization poses a…

Machine Learning · Computer Science 2026-01-30 Takashi Morita

A learning-based framework for representation of domain-specific images is proposed where joint compression and denoising can be done using a VQ-based multi-layer network. While it learns to compress the images from a training set, the…

Computer Vision and Pattern Recognition · Computer Science 2017-07-10 Sohrab Ferdowsi , Slava Voloshynovskiy , Dimche Kostadinov

Although two-stage Vector Quantized (VQ) generative models allow for synthesizing high-fidelity and high-resolution images, their quantization operator encodes similar patches within an image into the same index, resulting in a repeated…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Chuanxia Zheng , Long Tung Vuong , Jianfei Cai , Dinh Phung

Vector quantization (VQ) transforms continuous image features into discrete representations, providing compressed, tokenized inputs for generative models. However, VQ-based frameworks suffer from several issues, such as non-smooth latent…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Sicheng Yang , Xing Hu , Qiang Wu , Dawei Yang

Visual tokenizers are fundamental to image generation. They convert visual data into discrete tokens, enabling transformer-based models to excel at image generation. Despite their success, VQ-based tokenizers like VQGAN face significant…

Computer Vision and Pattern Recognition · Computer Science 2024-11-28 Zechen Bai , Jianxiong Gao , Ziteng Gao , Pichao Wang , Zheng Zhang , Tong He , Mike Zheng Shou

Visual autoregressive (AR) generation offers a promising path toward unifying vision and language models, yet its performance remains suboptimal against diffusion models. Prior work often attributes this gap to tokenizer limitations and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-07 Qiyuan He , Yicong Li , Haotian Ye , Jinghao Wang , Xinyao Liao , Pheng-Ann Heng , Stefano Ermon , James Zou , Angela Yao

Objective: This paper investigates how generative models, trained on ground-truth images, can be used \changes{as} priors for inverse problems, penalizing reconstructions far from images the generator can produce. The aim is that learned…

Image and Video Processing · Electrical Eng. & Systems 2023-08-07 Margaret Duff , Ivor J. A. Simpson , Matthias J. Ehrhardt , Neill D. F. Campbell

Vector-quantized based models have recently demonstrated strong potential for visual prior modeling. However, existing VQ-based methods simply encode visual features with nearest codebook items and train index predictor with code-level…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Qifan Li , Jiale Zou , Jinhua Zhang , Wei Long , Xingyu Zhou , Shuhang Gu

Vector Quantization (VQ) has recently emerged as a promising approach for learning discrete representations of graph-structured data. However, a fundamental challenge, i.e., codebook collapse, remains underexplored in the graph domain,…

Machine Learning · Computer Science 2025-09-29 Zian Zhai , Fan Li , Xingyu Tan , Xiaoyang Wang , Wenjie Zhang

The reliability of segmentation models in the medical domain depends on the model's robustness to perturbations in the input space. Robustness is a particular challenge in medical imaging exhibiting various sources of image noise,…

Image and Video Processing · Electrical Eng. & Systems 2022-07-06 Ainkaran Santhirasekaram , Avinash Kori , Mathias Winkler , Andrea Rockall , Ben Glocker

We propose a regularization scheme for image reconstruction that leverages the power of deep learning while hinging on classic sparsity-promoting models. Many deep-learning-based models are hard to interpret and cumbersome to analyze…

Image and Video Processing · Electrical Eng. & Systems 2024-07-10 Mehrsa Pourya , Sebastian Neumayer , Michael Unser

The Residual Quantization (RQ) framework is revisited where the quantization distortion is being successively reduced in multi-layers. Inspired by the reverse-water-filling paradigm in rate-distortion theory, an efficient regularization on…

Machine Learning · Computer Science 2017-05-02 Sohrab Ferdowsi , Slava Voloshynovskiy , Dimche Kostadinov

Generative models with discrete latent representations have recently demonstrated an impressive ability to learn complex high-dimensional data distributions. However, their performance relies on a long sequence of tokens per instance and a…

Machine Learning · Computer Science 2024-03-26 David D. Nguyen , David Leibowitz , Surya Nepal , Salil S. Kanhere

Current image-to-image translation methods formulate the task with conditional generation models, leading to learning only the recolorization or regional changes as being constrained by the rich structural information provided by the…

Computer Vision and Pattern Recognition · Computer Science 2022-07-28 Yu-Jie Chen , Shin-I Cheng , Wei-Chen Chiu , Hung-Yu Tseng , Hsin-Ying Lee

Deep generative models have achieved conspicuous progress in realistic image synthesis with multifarious conditional inputs, while generating diverse yet high-fidelity images remains a grand challenge in conditional image generation. This…

Computer Vision and Pattern Recognition · Computer Science 2022-07-25 Fangneng Zhan , Yingchen Yu , Rongliang Wu , Jiahui Zhang , Kaiwen Cui , Changgong Zhang , Shijian Lu
‹ Prev 1 2 3 10 Next ›