Related papers: Generalized residual vector quantization for large…

Improved Residual Vector Quantization for High-dimensional Approximate Nearest Neighbor Search

Quantization methods have been introduced to perform large scale approximate nearest search tasks. Residual Vector Quantization (RVQ) is one of the effective quantization methods. RVQ uses a multi-stage codebook learning scheme to lower the…

Computer Vision and Pattern Recognition · Computer Science 2015-09-18 Shicong Liu , Hongtao Lu , Junru Shao

Sequential Regression for Continuous Value Prediction using Residual Quantization

Continuous value prediction plays a crucial role in industrial-scale recommendation systems, including tasks such as predicting users' watch-time and estimating the gross merchandise value (GMV) in e-commerce transactions. However, it…

Information Retrieval · Computer Science 2026-02-27 Runpeng Cui , Zhipeng Sun , Chi Lu , Peng Jiang

Residual Quantization with Implicit Neural Codebooks

Vector quantization is a fundamental operation for data compression and vector search. To obtain high accuracy, multi-codebook methods represent each vector using codewords across several codebooks. Residual quantization (RQ) is one such…

Machine Learning · Computer Science 2024-05-22 Iris A. M. Huijben , Matthijs Douze , Matthew Muckley , Ruud J. G. van Sloun , Jakob Verbeek

Generalized Radius and Integrated Codebook Transforms for Differentiable Vector Quantization

Vector quantization (VQ) underpins modern generative and representation models by turning continuous latents into discrete tokens. Yet hard nearest-neighbor assignments are non-differentiable and are typically optimized with heuristic…

Machine Learning · Computer Science 2026-02-03 Haochen You , Heng Zhang , Hongyang He , Yuqi Li , Baojing Liu

Generalized Learning Vector Quantization for Classification in Randomized Neural Networks and Hyperdimensional Computing

Machine learning algorithms deployed on edge devices must meet certain resource constraints and efficiency requirements. Random Vector Functional Link (RVFL) networks are favored for such applications due to their simple design and training…

Machine Learning · Computer Science 2022-09-02 Cameron Diao , Denis Kleyko , Jan M. Rabaey , Bruno A. Olshausen

Individualized non-uniform quantization for vector search

Embedding vectors are widely used for representing unstructured data and searching through it for semantically similar items. However, the large size of these vectors, due to their high-dimensionality, creates problems for modern vector…

Machine Learning · Computer Science 2025-09-24 Mariano Tepper , Ted Willke

Efficient Generative Modeling with Residual Vector Quantization-Based Tokens

We introduce ResGen, an efficient Residual Vector Quantization (RVQ)-based generative model for high-fidelity generation with fast sampling. RVQ improves data fidelity by increasing the number of quantization steps, referred to as depth,…

Machine Learning · Computer Science 2025-06-03 Jaehyeon Kim , Taehong Moon , Keon Lee , Jaewoong Cho

Generalized Product Quantization Network for Semi-supervised Image Retrieval

Image retrieval methods that employ hashing or vector quantization have achieved great success by taking advantage of deep learning. However, these approaches do not meet expectations unless expensive label information is sufficient. To…

Computer Vision and Pattern Recognition · Computer Science 2020-06-15 Young Kyun Jang , Nam Ik Cho

Vector Quantization for Recommender Systems: A Review and Outlook

Vector quantization, renowned for its unparalleled feature compression capabilities, has been a prominent topic in signal processing and machine learning research for several decades and remains widely utilized today. With the emergence of…

Information Retrieval · Computer Science 2024-05-07 Qijiong Liu , Xiaoyu Dong , Jiaren Xiao , Nuo Chen , Hengchang Hu , Jieming Zhu , Chenxu Zhu , Tetsuya Sakai , Xiao-Ming Wu

DiVeQ: Differentiable Vector Quantization Using the Reparameterization Trick

Vector quantization is common in deep models, yet its hard assignments block gradients and hinder end-to-end training. We propose DiVeQ, which treats quantization as adding an error vector that mimics the quantization distortion, keeping…

Machine Learning · Computer Science 2026-05-27 Mohammad Hassan Vali , Tom Bäckström , Arno Solin

LG-VQ: Language-Guided Codebook Learning

Vector quantization (VQ) is a key technique in high-resolution and high-fidelity image synthesis, which aims to learn a codebook to encode an image with a sequence of discrete codes and then generate an image in an auto-regression manner.…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Guotao Liang , Baoquan Zhang , Yaowei Wang , Xutao Li , Yunming Ye , Huaibin Wang , Chuyao Luo , Kola Ye , linfeng Luo

Regularized Residual Quantization: a multi-layer sparse dictionary learning approach

The Residual Quantization (RQ) framework is revisited where the quantization distortion is being successively reduced in multi-layers. Inspired by the reverse-water-filling paradigm in rate-distortion theory, an efficient regularization on…

Machine Learning · Computer Science 2017-05-02 Sohrab Ferdowsi , Slava Voloshynovskiy , Dimche Kostadinov

Residual vector quantization for KV cache compression in large language model

KV cache compression methods have mainly relied on scalar quantization techniques to reduce the memory requirements during decoding. In this work, we apply residual vector quantization, which has been widely used for high fidelity audio…

Machine Learning · Computer Science 2024-10-22 Ankur Kumar

Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior

The vector quantization is a widely used method to map continuous representation to discrete space and has important application in tokenization for generative mode, bottlenecking information and many other tasks in machine learning. Vector…

Machine Learning · Computer Science 2024-10-15 Mingyuan Yan , Jiawei Wu , Rushi Shah , Dianbo Liu

Autoregressive Image Generation using Residual Quantization

For autoregressive (AR) modeling of high-resolution images, vector quantization (VQ) represents an image as a sequence of discrete codes. A short sequence length is important for an AR model to reduce its computational costs to consider…

Computer Vision and Pattern Recognition · Computer Science 2022-03-10 Doyup Lee , Chiheon Kim , Saehoon Kim , Minsu Cho , Wook-Shin Han

Nowadays, data is represented by vectors. Retrieving those vectors, among millions and billions, that are similar to a given query is a ubiquitous problem, known as similarity search, of relevance for a wide range of applications.…

Machine Learning · Computer Science 2023-07-26 Cecilia Aguerrebere , Ishwar Bhati , Mark Hildebrand , Mariano Tepper , Ted Willke

VecQ: Minimal Loss DNN Model Compression With Vectorized Weight Quantization

Quantization has been proven to be an effective method for reducing the computing and/or storage cost of DNNs. However, the trade-off between the quantization bitwidth and final accuracy is complex and non-convex, which makes it difficult…

Computer Vision and Pattern Recognition · Computer Science 2020-06-11 Cheng Gong , Yao Chen , Ye Lu , Tao Li , Cong Hao , Deming Chen

Soft Convex Quantization: Revisiting Vector Quantization with Convex Optimization

Vector Quantization (VQ) is a well-known technique in deep learning for extracting informative discrete latent representations. VQ-embedded models have shown impressive results in a range of applications including image and speech…

Machine Learning · Computer Science 2023-10-05 Tanmay Gautam , Reid Pryzant , Ziyi Yang , Chenguang Zhu , Somayeh Sojoudi

Texture Vector-Quantization and Reconstruction Aware Prediction for Generative Super-Resolution

Vector-quantized based models have recently demonstrated strong potential for visual prior modeling. However, existing VQ-based methods simply encode visual features with nearest codebook items and train index predictor with code-level…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Qifan Li , Jiale Zou , Jinhua Zhang , Wei Long , Xingyu Zhou , Shuhang Gu

MGVQ: Synergizing Multi-dimensional Sensitivity-Aware and Gradient-Hessian Fusion for Vector Quantization

Vision-Language Models (VLMs) achieve outstanding performance, yet their huge model size severely hinders deployment on edge devices with limited resources. As an efficient model compression technique, vector quantization (VQ) excels in…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Zhong Wang , Zukang Xu , Xing Hu , Dawei Yang