Related papers: Block-Sphere Vector Quantization

TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate

Vector quantization, a problem rooted in Shannon's source coding theory, aims to quantize high-dimensional Euclidean vectors while minimizing distortion in their geometric structure. We propose TurboQuant to address both mean-squared error…

Machine Learning · Computer Science 2025-04-29 Amir Zandieh , Majid Daliri , Majid Hadian , Vahab Mirrokni

A Note on TurboQuant and the Earlier DRIVE/EDEN Line of Work

This note clarifies the relationship between the recent TurboQuant work and the earlier DRIVE (NeurIPS 2021) and EDEN (ICML 2022) schemes. DRIVE is a 1-bit quantizer that EDEN extended to any $b>0$ bits per coordinate; we refer to them…

Machine Learning · Computer Science 2026-05-08 Ran Ben-Basat , Yaniv Ben-Itzhak , Gal Mendelson , Michael Mitzenmacher , Amit Portnoy , Shay Vargaftik

Provable Quantization with Randomized Hadamard Transform

Vector quantization via random projection followed by scalar quantization is a fundamental primitive in machine learning, with applications ranging from similarity search to federated learning and KV cache compression. While dense random…

Machine Learning · Computer Science 2026-05-14 Ying Feng , Piotr Indyk , Michael Kapralov , Dmitry Krachun , Boris Prokhorov

Differentiable Vector Quantization for Rate-Distortion Optimization of Generative Image Compression

The rapid growth of visual data under stringent storage and bandwidth constraints makes extremely low-bitrate image compression increasingly important. While Vector Quantization (VQ) offers strong structural fidelity, existing methods lack…

Computer Vision and Pattern Recognition · Computer Science 2026-05-05 Shiyin Jiang , Wei Long , Minghao Han , Zhenghao Chen , Ce Zhu , Shuhang Gu

Robust Clustering on High-Dimensional Data with Stochastic Quantization

This paper addresses the limitations of conventional vector quantization algorithms, particularly K-Means and its variant K-Means++, and investigates the Stochastic Quantization (SQ) algorithm as a scalable alternative for high-dimensional…

Machine Learning · Computer Science 2025-03-11 Anton Kozyriev , Vladimir Norkin

MVQ:Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization

Vector quantization(VQ) is a hardware-friendly DNN compression method that can reduce the storage cost and weight-loading datawidth of hardware accelerators. However, conventional VQ techniques lead to significant accuracy loss because the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-17 Shuaiting Li , Chengxuan Wang , Juncan Deng , Zeyu Wang , Zewen Ye , Zongsheng Wang , Haibin Shen , Kejie Huang

Restructuring Vector Quantization with the Rotation Trick

Vector Quantized Variational AutoEncoders (VQ-VAEs) are designed to compress a continuous input to a discrete latent space and reconstruct it with minimal distortion. They operate by maintaining a set of vectors -- often referred to as the…

Machine Learning · Computer Science 2025-03-18 Christopher Fifty , Ronald G. Junkins , Dennis Duan , Aniketh Iyengar , Jerry W. Liu , Ehsan Amid , Sebastian Thrun , Christopher Ré

MRQ:Support Multiple Quantization Schemes through Model Re-Quantization

Despite the proliferation of diverse hardware accelerators (e.g., NPU, TPU, DPU), deploying deep learning models on edge devices with fixed-point hardware is still challenging due to complex model quantization and conversion. Existing model…

Machine Learning · Computer Science 2023-08-07 Manasa Manohara , Sankalp Dayal , Tariq Afzal , Rahul Bakshi , Kahkuen Fu

MixQuant: Mixed Precision Quantization with a Bit-width Optimization Search

Quantization is a technique for creating efficient Deep Neural Networks (DNNs), which involves performing computations and storing tensors at lower bit-widths than f32 floating point precision. Quantization reduces model size and inference…

Machine Learning · Computer Science 2023-10-02 Eliska Kloberdanz , Wei Le

DiVeQ: Differentiable Vector Quantization Using the Reparameterization Trick

Vector quantization is common in deep models, yet its hard assignments block gradients and hinder end-to-end training. We propose DiVeQ, which treats quantization as adding an error vector that mimics the quantization distortion, keeping…

Machine Learning · Computer Science 2026-05-27 Mohammad Hassan Vali , Tom Bäckström , Arno Solin

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

Model quantization has emerged as an indispensable technique to accelerate deep learning inference. While researchers continue to push the frontier of quantization algorithms, existing quantization work is often unreproducible and…

Machine Learning · Computer Science 2022-01-26 Yuhang Li , Mingzhu Shen , Jian Ma , Yan Ren , Mingxin Zhao , Qi Zhang , Ruihao Gong , Fengwei Yu , Junjie Yan

Error Diffusion: Post Training Quantization with Block-Scaled Number Formats for Neural Networks

Quantization reduces the model's hardware costs, such as data movement, storage, and operations like multiply and addition. It also affects the model's behavior by degrading the output quality. Therefore, there is a need for methods that…

Machine Learning · Computer Science 2024-10-16 Alireza Khodamoradi , Kristof Denolf , Eric Dellinger

MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization

Vector Quantized Variational Autoencoders (VQ-VAEs) are fundamental models that compress continuous visual data into discrete tokens. Existing methods have tried to improve the quantization strategy for better reconstruction quality,…

Computer Vision and Pattern Recognition · Computer Science 2025-07-15 Mingkai Jia , Wei Yin , Xiaotao Hu , Jiaxin Guo , Xiaoyang Guo , Qian Zhang , Xiao-Xiao Long , Ping Tan

Generalized residual vector quantization for large scale data

Vector quantization is an essential tool for tasks involving large scale data, for example, large scale similarity search, which is crucial for content-based information retrieval and analysis. In this paper, we propose a novel vector…

Multimedia · Computer Science 2016-09-20 Shicong Liu , Junru Shao , Hongtao Lu

Block-encoding based quantum algorithm for linear systems with displacement structures

Matrices with the displacement structures of circulant, Toeplitz, and Hankel types as well as matrices with structures generalizing these types are omnipresent in computations of sciences and engineering. In this paper, we present efficient…

Quantum Physics · Physics 2021-10-06 Lin-Chun Wan , Chao-Hua Yu , Shi-Jie Pan , Su-Juan Qin , Fei Gao , Qiao-Yan Wen

Convergence and Quantum Advantage of Trotterized MERA for Strongly-Correlated Systems

Strongly-correlated quantum many-body systems are difficult to study and simulate classically. We recently proposed a variational quantum eigensolver (VQE) based on the multiscale entanglement renormalization ansatz (MERA) with tensors…

Quantum Physics · Physics 2025-02-18 Qiang Miao , Thomas Barthel

Vectorized Attention with Learnable Encoding for Quantum Transformer

Vectorized quantum block encoding provides a way to embed classical data into Hilbert space, offering a pathway for quantum models, such as Quantum Transformers (QT), that replace classical self-attention with quantum circuit simulations to…

Quantum Physics · Physics 2025-09-05 Ziqing Guo , Ziwen Pan , Alex Khan , Jan Balewski

EDM: Equirectangular Projection-Oriented Dense Kernelized Feature Matching

We introduce the first learning-based dense matching algorithm, termed Equirectangular Projection-Oriented Dense Kernelized Feature Matching (EDM), specifically designed for omnidirectional images. Equirectangular projection (ERP) images,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-03 Dongki Jung , Jaehoon Choi , Yonghan Lee , Somi Jeong , Taejae Lee , Dinesh Manocha , Suyong Yeon

Scalable Image Tokenization with Index Backpropagation Quantization

Existing vector quantization (VQ) methods struggle with scalability, largely attributed to the instability of the codebook that undergoes partial updates during training. The codebook is prone to collapse as utilization decreases, due to…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Fengyuan Shi , Zhuoyan Luo , Yixiao Ge , Yujiu Yang , Ying Shan , Limin Wang

Learning Low-Rank Representations for Model Compression

Vector Quantization (VQ) is an appealing model compression method to obtain a tiny model with less accuracy loss. While methods to obtain better codebooks and codes under fixed clustering dimensionality have been extensively studied,…

Computer Vision and Pattern Recognition · Computer Science 2022-11-22 Zezhou Zhu , Yucong Zhou , Zhao Zhong