Related papers: Spherical Leech Quantization for Visual Tokenizati…

Leech Lattice Vector Quantization for Efficient LLM Compression

Scalar quantization of large language models (LLMs) is fundamentally limited by information-theoretic bounds. While vector quantization (VQ) overcomes these limits by encoding blocks of parameters jointly, practical implementations must…

Machine Learning · Computer Science 2026-03-12 Tycho F. A. van der Ouderaa, Mart van Baalen, Paul Whatmough, Markus Nagel

Scaling Image Tokenizers with Grouped Spherical Quantization

Vision tokenizers have gained a lot of attraction due to their scalability and compactness; previous works depend on old-school GAN-based hyperparameters, biased comparisons, and a lack of comprehensive analysis of the scaling behaviours.…

Computer Vision and Pattern Recognition · Computer Science 2024-12-05 Jiangtao Wang, Zhen Qin, Yifan Zhang, Vincent Tao Hu, Björn Ommer, Rania Briq, Stefan Kesselheim

Constructive spherical codes on layers of flat tori

A new class of spherical codes is constructed by selecting a finite subset of flat tori from a foliation of the unit sphere S^{2L-1} of R^{2L} and designing a structured codebook on each torus layer. The resulting spherical code can be the…

Information Theory · Computer Science 2016-11-17 Cristiano Torezzan, Sueli I. R. Costa, Vinay A. Vaishampayan

Compression for Quadratic Similarity Queries: Finite Blocklength and Practical Schemes

We study the problem of compression for the purpose of similarity identification, where similarity is measured by the mean square Euclidean distance between vectors. While the asymptotical fundamental limits of the problem - the minimal…

Information Theory · Computer Science 2014-05-13 Fabian Steiner, Steffen Dempfle, Amir Ingber, Tsachy Weissman

Image and Video Tokenization with Binary Spherical Quantization

We propose a new transformer-based image and video tokenizer with Binary Spherical Quantization (BSQ). BSQ projects the high-dimensional visual embedding to a lower-dimensional hypersphere and then applies binary quantization. BSQ is (1)…

Computer Vision and Pattern Recognition · Computer Science 2024-06-12 Yue Zhao, Yuanjun Xiong, Philipp Krähenbühl

QuantEase: Optimization-based Quantization for Language Models

With the rising popularity of Large Language Models (LLMs), there has been an increasing interest in compression techniques that enable their efficient deployment. This study focuses on the Post-Training Quantization (PTQ) of LLMs. Drawing…

Machine Learning · Statistics 2023-12-04 Kayhan Behdin, Ayan Acharya, Aman Gupta, Qingquan Song, Siyu Zhu, Sathiya Keerthi, Rahul Mazumder

Learning Optimal Lattice Vector Quantizers for End-to-end Neural Image Compression

It is customary to deploy uniform scalar quantization in the end-to-end optimized Neural image compression methods, instead of more powerful vector quantization, due to the high complexity of the latter. Lattice vector quantization (LVQ),…

Image and Video Processing · Electrical Eng. & Systems 2024-11-26 Xi Zhang, Xiaolin Wu

Deep Spherical Quantization for Image Search

Hashing methods, which encode high-dimensional images with compact discrete codes, have been widely applied to enhance large-scale image retrieval. In this paper, we put forward Deep Spherical Quantization (DSQ), a novel method to make deep…

Computer Vision and Pattern Recognition · Computer Science 2019-06-10 Sepehr Eghbali, Ladan Tahvildari

Statistically-Lossless Quantization of Large Language Models

Model quantization has become essential for efficient large language model deployment, yet existing approaches involve clear trade-offs: methods such as GPTQ and AWQ achieve practical compression but are lossy, while lossless techniques…

Machine Learning · Computer Science 2026-05-05 Michael Helcig, Eldar Kurtic, Dan Alistarh

LL-VQ-VAE: Learnable Lattice Vector-Quantization For Efficient Representations

In this paper we introduce learnable lattice vector quantization and demonstrate its effectiveness for learning discrete representations. Our method, termed LL-VQ-VAE, replaces the vector quantization layer in VQ-VAE with lattice-based…

Machine Learning · Computer Science 2023-10-17 Ahmed Khalil, Robert Piechocki, Raul Santos-Rodriguez

Faster Projection in Sphere Decoding

Most of the calculations in standard sphere decoders are redundant, in the sense that they either calculate quantities that are never used or calculate some quantities more than once. A new method, which is applicable to lattices as well as…

Information Theory · Computer Science 2015-01-07 Arash Ghasemmehdi, Erik Agrell

LVQAC: Lattice Vector Quantization Coupled with Spatially Adaptive Companding for Efficient Learned Image Compression

Recently, numerous end-to-end optimized image compression neural networks have been developed and proved themselves as leaders in rate-distortion performance. The main strength of these learnt compression methods is in powerful nonlinear…

Image and Video Processing · Electrical Eng. & Systems 2023-04-26 Xi Zhang, Xiaolin Wu

Regularized Vector Quantization for Tokenized Image Synthesis

Quantizing images into discrete representations has been a fundamental problem in unified generative modeling. Predominant approaches learn the discrete representation either in a deterministic manner by selecting the best-matching token or…

Computer Vision and Pattern Recognition · Computer Science 2023-10-17 Jiahui Zhang, Fangneng Zhan, Christian Theobalt, Shijian Lu

RSQ: Learning from Important Tokens Leads to Better Quantized LLMs

Layer-wise quantization is a key technique for efficiently compressing large models without expensive retraining. Previous methods typically quantize the weights of each layer by "uniformly" optimizing the layer reconstruction loss across…

Machine Learning · Computer Science 2025-03-04 Yi-Lin Sung, Prateek Yadav, Jialu Li, Jaehong Yoon, Mohit Bansal

LGQ: Learning Discretization Geometry for Scalable and Stable Image Tokenization

Discrete image tokenization is a key bottleneck for scalable visual generation: a tokenizer must remain compact for efficient latent-space priors while preserving semantic structure and using discrete capacity effectively. Existing…

Computer Vision and Pattern Recognition · Computer Science 2026-02-23 Idil Bilge Altun, Mert Onur Cakiroglu, Elham Buxton, Mehmet Dalkilic, Hasan Kurban

Simultaneous Compression and Quantization: A Joint Approach for Efficient Unsupervised Hashing

For unsupervised data-dependent hashing, the two most important requirements are to preserve similarity in the low-dimensional feature space and to minimize the binary quantization loss. A well-established hashing approach is Iterative…

Computer Vision and Pattern Recognition · Computer Science 2019-11-14 Tuan Hoang, Thanh-Toan Do, Huu Le, Dang-Khoa Le-Tan, Ngai-Man Cheung

Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression

Large Language Models (LLMs) have demonstrated remarkable capabilities but typically require extensive computational resources and memory for inference. Post-training quantization (PTQ) can effectively reduce these demands by storing…

Machine Learning · Computer Science 2026-01-27 Xi Zhang, Xiaolin Wu, Jiamang Wang, Weisi Lin

RepQ: Generalizing Quantization-Aware Training for Re-Parametrized Architectures

Existing neural networks are memory-consuming and computationally intensive, making deploying them challenging in resource-constrained environments. However, there are various methods to improve their efficiency. Two such methods are…

Machine Learning · Computer Science 2023-11-10 Anastasiia Prutianova, Alexey Zaytsev, Chung-Kuei Lee, Fengyu Sun, Ivan Koryakovskiy

Learnable Companding Quantization for Accurate Low-bit Neural Networks

Quantizing deep neural networks is an effective method for reducing memory consumption and improving inference speed, and is thus useful for implementation in resource-constrained devices. However, it is still hard for extremely low-bit…

Computer Vision and Pattern Recognition · Computer Science 2021-11-03 Kohei Yamamoto

Luminance-Aware Statistical Quantization: Unsupervised Hierarchical Learning for Illumination Enhancement

Low-light image enhancement (LLIE) faces persistent challenges in balancing reconstruction fidelity with cross-scenario generalization. While existing methods predominantly focus on deterministic pixel-level mappings between paired…

Computer Vision and Pattern Recognition · Computer Science 2025-12-17 Derong Kong, Zhixiong Yang, Shengxi Li, Shuaifeng Zhi, Li Liu, Zhen Liu, Jingyuan Xia