Related papers: On Distribution Preserving Quantization

Differentially Quantized Gradient Methods

Consider the following distributed optimization scenario. A worker has access to training data that it uses to compute the gradients while a server decides when to stop iterative computation based on its target accuracy or delay…

Machine Learning · Computer Science 2022-04-28 Chung-Yi Lin , Victoria Kostina , Babak Hassibi

Beyond Product Quantization: Deep Progressive Quantization for Image Retrieval

Product Quantization (PQ) has long been a mainstream for generating an exponentially large codebook at very low memory/time cost. Despite its success, PQ is still tricky for the decomposition of high-dimensional vector space, and the…

Computer Vision and Pattern Recognition · Computer Science 2020-12-08 Lianli Gao , Xiaosu Zhu , Jingkuan Song , Zhou Zhao , Heng Tao Shen

Frame Permutation Quantization

Frame permutation quantization (FPQ) is a new vector quantization technique using finite frames. In FPQ, a vector is encoded using a permutation source code to quantize its frame expansion. This means that the encoding is a partial ordering…

Information Theory · Computer Science 2015-03-24 Ha Q. Nguyen , Vivek K Goyal , Lav R. Varshney

TR-DQ: Time-Rotation Diffusion Quantization

Diffusion models have been widely adopted in image and video generation. However, their complex network architecture leads to high inference overhead for its generation process. Existing diffusion quantization methods primarily focus on the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Yihua Shao , Deyang Lin , Fanhu Zeng , Minxi Yan , Muyang Zhang , Siyu Chen , Yuxuan Fan , Ziyang Yan , Haozhe Wang , Jingcai Guo , Yan Wang , Haotong Qin , Hao Tang

Frequency-Dependent Perceptual Quantisation for Visually Lossless Compression Applications

The default quantisation algorithms in the state-of-the-art High Efficiency Video Coding (HEVC) standard, namely Uniform Reconstruction Quantisation (URQ) and Rate-Distortion Optimised Quantisation (RDOQ), do not take into account the…

Multimedia · Computer Science 2019-06-11 Lee Prangnell

Bridging the Gap between Continuous and Informative Discrete Representations by Random Product Quantization

Self-supervised learning (SSL) has become a core technique in speech processing, but the high dimensionality of its representations makes discretization essential for improving efficiency. However, existing discretization methods still…

Audio and Speech Processing · Electrical Eng. & Systems 2025-11-11 Xueqing Li , Hao Ma , Zehan Li , Rujin Chen , Boyu Zhu , Ruihao Jing , Jian Kang , Jie Li , Chi Zhang , Xiao-Lei Zhang , Xuelong Li

A coding theorem for the rate-distortion-perception function

The rate-distortion-perception function (RDPF; Blau and Michaeli, 2019) has emerged as a useful tool for thinking about realism and distortion of reconstructions in lossy compression. Unlike the rate-distortion function, however, it is…

Information Theory · Computer Science 2021-04-29 Lucas Theis , Aaron B. Wagner

Adaptive Dataset Quantization: A New Direction for Dataset Pruning

This paper addresses the challenges of storage and communication costs for large-scale datasets in resource-constrained edge devices by proposing a novel dataset quantization approach to reduce intra-sample redundancy. Unlike traditional…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Chenyue Yu , Jianyu Yu

Differentiable Product Quantization for End-to-End Embedding Compression

Embedding layers are commonly used to map discrete symbols into continuous embedding vectors that reflect their semantic meanings. Despite their effectiveness, the number of parameters in an embedding layer increases linearly with the…

Machine Learning · Computer Science 2020-06-29 Ting Chen , Lala Li , Yizhou Sun

Data-Driven Neural Estimation of Indirect Rate-Distortion Function

The rate-distortion function (RDF) has long been an information-theoretic benchmark for data compression. As its natural extension, the indirect rate-distortion function (iRDF) corresponds to the scenario where the encoder can only access…

Information Theory · Computer Science 2025-03-11 Zichao Yu , Qiang Sun , Wenyi Zhang

Differentiable Vector Quantization for Rate-Distortion Optimization of Generative Image Compression

The rapid growth of visual data under stringent storage and bandwidth constraints makes extremely low-bitrate image compression increasingly important. While Vector Quantization (VQ) offers strong structural fidelity, existing methods lack…

Computer Vision and Pattern Recognition · Computer Science 2026-05-05 Shiyin Jiang , Wei Long , Minghao Han , Zhenghao Chen , Ce Zhu , Shuhang Gu

Randomized Quantization and Source Coding with Constrained Output Distribution

This paper studies fixed-rate randomized vector quantization under the constraint that the quantizer's output has a given fixed probability distribution. A general representation of randomized quantizers that includes the common models in…

Information Theory · Computer Science 2016-11-15 Naci Saldi , Tamás Linder , Serdar Yüksel

Distributed Functional Scalar Quantization Simplified

Distributed functional scalar quantization (DFSQ) theory provides optimality conditions and predicts performance of data acquisition systems in which a computation on acquired data is desired. We address two limitations of previous works:…

Information Theory · Computer Science 2015-03-24 John Z. Sun , Vinith Misra , Vivek K Goyal

Quantization using Compressive Sensing

The problem of compressing a real-valued sparse source using compressive sensing techniques is studied. The rate distortion optimality of a coding scheme in which compressively sensed signals are quantized and then reconstructed is…

Information Theory · Computer Science 2010-11-09 Rajiv Soundararajan , Sriram Vishwanath

Stochastic Approximation Methods for Distortion Risk Measure Optimization

Distortion Risk Measures (DRMs) capture risk preferences in decision-making and serve as general criteria for managing uncertainty. This paper proposes gradient descent algorithms for DRM optimization based on two dual representations: the…

Machine Learning · Computer Science 2025-10-07 Jinyang Jiang , Bernd Heidergott , Jiaqiao Hu , Yijie Peng

Benchmarking the Reliability of Post-training Quantization: a Particular Focus on Worst-case Performance

Post-training quantization (PTQ) is a popular method for compressing deep neural networks (DNNs) without modifying their original architecture or training procedures. Despite its effectiveness and convenience, the reliability of PTQ methods…

Machine Learning · Computer Science 2023-03-24 Zhihang Yuan , Jiawei Liu , Jiaxiang Wu , Dawei Yang , Qiang Wu , Guangyu Sun , Wenyu Liu , Xinggang Wang , Bingzhe Wu

End-to-End Supervised Product Quantization for Image Search and Retrieval

Product Quantization, a dictionary based hashing method, is one of the leading unsupervised hashing techniques. While it ignores the labels, it harnesses the features to construct look up tables that can approximate the feature space. In…

Computer Vision and Pattern Recognition · Computer Science 2020-01-22 Benjamin Klein , Lior Wolf

Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks

Hardware-friendly network quantization (e.g., binary/uniform quantization) can efficiently accelerate the inference and meanwhile reduce memory consumption of the deep neural networks, which is crucial for model deployment on…

Computer Vision and Pattern Recognition · Computer Science 2019-08-15 Ruihao Gong , Xianglong Liu , Shenghu Jiang , Tianxiang Li , Peng Hu , Jiazhen Lin , Fengwei Yu , Junjie Yan

Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution

Model quantization is challenging due to many tedious hyper-parameters such as precision (bitwidth), dynamic range (minimum and maximum discrete values) and stepsize (interval between discrete values). Unlike prior arts that carefully tune…

Machine Learning · Computer Science 2021-07-08 Zhang Zhaoyang , Shao Wenqi , Gu Jinwei , Wang Xiaogang , Luo Ping

Adaptive Dataset Quantization

Contemporary deep learning, characterized by the training of cumbersome neural networks on massive datasets, confronts substantial computational hurdles. To alleviate heavy data storage burdens on limited hardware resources, numerous…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Muquan Li , Dongyang Zhang , Qiang Dong , Xiurui Xie , Ke Qin