Related papers: Deep Learning-based Image Compression with Trellis…

Comprehensive Comparisons of Uniform Quantization in Deep Image Compression

In deep image compression, uniform quantization is applied to latent representations obtained by using an auto-encoder architecture for reducing bits and entropy coding. Quantization is a problem encountered in the end-to-end training of…

Image and Video Processing · Electrical Eng. & Systems 2023-03-02 Koki Tsubota , Kiyoharu Aizawa

Optimizing Learned Image Compression on Scalar and Entropy-Constraint Quantization

The continuous improvements on image compression with variational autoencoders have lead to learned codecs competitive with conventional approaches in terms of rate-distortion efficiency. Nonetheless, taking the quantization into account…

Machine Learning · Computer Science 2025-06-11 Florian Borzechowski , Michael Schäfer , Heiko Schwarz , Jonathan Pfaff , Detlev Marpe , Thomas Wiegand

DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression

Scalable coding, which can adapt to channel bandwidth variation, performs well in today's complex network environment. However, most existing scalable compression methods face two challenges: reduced compression performance and insufficient…

Image and Video Processing · Electrical Eng. & Systems 2024-12-03 Yongqi Zhai , Yi Ma , Luyang Tang , Wei Jiang , Ronggang Wang

Deep Triplet Quantization

Deep hashing establishes efficient and effective image retrieval by end-to-end learning of deep representations and hash codes from similarity data. We present a compact coding solution, focusing on deep learning to quantization approach…

Computer Vision and Pattern Recognition · Computer Science 2019-02-04 Bin Liu , Yue Cao , Mingsheng Long , Jianmin Wang , Jingdong Wang

Deep Spherical Quantization for Image Search

Hashing methods, which encode high-dimensional images with compact discrete codes, have been widely applied to enhance large-scale image retrieval. In this paper, we put forward Deep Spherical Quantization (DSQ), a novel method to make deep…

Computer Vision and Pattern Recognition · Computer Science 2019-06-10 Sepehr Eghbali , Ladan Tahvildari

Deep Neural Network Compression with Single and Multiple Level Quantization

Network quantization is an effective solution to compress deep neural networks for practical usage. Existing network quantization methods cannot sufficiently exploit the depth information to generate low-bit compressed network. In this…

Machine Learning · Computer Science 2018-12-18 Yuhui Xu , Yongzhuang Wang , Aojun Zhou , Weiyao Lin , Hongkai Xiong

Implicit Feature Decoupling with Depthwise Quantization

Quantization has been applied to multiple domains in Deep Neural Networks (DNNs). We propose Depthwise Quantization (DQ) where $\textit{quantization}$ is applied to a decomposed sub-tensor along the $\textit{feature axis}$ of weak…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Iordanis Fostiropoulos , Barry Boehm

A multi-layer image representation using Regularized Residual Quantization: application to compression and denoising

A learning-based framework for representation of domain-specific images is proposed where joint compression and denoising can be done using a VQ-based multi-layer network. While it learns to compress the images from a training set, the…

Computer Vision and Pattern Recognition · Computer Science 2017-07-10 Sohrab Ferdowsi , Slava Voloshynovskiy , Dimche Kostadinov

Improved Hybrid Layered Image Compression using Deep Learning and Traditional Codecs

Recently deep learning-based methods have been applied in image compression and achieved many promising results. In this paper, we propose an improved hybrid layered image compression framework by combining deep learning and the traditional…

Image and Video Processing · Electrical Eng. & Systems 2022-06-22 Haisheng Fu , Feng Liang , Bo Lei , Nai Bian , Qian zhang , Mohammad Akbari , Jie Liang , Chengjie Tu

QTIP: Quantization with Trellises and Incoherence Processing

Post-training quantization (PTQ) reduces the memory footprint of LLMs by quantizing weights to low-precision datatypes. Since LLM inference is usually memory-bound, PTQ methods can improve inference throughput. Recent state-of-the-art PTQ…

Machine Learning · Computer Science 2025-06-19 Albert Tseng , Qingyao Sun , David Hou , Christopher De Sa

Dataset Color Quantization: A Training-Oriented Framework for Dataset-Level Compression

Large-scale image datasets are fundamental to deep learning, but their high storage demands pose challenges for deployment in resource-constrained environments. While existing approaches reduce dataset size by discarding samples, they often…

Computer Vision and Pattern Recognition · Computer Science 2026-03-03 Chenyue Yu , Lingao Xiao , Jinhong Deng , Ivor W. Tsang , Yang He

Learnable Companding Quantization for Accurate Low-bit Neural Networks

Quantizing deep neural networks is an effective method for reducing memory consumption and improving inference speed, and is thus useful for implementation in resource-constrained devices. However, it is still hard for extremely low-bit…

Computer Vision and Pattern Recognition · Computer Science 2021-11-03 Kohei Yamamoto

Compress-and-Forward via Multilevel Coding and Trellis Coded Quantization

Compress-forward (CF) relays can improve communication rates even when the relay cannot decode the source signal. Efficient implementation of CF is a topic of contemporary interest, in part because of its potential impact on wireless…

Information Theory · Computer Science 2023-01-10 Heping Wan , Anders Host-Madsen , Aria Nosratinia

Deep Multiple Description Coding by Learning Scalar Quantization

In this paper, we propose a deep multiple description coding framework, whose quantizers are adaptively learned via the minimization of multiple description compressive loss. Firstly, our framework is built upon auto-encoder networks, which…

Multimedia · Computer Science 2019-02-07 Lijun Zhao , Huihui Bai , Anhong Wang , Yao Zhao

Beyond Product Quantization: Deep Progressive Quantization for Image Retrieval

Product Quantization (PQ) has long been a mainstream for generating an exponentially large codebook at very low memory/time cost. Despite its success, PQ is still tricky for the decomposition of high-dimensional vector space, and the…

Computer Vision and Pattern Recognition · Computer Science 2020-12-08 Lianli Gao , Xiaosu Zhu , Jingkuan Song , Zhou Zhao , Heng Tao Shen

Trained Quantization Thresholds for Accurate and Efficient Fixed-Point Inference of Deep Neural Networks

We propose a method of training quantization thresholds (TQT) for uniform symmetric quantizers using standard backpropagation and gradient descent. Contrary to prior work, we show that a careful analysis of the straight-through estimator…

Computer Vision and Pattern Recognition · Computer Science 2020-03-02 Sambhav R. Jain , Albert Gural , Michael Wu , Chris H. Dick

Supervised Encoding for Discrete Representation Learning

Classical supervised classification tasks search for a nonlinear mapping that maps each encoded feature directly to a probability mass over the labels. Such a learning framework typically lacks the intuition that encoded features from the…

Computer Vision and Pattern Recognition · Computer Science 2020-06-11 Cat P. Le , Yi Zhou , Jie Ding , Vahid Tarokh

LGQ: Learning Discretization Geometry for Scalable and Stable Image Tokenization

Discrete image tokenization is a key bottleneck for scalable visual generation: a tokenizer must remain compact for efficient latent-space priors while preserving semantic structure and using discrete capacity effectively. Existing…

Computer Vision and Pattern Recognition · Computer Science 2026-02-23 Idil Bilge Altun , Mert Onur Cakiroglu , Elham Buxton , Mehmet Dalkilic , Hasan Kurban

Neural Image Compression with Quantization Rectifier

Neural image compression has been shown to outperform traditional image codecs in terms of rate-distortion performance. However, quantization introduces errors in the compression process, which can degrade the quality of the compressed…

Machine Learning · Computer Science 2024-03-27 Wei Luo , Bo Chen

Trellis: Learning to Compress Key-Value Memory in Attention Models

Transformers, while powerful, suffer from quadratic computational complexity and the ever-growing Key-Value (KV) cache of the attention mechanism. This paper introduces Trellis, a novel Transformer architecture with bounded memory that…

Machine Learning · Computer Science 2026-01-01 Mahdi Karami , Ali Behrouz , Praneeth Kacham , Vahab Mirrokni