Related papers: A 1Mb mixed-precision quantized encoder for image …

Mixed-Precision Quantized Neural Network with Progressively Decreasing Bitwidth For Image Classification and Object Detection

Efficient model inference is an important and practical issue in the deployment of deep neural network on resource constraint platforms. Network quantization addresses this problem effectively by leveraging low-bit representation and…

Computer Vision and Pattern Recognition · Computer Science 2020-01-01 Tianshu Chu , Qin Luo , Jie Yang , Xiaolin Huang

Picking Up Quantization Steps for Compressed Image Classification

The sensitivity of deep neural networks to compressed images hinders their usage in many real applications, which means classification networks may fail just after taking a screenshot and saving it as a compressed file. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2023-04-24 Li Ma , Peixi Peng , Guangyao Chen , Yifan Zhao , Siwei Dong , Yonghong Tian

Efficient Neural Compression with Inference-time Decoding

This paper explores the combination of neural network quantization and entropy coding for memory footprint minimization. Edge deployment of quantized models is hampered by the harsh Pareto frontier of the accuracy-to-bitwidth tradeoff,…

Machine Learning · Computer Science 2024-06-11 C. Metz , O. Bichler , A. Dupret

Comprehensive Comparisons of Uniform Quantization in Deep Image Compression

In deep image compression, uniform quantization is applied to latent representations obtained by using an auto-encoder architecture for reducing bits and entropy coding. Quantization is a problem encountered in the end-to-end training of…

Image and Video Processing · Electrical Eng. & Systems 2023-03-02 Koki Tsubota , Kiyoharu Aizawa

End-to-End Learned Image Compression with Quantized Weights and Activations

End-to-end Learned image compression (LIC) has reached the traditional hand-crafted methods such as BPG (HEVC intra) in terms of the coding gain. However, the large network size prohibits the usage of LIC on resource-limited embedded…

Image and Video Processing · Electrical Eng. & Systems 2021-11-19 Heming Sun , Lu Yu , Jiro Katto

Multi-Precision Quantized Neural Networks via Encoding Decomposition of -1 and +1

The training of deep neural networks (DNNs) requires intensive resources both for computation and for storage performance. Thus, DNNs cannot be efficiently applied to mobile phones and embedded devices, which seriously limits their…

Computer Vision and Pattern Recognition · Computer Science 2019-06-03 Qigong Sun , Fanhua Shang , Kang Yang , Xiufang Li , Yan Ren , Licheng Jiao

Quantization of Deep Neural Networks for Accumulator-constrained Processors

We introduce an Artificial Neural Network (ANN) quantization methodology for platforms without wide accumulation registers. This enables fixed-point model deployment on embedded compute platforms that are not specifically designed for large…

Computer Vision and Pattern Recognition · Computer Science 2020-04-27 Barry de Bruin , Zoran Zivkovic , Henk Corporaal

Ultra-Low Bitrate Perceptual Image Compression with Shallow Encoder

Ultra-low bitrate image compression (below 0.05 bits per pixel) is increasingly critical for bandwidth-constrained and computation-limited encoding scenarios such as edge devices. Existing frameworks typically rely on large pretrained…

Computer Vision and Pattern Recognition · Computer Science 2026-03-12 Tianyu Zhang , Dong Liu , Chang Wen Chen

Quantized Neural Networks via {-1, +1} Encoding Decomposition and Acceleration

The training of deep neural networks (DNNs) always requires intensive resources for both computation and data storage. Thus, DNNs cannot be efficiently applied to mobile phones and embedded devices, which severely limits their applicability…

Computer Vision and Pattern Recognition · Computer Science 2021-06-21 Qigong Sun , Xiufang Li , Fanhua Shang , Hongying Liu , Kang Yang , Licheng Jiao , Zhouchen Lin

Device Interoperability for Learned Image Compression with Weights and Activations Quantization

Learning-based image compression has improved to a level where it can outperform traditional image codecs such as HEVC and VVC in terms of coding performance. In addition to good compression performance, device interoperability is essential…

Image and Video Processing · Electrical Eng. & Systems 2022-12-05 Esin Koyuncu , Timofey Solovyev , Elena Alshina , André Kaup

A Comprehensive Survey on Model Quantization for Deep Neural Networks in Image Classification

Recent advancements in machine learning achieved by Deep Neural Networks (DNNs) have been significant. While demonstrating high accuracy, DNNs are associated with a huge number of parameters and computations, which leads to high memory…

Machine Learning · Computer Science 2023-12-20 Babak Rokh , Ali Azarpeyvand , Alireza Khanteymoori

Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware

Convolutional Neural Networks (CNN) has become more popular choice for various tasks such as computer vision, speech recognition and natural language processing. Thanks to their large computational capability and throughput, GPUs ,which are…

Machine Learning · Computer Science 2018-11-28 Natan Liss , Chaim Baskin , Avi Mendelson , Alex M. Bronstein , Raja Giryes

Quantized Decoder in Learned Image Compression for Deterministic Reconstruction

Learned image compression has a problem of non-bit-exact reconstruction due to different calculations of floating point arithmetic on different devices. This paper shows a method to achieve a deterministic reconstructed image by quantizing…

Image and Video Processing · Electrical Eng. & Systems 2024-01-12 Esin Koyuncu , Timofey Solovyev , Johannes Sauer , Elena Alshina , André Kaup

AMED: Automatic Mixed-Precision Quantization for Edge Devices

Quantized neural networks are well known for reducing the latency, power consumption, and model size without significant harm to the performance. This makes them highly appropriate for systems with limited resources and low power capacity.…

Machine Learning · Computer Science 2024-06-11 Moshe Kimhi , Tal Rozen , Avi Mendelson , Chaim Baskin

Channel-wise Mixed-precision Assignment for DNN Inference on Constrained Edge Nodes

Quantization is widely employed in both cloud and edge systems to reduce the memory occupation, latency, and energy consumption of deep neural networks. In particular, mixed-precision quantization, i.e., the use of different bit-widths for…

Machine Learning · Computer Science 2023-01-26 Matteo Risso , Alessio Burrello , Luca Benini , Enrico Macii , Massimo Poncino , Daniele Jahier Pagliari

Learning Convolutional Networks for Content-weighted Image Compression

Lossy image compression is generally formulated as a joint rate-distortion optimization to learn encoder, quantizer, and decoder. However, the quantizer is non-differentiable, and discrete entropy estimation usually is required for rate…

Computer Vision and Pattern Recognition · Computer Science 2017-09-20 Mu Li , Wangmeng Zuo , Shuhang Gu , Debin Zhao , David Zhang

Coarse-to-Fine: Progressive Image Compression for Semantically Hierarchical Classification

Recent advances in learned image compression (LIC) have enabled practical deployments, spurring active research into image compression for machines and progressive coding schemes. However, their integration remains under-explored: prior…

Image and Video Processing · Electrical Eng. & Systems 2026-05-12 Jungwoo Kim , Jun-Hyuk Kim , Jong-Seok Lee

Enhanced Standard Compatible Image Compression Framework based on Auxiliary Codec Networks

To enhance image compression performance, recent deep neural network-based research can be divided into three categories: a learnable codec, a postprocessing network, and a compact representation network. The learnable codec has been…

Image and Video Processing · Electrical Eng. & Systems 2021-12-21 Hanbin Son , Taeoh Kim , Hyeongmin Lee , Sangyoun Lee

Fixed-point Quantization of Convolutional Neural Networks for Quantized Inference on Embedded Platforms

Convolutional Neural Networks (CNNs) have proven to be a powerful state-of-the-art method for image classification tasks. One drawback however is the high computational complexity and high memory consumption of CNNs which makes them…

Computer Vision and Pattern Recognition · Computer Science 2021-02-04 Rishabh Goyal , Joaquin Vanschoren , Victor van Acht , Stephan Nijssen

Q-LIC: Quantizing Learned Image Compression with Channel Splitting

Learned image compression (LIC) has reached a comparable coding gain with traditional hand-crafted methods such as VVC intra. However, the large network complexity prohibits the usage of LIC on resource-limited embedded systems. Network…

Image and Video Processing · Electrical Eng. & Systems 2022-05-31 Heming Sun , Lu Yu , Jiro Katto