Related papers: End-to-end optimized image compression for multipl…

End-to-end optimized image compression for machines, a study

An increasing share of image and video content is analyzed by machines rather than viewed by humans, and therefore it becomes relevant to optimize codecs for such applications where the analysis is performed remotely. Unfortunately,…

Image and Video Processing · Electrical Eng. & Systems 2020-11-13 Lahiru D. Chamain, Fabien Racapé, Jean Bégaint, Akshay Pushparaja, Simon Feltman

Preprocessing Enhanced Image Compression for Machine Vision

Recently, more and more images are compressed and sent to the back-end devices for the machine analysis tasks (e.g., object detection) instead of being purely watched by humans. However, most traditional or learned image codecs are…

Image and Video Processing · Electrical Eng. & Systems 2022-06-14 Guo Lu, Xingtong Ge, Tianxiong Zhong, Jing Geng, Qiang Hu

End-to-end Compression Towards Machine Vision: Network Architecture Design and Optimization

Visual signal compression is a long-standing problem. Fueled by the recent advances of deep learning, exciting progress has been made. Despite better compression performance, existing end-to-end compression algorithms are still designed…

Computer Vision and Pattern Recognition · Computer Science 2021-11-22 Shurun Wang, Zhao Wang, Shiqi Wang, Yan Ye

Image coding for machines: an end-to-end learned approach

Over recent years, deep learning-based computer vision systems have been applied to images at an ever-increasing pace, oftentimes representing the only type of consumption for those images. Given the dramatic explosion in the number of…

Computer Vision and Pattern Recognition · Computer Science 2021-08-31 Nam Le, Honglei Zhang, Francesco Cricri, Ramin Ghaznavi-Youvalari, Esa Rahtu

A Preprocessing Framework for Video Machine Vision under Compression

There has been a growing trend in compressing and transmitting videos from terminals for machine vision tasks. Nevertheless, most video coding optimization methods focus on minimizing distortion according to human perceptual metrics,…

Multimedia · Computer Science 2025-12-18 Fei Zhao, Mengxi Guo, Shijie Zhao, Junlin Li, Li Zhang, Xiaodong Xie

End-to-End Semantic Preservation in Text-Aware Image Compression Systems

Traditional image compression methods aim to reconstruct images for human perception, prioritizing visual fidelity over task relevance. In contrast, Coding for Machines focuses on preserving information essential for automated…

Image and Video Processing · Electrical Eng. & Systems 2025-10-16 Stefano Della Fiore, Alessandro Gnutti, Marco Dalai, Pierangelo Migliorati, Riccardo Leonardi

Learned Image Compression for Machine Perception

Recent work has shown that learned image compression strategies can outperform standard hand-crafted compression algorithms that have been developed over decades of intensive research on the rate-distortion trade-off. With growing…

Image and Video Processing · Electrical Eng. & Systems 2021-11-04 Felipe Codevilla, Jean Gabriel Simard, Ross Goroshin, Chris Pal

Content-oriented learned image compression

In recent years, with the development of deep neural networks, end-to-end optimized image compression has made significant progress and exceeded the classic methods in terms of rate-distortion performance. However, most learning-based image…

Computer Vision and Pattern Recognition · Computer Science 2022-08-02 Meng Li, Shangyin Gao, Yihui Feng, Yibo Shi, Jing Wang

End-to-End Image Compression with Probabilistic Decoding

Lossy image compression is a many-to-one process, thus one bitstream corresponds to multiple possible original images, especially at low bit rates. However, this nature was seldom considered in previous studies on image compression, which…

Image and Video Processing · Electrical Eng. & Systems 2021-10-01 Haichuan Ma, Dong Liu, Cunhui Dong, Li Li, Feng Wu

End-to-end Optimized Image Compression

We describe an image compression method, consisting of a nonlinear analysis transformation, a uniform quantizer, and a nonlinear synthesis transformation. The transforms are constructed in three successive stages of convolutional linear…

Computer Vision and Pattern Recognition · Computer Science 2020-07-28 Johannes Ballé, Valero Laparra, Eero P. Simoncelli

Learning End-to-End Lossy Image Compression: A Benchmark

Image compression is one of the most fundamental techniques and commonly used applications in the image and video processing field. Earlier methods built a well-designed pipeline, and efforts were made to improve all modules of the pipeline…

Image and Video Processing · Electrical Eng. & Systems 2021-03-29 Yueyu Hu, Wenhan Yang, Zhan Ma, Jiaying Liu

An End-to-End Compression Framework Based on Convolutional Neural Networks

Deep learning, e.g., convolutional neural networks (CNNs), has achieved great success in image processing and computer vision especially in high level vision applications such as recognition and understanding. However, it is rarely used to…

Computer Vision and Pattern Recognition · Computer Science 2017-08-03 Feng Jiang, Wen Tao, Shaohui Liu, Jie Ren, Xun Guo, Debin Zhao

Streaming-capable High-performance Architecture of Learned Image Compression Codecs

Learned image compression allows achieving state-of-the-art accuracy and compression ratios, but their relatively slow runtime performance limits their usage. While previous attempts on optimizing learned image codecs focused more on the…

Image and Video Processing · Electrical Eng. & Systems 2022-08-04 Fangzheng Lin, Heming Sun, Jiro Katto

Neural JPEG: End-to-End Image Compression Leveraging a Standard JPEG Encoder-Decoder

Recent advances in deep learning have led to superhuman performance across a variety of applications. Recently, these methods have been successfully employed to improve the rate-distortion performance in the task of image compression.…

Image and Video Processing · Electrical Eng. & Systems 2022-02-01 Ankur Mali, Alexander Ororbia, Daniel Kifer, Lee Giles

End-to-End Learning for Video Frame Compression with Self-Attention

One of the core components of conventional (i.e., non-learned) video codecs consists of predicting a frame from a previously-decoded frame, by leveraging temporal correlations. In this paper, we propose an end-to-end learned system for…

Image and Video Processing · Electrical Eng. & Systems 2020-04-22 Nannan Zou, Honglei Zhang, Francesco Cricri, Hamed R. Tavakoli, Jani Lainema, Emre Aksu, Miska Hannuksela, Esa Rahtu

Ultra-Efficient Decoding for End-to-End Neural Compression and Reconstruction

Image compression and reconstruction are crucial for various digital applications. While contemporary neural compression methods achieve impressive compression rates, the adoption of such technology has been largely hindered by the…

Machine Learning · Computer Science 2025-10-06 Ethan G. Rogers, Cheng Wang

End-to-End Optimized Image Compression with the Frequency-Oriented Transform

Image compression constitutes a significant challenge amidst the era of information explosion. Recent studies employing deep learning methods have demonstrated the superior performance of learning-based image compression methods over…

Computer Vision and Pattern Recognition · Computer Science 2024-05-07 Yuefeng Zhang, Kai Lin

End-to-End Learned Block-Based Image Compression with Block-Level Masked Convolutions and Asymptotic Closed Loop Training

Learned image compression research has achieved state-of-the-art compression performance with auto-encoder based neural network architectures, where the image is mapped via convolutional neural networks (CNN) into a latent representation…

Image and Video Processing · Electrical Eng. & Systems 2022-03-23 Fatih Kamisli

End-to-End Learnable Multi-Scale Feature Compression for VCM

The proliferation of deep learning-based machine vision applications has given rise to a new type of compression, so called video coding for machine (VCM). VCM differs from traditional video coding in that it is optimized for machine vision…

Computer Vision and Pattern Recognition · Computer Science 2023-08-09 Yeongwoong Kim, Hyewon Jeong, Janghyun Yu, Younhee Kim, Jooyoung Lee, Se Yoon Jeong, Hui Yong Kim

How to Exploit the Transferability of Learned Image Compression to Conventional Codecs

Lossy image compression is often limited by the simplicity of the chosen loss measure. Recent research suggests that generative adversarial networks have the ability to overcome this limitation and serve as a multi-modal loss, especially…

Image and Video Processing · Electrical Eng. & Systems 2021-03-09 Jan P. Klopp, Keng-Chi Liu, Liang-Gee Chen, Shao-Yi Chien