Related papers: Preprocessing Enhanced Image Compression for Machi…

A Preprocessing Framework for Video Machine Vision under Compression

There has been a growing trend in compressing and transmitting videos from terminals for machine vision tasks. Nevertheless, most video coding optimization method focus on minimizing distortion according to human perceptual metrics,…

Multimedia · Computer Science 2025-12-18 Fei Zhao , Mengxi Guo , Shijie Zhao , Junlin Li , Li Zhang , Xiaodong Xie

End-to-end optimized image compression for multiple machine tasks

An increasing share of captured images and videos are transmitted for storage and remote analysis by computer vision algorithms, rather than to be viewed by humans. Contrary to traditional standard codecs with engineered tools, neural…

Computer Vision and Pattern Recognition · Computer Science 2021-03-09 Lahiru D. Chamain , Fabien Racapé , Jean Bégaint , Akshay Pushparaja , Simon Feltman

Learning-Based Image Compression for Machines

While learning based compression techniques for images have outperformed traditional methods, they have not been widely adopted in machine learning pipelines. This is largely due to lack of standardization and lack of retention of salient…

Image and Video Processing · Electrical Eng. & Systems 2024-10-01 Kartik Gupta , Kimberley Faria , Vikas Mehta

Learned Image Compression for Machine Perception

Recent work has shown that learned image compression strategies can outperform standard hand-crafted compression algorithms that have been developed over decades of intensive research on the rate-distortion trade-off. With growing…

Image and Video Processing · Electrical Eng. & Systems 2021-11-04 Felipe Codevilla , Jean Gabriel Simard , Ross Goroshin , Chris Pal

Progressive Learned Image Compression for Machine Perception

Recent advances in learned image codecs have been extended from human perception toward machine perception. However, progressive image compression with fine granular scalability (FGS)-which enables decoding a single bitstream at multiple…

Computer Vision and Pattern Recognition · Computer Science 2025-12-24 Jungwoo Kim , Jun-Hyuk Kim , Jong-Seok Lee

Prompt-Guided Prefiltering for VLM Image Compression

The rapid progress of large Vision-Language Models (VLMs) has enabled a wide range of applications, such as image understanding and Visual Question Answering (VQA). Query images are often uploaded to the cloud, where VLMs are typically…

Image and Video Processing · Electrical Eng. & Systems 2026-04-02 Bardia Azizian , Ivan V. Bajic

End-to-end optimized image compression for machines, a study

An increasing share of image and video content is analyzed by machines rather than viewed by humans, and therefore it becomes relevant to optimize codecs for such applications where the analysis is performed remotely. Unfortunately,…

Image and Video Processing · Electrical Eng. & Systems 2020-11-13 Lahiru D. Chamain , Fabien Racapé , Jean Bégaint , Akshay Pushparaja , Simon Feltman

Supervised Compression for Resource-Constrained Edge Computing Systems

There has been much interest in deploying deep learning algorithms on low-powered devices, including smartphones, drones, and medical sensors. However, full-scale deep neural networks are often too resource-intensive in terms of energy and…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Yoshitomo Matsubara , Ruihan Yang , Marco Levorato , Stephan Mandt

Efficient Learned Image Compression Through Knowledge Distillation

Learned image compression sits at the intersection of machine learning and image processing. With advances in deep learning, neural network-based compression methods have emerged. In this process, an encoder maps the image to a…

Computer Vision and Pattern Recognition · Computer Science 2025-09-15 Fabien Allemand , Attilio Fiandrotti , Sumanta Chaudhuri , Alaa Eddine Mazouz

Multi-Modality Deep Network for Extreme Learned Image Compression

Image-based single-modality compression learning approaches have demonstrated exceptionally powerful encoding and decoding capabilities in the past few years , but suffer from blur and severe semantics loss at extremely low bitrates. To…

Image and Video Processing · Electrical Eng. & Systems 2023-04-27 Xuhao Jiang , Weimin Tan , Tian Tan , Bo Yan , Liquan Shen

Recognition-Aware Learned Image Compression

Learned image compression methods generally optimize a rate-distortion loss, trading off improvements in visual distortion for added bitrate. Increasingly, however, compressed imagery is used as an input to deep learning networks for…

Image and Video Processing · Electrical Eng. & Systems 2022-02-02 Maxime Kawawa-Beaudan , Ryan Roggenkemper , Avideh Zakhor

Learning a Virtual Codec Based on Deep Convolutional Neural Network to Compress Image

Although deep convolutional neural network has been proved to efficiently eliminate coding artifacts caused by the coarse quantization of traditional codec, it's difficult to train any neural network in front of the encoder for gradient's…

Computer Vision and Pattern Recognition · Computer Science 2018-01-17 Lijun Zhao , Huihui Bai , Anhong Wang , Yao Zhao

Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers

We propose sandwiching standard image and video codecs between pre- and post-processing neural networks. The networks are jointly trained through a differentiable codec proxy to minimize a given rate-distortion loss. This sandwich…

Image and Video Processing · Electrical Eng. & Systems 2025-02-24 Onur G. Guleryuz , Philip A. Chou , Berivan Isik , Hugues Hoppe , Danhang Tang , Ruofei Du , Jonathan Taylor , Philip Davidson , Sean Fanello

How to Exploit the Transferability of Learned Image Compression to Conventional Codecs

Lossy image compression is often limited by the simplicity of the chosen loss measure. Recent research suggests that generative adversarial networks have the ability to overcome this limitation and serve as a multi-modal loss, especially…

Image and Video Processing · Electrical Eng. & Systems 2021-03-09 Jan P. Klopp , Keng-Chi Liu , Liang-Gee Chen , Shao-Yi Chien

Towards Backward-Compatible Continual Learning of Image Compression

This paper explores the possibility of extending the capability of pre-trained neural image compressors (e.g., adapting to new data or target bitrates) without breaking backward compatibility, the ability to decode bitstreams encoded by the…

Image and Video Processing · Electrical Eng. & Systems 2024-03-01 Zhihao Duan , Ming Lu , Justin Yang , Jiangpeng He , Zhan Ma , Fengqing Zhu

Guided Diffusion for the Extension of Machine Vision to Human Visual Perception

Image compression technology eliminates redundant information to enable efficient transmission and storage of images, serving both machine vision and human visual perception. For years, image coding focused on human perception has been…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Takahiro Shindo , Yui Tatsumi , Taiju Watanabe , Hiroshi Watanabe

Efficient Progressive Image Compression with Variance-aware Masking

Learned progressive image compression is gaining momentum as it allows improved image reconstruction as more bits are decoded at the receiver. We propose a progressive image compression method in which an image is first represented as a…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Alberto Presta , Enzo Tartaglione , Attilio Fiandrotti , Marco Grangetto , Pamela Cosman

Generative Image Coding with Diffusion Prior

As generative technologies advance, visual content has evolved into a complex mix of natural and AI-generated images, driving the need for more efficient coding techniques that prioritize perceptual quality. Traditional codecs and learned…

Computer Vision and Pattern Recognition · Computer Science 2025-09-18 Jianhui Chang

Perceptually Optimizing Deep Image Compression

Mean squared error (MSE) and $\ell_p$ norms have largely dominated the measurement of loss in neural networks due to their simplicity and analytical properties. However, when used to assess visual information loss, these simple norms are…

Image and Video Processing · Electrical Eng. & Systems 2020-07-13 Li-Heng Chen , Christos G. Bampis , Zhi Li , Andrey Norkin , Alan C. Bovik

Refining Coded Image in Human Vision Layer Using CNN-Based Post-Processing

Scalable image coding for both humans and machines is a technique that has gained a lot of attention recently. This technology enables the hierarchical decoding of images for human vision and image recognition models. It is a highly…

Computer Vision and Pattern Recognition · Computer Science 2024-06-18 Takahiro Shindo , Yui Tatsumi , Taiju Watanabe , Hiroshi Watanabe