Related papers: CADC: Content Adaptive Diffusion-Based Generative …

Region-Adaptive Generative Compression with Spatially Varying Diffusion Models

Generative image codecs aim to optimize perceptual quality, producing realistic and detailed reconstructions. However, they often overlook a key property of human vision: our tendency to focus on particular aspects of a visual scene (e.g.,…

Image and Video Processing · Electrical Eng. & Systems 2026-04-02 Lucas Relic, Roberto Azevedo, Yang Zhang, Stephan Mandt, Markus Gross, Christopher Schroers

Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression

Generative neural image compression supports data representation at extremely low bitrate, synthesizing details at the client and consistently producing highly realistic images. By leveraging the similarities between quantization error and…

Image and Video Processing · Electrical Eng. & Systems 2025-04-04 Lucas Relic, Roberto Azevedo, Yang Zhang, Markus Gross, Christopher Schroers

Generative Image Coding with Diffusion Prior

As generative technologies advance, visual content has evolved into a complex mix of natural and AI-generated images, driving the need for more efficient coding techniques that prioritize perceptual quality. Traditional codecs and learned…

Computer Vision and Pattern Recognition · Computer Science 2025-09-18 Jianhui Chang

Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach

Traditional image codecs emphasize signal fidelity and human perception, often at the expense of machine vision tasks. Deep learning methods have demonstrated promising coding performance by utilizing rich semantic embeddings optimized for…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Sha Guo, Zhuo Chen, Yang Zhao, Ning Zhang, Xiaotong Li, Lingyu Duan

Lossy Image Compression with Foundation Diffusion Models

Incorporating diffusion models in the image compression domain has the potential to produce realistic and detailed reconstructions, especially at extremely low bitrates. Previous methods focus on using diffusion models as expressive…

Image and Video Processing · Electrical Eng. & Systems 2024-10-10 Lucas Relic, Roberto Azevedo, Markus Gross, Christopher Schroers

PICD: Versatile Perceptual Image Compression with Diffusion Rendering

Recently, perceptual image compression has achieved significant advancements, delivering high visual quality at low bitrates for natural images. However, for screen content, existing methods often produce noticeable artifacts when…

Computer Vision and Pattern Recognition · Computer Science 2025-05-12 Tongda Xu, Jiahao Li, Bin Li, Yan Wang, Ya-Qin Zhang, Yan Lu

Diffusion-aided Extreme Video Compression with Lightweight Semantics Guidance

Modern video codecs and learning-based approaches struggle for semantic reconstruction at extremely low bit-rates due to reliance on low-level spatiotemporal redundancies. Generative models, especially diffusion models, offer a new paradigm…

Image and Video Processing · Electrical Eng. & Systems 2026-02-06 Maojun Zhang, Haotian Wu, Richeng Jin, Deniz Gunduz, Krystian Mikolajczyk

Active Sampling for Ultra-Low-Bit-Rate Video Compression via Conditional Controlled Diffusion

Diffusion models provide a powerful generative prior for perceptual reconstruction at ultra-low bitrates, but effective video compression requires controlling the generative process using highly compact conditioning signals. In this work,…

Computer Vision and Pattern Recognition · Computer Science 2026-05-05 Amirhosein Javadi, Shirin Saeedi Bidokhti, Tara Javidi

Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior

Image compression at extremely low bitrates (below 0.1 bits per pixel (bpp)) is a significant challenge due to substantial information loss. In this work, we propose a novel two-stage extreme image compression framework that exploits the…

Image and Video Processing · Electrical Eng. & Systems 2024-09-05 Zhiyuan Li, Yanhui Zhou, Hao Wei, Chenyang Ge, Jingwen Jiang

Lossy Image Compression with Conditional Diffusion Models

This paper outlines an end-to-end optimized lossy image compression framework using diffusion generative models. The approach relies on the transform coding paradigm, where an image is mapped into a latent space for entropy coding and, from…

Image and Video Processing · Electrical Eng. & Systems 2024-01-03 Ruihan Yang, Stephan Mandt

Content Adaptive Latents and Decoder for Neural Image Compression

In recent years, neural image compression (NIC) algorithms have shown powerful coding performance. However, most of them are not adaptive to the image content. Although several content adaptive methods have been proposed by updating the…

Computer Vision and Pattern Recognition · Computer Science 2022-12-22 Guanbo Pan, Guo Lu, Zhihao Hu, Dong Xu

CoD: A Diffusion Foundation Model for Image Compression

Existing diffusion codecs typically build on text-to-image diffusion foundation models like Stable Diffusion. However, text conditioning is suboptimal from a compression perspective, hindering the potential of downstream diffusion codecs,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Zhaoyang Jia, Zihan Zheng, Naifu Xue, Jiahao Li, Bin Li, Zongyu Guo, Xiaoyi Zhang, Houqiang Li, Yan Lu

Extreme Generative Image Compression by Learning Text Embedding from Diffusion Models

Transferring large amounts of high-resolution images over limited bandwidth is an important but very challenging task. Compressing images using extremely low bitrates (<0.1 bpp) has been studied but it often results in low quality images of…

Image and Video Processing · Electrical Eng. & Systems 2022-11-16 Zhihong Pan, Xin Zhou, Hao Tian

Semantics-Guided Generative Image Compression

Advancements in text-to-image generative AI with large multimodal models are spreading into the field of image compression, creating high-quality representations of images at extremely low bit rates. This work introduces novel components to…

Image and Video Processing · Electrical Eng. & Systems 2025-06-02 Cheng-Lin Wu, Hyomin Choi, Ivan V. Bajić

CAT: Content-Adaptive Image Tokenization

Most existing image tokenizers encode images into a fixed number of tokens or patches, overlooking the inherent variability in image complexity. To address this, we introduce Content-Adaptive Tokenizer (CAT), which dynamically adjusts…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Junhong Shen, Kushal Tirumala, Michihiro Yasunaga, Ishan Misra, Luke Zettlemoyer, Lili Yu, Chunting Zhou

Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds

Stable diffusion networks have emerged as a groundbreaking development for their ability to produce realistic and detailed visual content. This characteristic renders them ideal decoders, capable of producing high-quality and aesthetically…

Computer Vision and Pattern Recognition · Computer Science 2024-08-21 Kai Liu, Kang You, Pan Gao

Guided Diffusion for the Extension of Machine Vision to Human Visual Perception

Image compression technology eliminates redundant information to enable efficient transmission and storage of images, serving both machine vision and human visual perception. For years, image coding focused on human perception has been…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Takahiro Shindo, Yui Tatsumi, Taiju Watanabe, Hiroshi Watanabe

Leveraging Diffusion Knowledge for Generative Image Compression with Fractal Frequency-Aware Band Learning

By optimizing the rate-distortion-realism trade-off, generative image compression approaches produce detailed, realistic images instead of the only sharp-looking reconstructions produced by rate-distortion-optimized models. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-17 Lingyu Zhu, Xiangrui Zeng, Bolin Chen, Peilin Chen, Yung-Hui Li, Shiqi Wang

Text-Guided Diffusion Model-based Generative Communication for Wireless Image Transmission

Reliable image transmission over wireless channels is particularly challenging at extremely low transmission rates, where conventional compression and channel coding schemes fail to preserve adequate visual quality. To address this issue,…

Information Theory · Computer Science 2025-10-27 Shengkang Chen, Tong Wu, Zhiyong Chen, Feng Yang, Meixia Tao, Wenjun Zhang

Lossy Compression with Gaussian Diffusion

We consider a novel lossy compression approach based on unconditional diffusion generative models, which we call DiffC. Unlike modern compression schemes which rely on transform coding and quantization to restrict the transmitted…

Machine Learning · Statistics 2023-01-03 Lucas Theis, Tim Salimans, Matthew D. Hoffman, Fabian Mentzer