Related papers: Lossy Image Compression with Conditional Diffusion…

Lossy Image Compression with Foundation Diffusion Models

Incorporating diffusion models in the image compression domain has the potential to produce realistic and detailed reconstructions, especially at extremely low bitrates. Previous methods focus on using diffusion models as expressive…

Image and Video Processing · Electrical Eng. & Systems 2024-10-10 Lucas Relic , Roberto Azevedo , Markus Gross , Christopher Schroers

Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior

Image compression at extremely low bitrates (below 0.1 bits per pixel (bpp)) is a significant challenge due to substantial information loss. In this work, we propose a novel two-stage extreme image compression framework that exploits the…

Image and Video Processing · Electrical Eng. & Systems 2024-09-05 Zhiyuan Li , Yanhui Zhou , Hao Wei , Chenyang Ge , Jingwen Jiang

Lossy Image Compression with Quantized Hierarchical VAEs

Recent research has shown a strong theoretical connection between variational autoencoders (VAEs) and the rate-distortion theory. Motivated by this, we consider the problem of lossy image compression from the perspective of generative…

Image and Video Processing · Electrical Eng. & Systems 2023-03-28 Zhihao Duan , Ming Lu , Zhan Ma , Fengqing Zhu

Higher fidelity perceptual image and video compression with a latent conditioned residual denoising diffusion model

Denoising diffusion models achieved impressive results on several image generation tasks often outperforming GAN based models. Recently, the generative capabilities of diffusion models have been employed for perceptual image compression,…

Image and Video Processing · Electrical Eng. & Systems 2025-05-20 Jonas Brenig , Radu Timofte

Lossy Compression with Gaussian Diffusion

We consider a novel lossy compression approach based on unconditional diffusion generative models, which we call DiffC. Unlike modern compression schemes which rely on transform coding and quantization to restrict the transmitted…

Machine Learning · Statistics 2023-01-03 Lucas Theis , Tim Salimans , Matthew D. Hoffman , Fabian Mentzer

Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders

Learned image compression codecs have recently achieved impressive compression performances surpassing the most efficient image coding architectures. However, most approaches are trained to minimize rate and distortion which often leads to…

Computer Vision and Pattern Recognition · Computer Science 2024-03-06 Daniele Mari , Simone Milani

Advances in Diffusion-Based Generative Compression

Popularized by their strong image generation performance, diffusion and related methods for generative modeling have found widespread success in visual media applications. In particular, diffusion methods have enabled new approaches to data…

Image and Video Processing · Electrical Eng. & Systems 2026-01-28 Yibo Yang , Stephan Mandt

Extreme Video Compression with Pre-trained Diffusion Models

Diffusion models have achieved remarkable success in generating high quality image and video data. More recently, they have also been used for image compression with high perceptual quality. In this paper, we present a novel approach to…

Image and Video Processing · Electrical Eng. & Systems 2024-02-15 Bohan Li , Yiming Liu , Xueyan Niu , Bo Bai , Lei Deng , Deniz Gündüz

Generative Latent Diffusion for Efficient Spatiotemporal Data Reduction

Generative models have demonstrated strong performance in conditional settings and can be viewed as a form of data compression, where the condition serves as a compact representation. However, their limited controllability and…

Machine Learning · Computer Science 2025-07-04 Xiao Li , Liangji Zhu , Anand Rangarajan , Sanjay Ranka

Generative Image Coding with Diffusion Prior

As generative technologies advance, visual content has evolved into a complex mix of natural and AI-generated images, driving the need for more efficient coding techniques that prioritize perceptual quality. Traditional codecs and learned…

Computer Vision and Pattern Recognition · Computer Science 2025-09-18 Jianhui Chang

Generative Image Compression by Estimating Gradients of the Rate-variable Feature Distribution

While learned image compression (LIC) focuses on efficient data transmission, generative image compression (GIC) extends this framework by integrating generative modeling to produce photo-realistic reconstructed images. In this paper, we…

Image and Video Processing · Electrical Eng. & Systems 2025-05-28 Minghao Han , Weiyi You , Jinhua Zhang , Leheng Zhang , Ce Zhu , Shuhang Gu

Variational Diffusion Auto-encoder: Latent Space Extraction from Pre-trained Diffusion Models

As a widely recognized approach to deep generative modeling, Variational Auto-Encoders (VAEs) still face challenges with the quality of generated images, often presenting noticeable blurriness. This issue stems from the unrealistic…

Machine Learning · Computer Science 2023-05-22 Georgios Batzolis , Jan Stanczuk , Carola-Bibiane Schönlieb

Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis

While replacing Gaussian decoders with a conditional diffusion model enhances the perceptual quality of reconstructions in neural image compression, their lack of inductive bias for image data restricts their ability to achieve…

Image and Video Processing · Electrical Eng. & Systems 2024-03-26 Atefeh Khoshkhahtinat , Ali Zafari , Piyush M. Mehta , Nasser M. Nasrabadi

Improving Inference for Neural Image Compression

We consider the problem of lossy image compression with deep latent variable models. State-of-the-art methods build on hierarchical variational autoencoders (VAEs) and learn inference networks to predict a compressible latent representation…

Image and Video Processing · Electrical Eng. & Systems 2021-01-11 Yibo Yang , Robert Bamler , Stephan Mandt

Region-Adaptive Generative Compression with Spatially Varying Diffusion Models

Generative image codecs aim to optimize perceptual quality, producing realistic and detailed reconstructions. However, they often overlook a key property of human vision: our tendency to focus on particular aspects of a visual scene (e.g.,…

Image and Video Processing · Electrical Eng. & Systems 2026-04-02 Lucas Relic , Roberto Azevedo , Yang Zhang , Stephan Mandt , Markus Gross , Christopher Schroers

CADC: Content Adaptive Diffusion-Based Generative Image Compression

Diffusion-based generative image compression has demonstrated remarkable potential for achieving realistic reconstruction at ultra-low bitrates. The key to unlocking this potential lies in making the entire compression process…

Computer Vision and Pattern Recognition · Computer Science 2026-03-26 Xihua Sheng , Lingyu Zhu , Tianyu Zhang , Dong Liu , Shiqi Wang , Jing Wang

Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder

The images produced by diffusion models can attain excellent perceptual quality. However, it is challenging for diffusion models to guarantee distortion, hence the integration of diffusion models and image compression models still needs…

Image and Video Processing · Electrical Eng. & Systems 2024-05-03 Yiyang Ma , Wenhan Yang , Jiaying Liu

DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents

Diffusion probabilistic models have been shown to generate state-of-the-art results on several competitive image synthesis benchmarks but lack a low-dimensional, interpretable latent space, and are slow at generation. On the other hand,…

Machine Learning · Computer Science 2022-11-30 Kushagra Pandey , Avideep Mukherjee , Piyush Rai , Abhishek Kumar

Edge-based Denoising Image Compression

In recent years, deep learning-based image compression, particularly through generative models, has emerged as a pivotal area of research. Despite significant advancements, challenges such as diminished sharpness and quality in…

Image and Video Processing · Electrical Eng. & Systems 2024-09-18 Ryugo Morita , Hitoshi Nishimura , Ko Watanabe , Andreas Dengel , Jinjia Zhou

Rethinking Video Tokenization: A Conditioned Diffusion-based Approach

Existing video tokenizers typically use the traditional Variational Autoencoder (VAE) architecture for video compression and reconstruction. However, to achieve good performance, its training process often relies on complex multi-stage…

Computer Vision and Pattern Recognition · Computer Science 2025-03-28 Nianzu Yang , Pandeng Li , Liming Zhao , Yang Li , Chen-Wei Xie , Yehui Tang , Xudong Lu , Zhihang Liu , Yun Zheng , Yu Liu , Junchi Yan