Related papers: LowDiff: Efficient Diffusion Sampling with Low-Res…

BADiff: Bandwidth Adaptive Diffusion Model

In this work, we propose a novel framework to enable diffusion models to adapt their generation quality based on real-time network bandwidth constraints. Traditional diffusion models produce high-fidelity images by performing a fixed number…

Computer Vision and Pattern Recognition · Computer Science 2026-04-10 Xi Zhang, Hanwei Zhu, Yan Zhong, Jiamang Wang, Weisi Lin

Low-Light Image Enhancement with Wavelet-based Diffusion Models

Diffusion models have achieved promising results in image restoration tasks, yet suffer from time-consuming, excessive computational resource consumption, and unstable restoration. To address these issues, we propose a robust and efficient…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Hai Jiang, Ao Luo, Songchen Han, Haoqiang Fan, Shuaicheng Liu

Diffusion Models Beat GANs on Image Synthesis

We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For…

Machine Learning · Computer Science 2021-06-02 Prafulla Dhariwal, Alex Nichol

On Distillation of Guided Diffusion Models

Classifier-free guided diffusion models have recently been shown to be highly effective at high-resolution image generation, and they have been widely used in large-scale diffusion frameworks including DALLE-2, Stable Diffusion and Imagen.…

Computer Vision and Pattern Recognition · Computer Science 2023-04-14 Chenlin Meng, Robin Rombach, Ruiqi Gao, Diederik P. Kingma, Stefano Ermon, Jonathan Ho, Tim Salimans

DuoDiff: Accelerating Diffusion Models with a Dual-Backbone Approach

Diffusion models have achieved unprecedented performance in image generation, yet they suffer from slow inference due to their iterative sampling process. To address this, early-exiting has recently been proposed, where the depth of the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Daniel Gallo Fernández, Răzvan-Andrei Matişan, Alejandro Monroy Muñoz, Ana-Maria Vasilcoiu, Janusz Partyka, Tin Hadži Veljković, Metod Jazbec

PartDiff: Image Super-resolution with Partial Diffusion Models

Denoising diffusion probabilistic models (DDPMs) have achieved impressive performance on various image generation tasks, including image super-resolution. By learning to reverse the process of gradually diffusing the data distribution into…

Image and Video Processing · Electrical Eng. & Systems 2023-07-25 Kai Zhao, Alex Ling Yu Hung, Kaifeng Pang, Haoxin Zheng, Kyunghyun Sung

FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis

Denoising diffusion probabilistic models (DDPMs) have recently achieved leading performances in many generative tasks. However, the inherited iterative sampling process costs hindered their applications to speech synthesis. This paper…

Audio and Speech Processing · Electrical Eng. & Systems 2022-04-22 Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao

Progressive Distillation for Fast Sampling of Diffusion Models

Diffusion models have recently shown great promise for generative modeling, outperforming GANs on perceptual quality and autoregressive models at density estimation. A remaining downside is their slow sampling time: generating high quality…

Machine Learning · Computer Science 2022-06-08 Tim Salimans, Jonathan Ho

Cascaded Diffusion Models for High Fidelity Image Generation

We show that cascaded diffusion models are capable of generating high fidelity images on the class-conditional ImageNet generation benchmark, without any assistance from auxiliary image classifiers to boost sample quality. A cascaded…

Computer Vision and Pattern Recognition · Computer Science 2021-12-20 Jonathan Ho, Chitwan Saharia, William Chan, David J. Fleet, Mohammad Norouzi, Tim Salimans

FaSDiff: Balancing Perception and Semantics in Face Compression via Stable Diffusion Priors

With the increasing deployment of facial image data across a wide range of applications, efficient compression tailored to facial semantics has become critical for both storage and transmission. While recent learning-based face image…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Yimin Zhou, Yichong Xia, Bin Chen, Mingyao Hong, Jiawei Li, Zhi Wang, Yaowei Wang

Diffusion-based Light Field Synthesis

Light fields (LFs), conducive to comprehensive scene radiance recorded across angular dimensions, find wide applications in 3D reconstruction, virtual reality, and computational photography. However, the LF acquisition is inevitably…

Computer Vision and Pattern Recognition · Computer Science 2024-02-02 Ruisheng Gao, Yutong Liu, Zeyu Xiao, Zhiwei Xiong

LFSRDiff: Light Field Image Super-Resolution via Diffusion Models

Light field (LF) image super-resolution (SR) is a challenging problem due to its inherent ill-posed nature, where a single low-resolution (LR) input LF image can correspond to multiple potential super-resolved outcomes. Despite this…

Image and Video Processing · Electrical Eng. & Systems 2023-11-29 Wentao Chao, Fuqing Duan, Xuechun Wang, Yingqian Wang, Guanghui Wang

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning

Diffusion models have proven to be highly effective in generating high-quality images. However, adapting large pre-trained diffusion models to new domains remains an open challenge, which is critical for real-world applications. This paper…

Computer Vision and Pattern Recognition · Computer Science 2023-07-28 Enze Xie, Lewei Yao, Han Shi, Zhili Liu, Daquan Zhou, Zhaoqiang Liu, Jiawei Li, Zhenguo Li

Enhancing Diffusion Models for High-Quality Image Generation

This report presents the comprehensive implementation, evaluation, and optimization of Denoising Diffusion Probabilistic Models (DDPMs) and Denoising Diffusion Implicit Models (DDIMs), which are state-of-the-art generative models. During…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Jaineet Shah, Michael Gromis, Rickston Pinto

MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize

While diffusion-based generative models have made significant strides in visual content creation, conventional approaches face computational challenges, especially for high-resolution images, as they denoise the entire image from noisy…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Haohang Xu, Longyu Chen, Yichen Zhang, Shuangrui Ding, Zhipeng Zhang

Directly Denoising Diffusion Models

In this paper, we present the Directly Denoising Diffusion Model (DDDM): a simple and generic approach for generating realistic images with few-step sampling, while multistep sampling is still preserved for better performance. DDDMs require…

Computer Vision and Pattern Recognition · Computer Science 2024-06-03 Dan Zhang, Jingjing Wang, Feng Luo

FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes

The class-conditional image generation based on diffusion models is renowned for generating high-quality and diverse images. However, most prior efforts focus on generating images for general categories, e.g., 1000 classes in ImageNet-1k. A…

Computer Vision and Pattern Recognition · Computer Science 2024-06-05 Ziying Pan, Kun Wang, Gang Li, Feihong He, Yongxuan Lai

Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution

Image super-resolution is a fundamentally ill-posed problem because multiple valid high-resolution images exist for one low-resolution image. Super-resolution methods based on diffusion probabilistic models can deal with the ill-posed…

Computer Vision and Pattern Recognition · Computer Science 2024-04-17 Yutao Yuan, Chun Yuan

AdaDiff: Adaptive Step Selection for Fast Diffusion Models

Diffusion models, as a type of generative model, have achieved impressive results in generating images and videos conditioned on textual conditions. However, the generation process of diffusion models involves denoising dozens of steps to…

Computer Vision and Pattern Recognition · Computer Science 2024-12-31 Hui Zhang, Zuxuan Wu, Zhen Xing, Jie Shao, Yu-Gang Jiang

Foresight Diffusion: Improving Sampling Consistency in Predictive Diffusion Models

Diffusion and flow-based models have enabled significant progress in generation tasks across various modalities and have recently found applications in predictive learning. However, unlike typical generation tasks that encourage sample…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Yu Zhang, Xingzhuo Guo, Haoran Xu, Jialong Wu, Mingsheng Long