Related papers: DRiffusion: Draft-and-Refine Process Parallelizes …

AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising

Diffusion models have garnered significant interest from the community for their great generative ability across various applications. However, their typical multi-step sequential-denoising nature gives rise to high cumulative latency,…

Computer Vision and Pattern Recognition · Computer Science 2024-09-27 Zigeng Chen , Xinyin Ma , Gongfan Fang , Zhenxiong Tan , Xinchao Wang

Parallel Sampling of Diffusion Models

Diffusion models are powerful generative models but suffer from slow sampling, often taking 1000 sequential denoising steps for one sample. As a result, considerable efforts have been directed toward reducing the number of denoising steps,…

Machine Learning · Computer Science 2023-10-17 Andy Shih , Suneel Belkhale , Stefano Ermon , Dorsa Sadigh , Nima Anari

Accelerating Diffusion via Hybrid Data-Pipeline Parallelism Based on Conditional Guidance Scheduling

Diffusion models have achieved remarkable progress in high-fidelity image, video, and audio generation, yet inference remains computationally expensive. Nevertheless, current diffusion acceleration methods based on distributed parallelism…

Computer Vision and Pattern Recognition · Computer Science 2026-02-26 Euisoo Jung , Byunghyun Kim , Hyunjin Kim , Seonghye Cho , Jae-Gil Lee

Accelerating Parallel Sampling of Diffusion Models

Diffusion models have emerged as state-of-the-art generative models for image generation. However, sampling from diffusion models is usually time-consuming due to the inherent autoregressive nature of their sampling process. In this work,…

Machine Learning · Computer Science 2024-05-28 Zhiwei Tang , Jiasheng Tang , Hao Luo , Fan Wang , Tsung-Hui Chang

Self-Refining Diffusion Samplers: Enabling Parallelization via Parareal Iterations

In diffusion models, samples are generated through an iterative refinement process, requiring hundreds of sequential model evaluations. Several recent methods have introduced approximations (fewer discretization steps or distillation) to…

Machine Learning · Computer Science 2024-12-12 Nikil Roashan Selvam , Amil Merchant , Stefano Ermon

Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

Diffusion models have recently achieved great success in the synthesis of high-quality images and videos. However, the existing denoising techniques in diffusion models are commonly based on step-by-step noise predictions, which suffers…

Computer Vision and Pattern Recognition · Computer Science 2024-10-15 Hancheng Ye , Jiakang Yuan , Renqiu Xia , Xiangchao Yan , Tao Chen , Junchi Yan , Botian Shi , Bo Zhang

Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism

Diffusion models have emerged as a powerful class of generative models across various modalities, including image, video, and audio synthesis. However, their deployment is often limited by significant inference latency, primarily due to the…

Machine Learning · Computer Science 2025-10-14 Kunyun Wang , Bohan Li , Kai Yu , Minyi Guo , Jieru Zhao

DualFast: Dual-Speedup Framework for Fast Sampling of Diffusion Models

Diffusion probabilistic models (DPMs) have achieved impressive success in visual generation. While, they suffer from slow inference speed due to iterative sampling. Employing fewer sampling steps is an intuitive solution, but this will also…

Computer Vision and Pattern Recognition · Computer Science 2025-06-17 Hu Yu , Hao Luo , Fan Wang , Feng Zhao

DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Diffusion models have achieved great success in synthesizing high-quality images. However, generating high-resolution images with diffusion models is still challenging due to the enormous computational costs, resulting in a prohibitive…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Muyang Li , Tianle Cai , Jiaxin Cao , Qinsheng Zhang , Han Cai , Junjie Bai , Yangqing Jia , Ming-Yu Liu , Kai Li , Song Han

One Step Diffusion via Shortcut Models

Diffusion models and flow-matching models have enabled generating diverse and realistic images by learning to transfer noise to data. However, sampling from these models involves iterative denoising over many neural network passes, making…

Machine Learning · Computer Science 2025-06-24 Kevin Frans , Danijar Hafner , Sergey Levine , Pieter Abbeel

Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion

Speculative decoding has emerged as a widely adopted method to accelerate large language model inference without sacrificing the quality of the model outputs. While this technique has facilitated notable speed improvements by enabling…

Computation and Language · Computer Science 2025-02-12 Jacob K Christopher , Brian R Bartoldson , Tal Ben-Nun , Michael Cardei , Bhavya Kailkhura , Ferdinando Fioretto

ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech

Denoising diffusion probabilistic models (DDPMs) have recently achieved leading performances in many generative tasks. However, the inherited iterative sampling process costs hinder their applications to text-to-speech deployment. Through…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-14 Rongjie Huang , Zhou Zhao , Huadai Liu , Jinglin Liu , Chenye Cui , Yi Ren

DuoDiff: Accelerating Diffusion Models with a Dual-Backbone Approach

Diffusion models have achieved unprecedented performance in image generation, yet they suffer from slow inference due to their iterative sampling process. To address this, early-exiting has recently been proposed, where the depth of the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Daniel Gallo Fernández , Răzvan-Andrei Matişan , Alejandro Monroy Muñoz , Ana-Maria Vasilcoiu , Janusz Partyka , Tin Hadži Veljković , Metod Jazbec

TMPDiff: Temporal Mixed-Precision for Diffusion Models

Diffusion models are the go-to method for Text-to-Image generation, but their iterative denoising processes has high inference latency. Quantization reduces compute time by using lower bitwidths, but applies a fixed precision across all…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Basile Lewandowski , Simon Kurz , Aditya Shankar , Robert Birke , Jian-Jia Chen , Lydia Y. Chen

Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion

Denoising Diffusion Probabilistic Models have shown extraordinary ability on various generative tasks. However, their slow inference speed renders them impractical in speech synthesis. This paper proposes a linear diffusion model (LinDiff)…

Sound · Computer Science 2023-06-13 Haogeng Liu , Tao Wang , Jie Cao , Ran He , Jianhua Tao

Progressive Distillation for Fast Sampling of Diffusion Models

Diffusion models have recently shown great promise for generative modeling, outperforming GANs on perceptual quality and autoregressive models at density estimation. A remaining downside is their slow sampling time: generating high quality…

Machine Learning · Computer Science 2022-06-08 Tim Salimans , Jonathan Ho

One-step Diffusion with Distribution Matching Distillation

Diffusion models generate high-quality images but require dozens of forward passes. We introduce Distribution Matching Distillation (DMD), a procedure to transform a diffusion model into a one-step image generator with minimal impact on…

Computer Vision and Pattern Recognition · Computer Science 2024-10-08 Tianwei Yin , Michaël Gharbi , Richard Zhang , Eli Shechtman , Fredo Durand , William T. Freeman , Taesung Park

Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

In this paper, we propose an efficient, fast, and versatile distillation method to accelerate the generation of pre-trained diffusion models: Flash Diffusion. The method reaches state-of-the-art performances in terms of FID and CLIP-Score…

Computer Vision and Pattern Recognition · Computer Science 2024-12-19 Clément Chadebec , Onur Tasar , Eyal Benaroche , Benjamin Aubin

DRIFT: Harnessing Inherent Fault Tolerance for Efficient and Reliable Diffusion Model Inference

Diffusion model deployment has been suffering from high energy consumption and inference latency despite its superior performance in visual generation tasks. Dynamic voltage and frequency scaling (DVFS) offers a promising solution to…

Hardware Architecture · Computer Science 2026-04-13 Jinqi Wen , Tong Xie , Runsheng Wang , Meng Li

Dual-Rate Diffusion: Accelerating diffusion models with an interleaved heavy-light network

Diffusion models achieve state-of-the-art generative performance but suffer from high computational costs during inference due to the repeated evaluation of a heavy neural network. In this work, we propose Dual-Rate Diffusion, a method to…

Machine Learning · Computer Science 2026-05-19 Grigory Bartosh , David Ruhe , Emiel Hoogeboom , Jonathan Heek , Thomas Mensink , Tim Salimans