Related papers: AsyncDiff: Parallelizing Diffusion Models by Async…

Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism

Diffusion models have emerged as a powerful class of generative models across various modalities, including image, video, and audio synthesis. However, their deployment is often limited by significant inference latency, primarily due to the…

Machine Learning · Computer Science · 2025-10-14 · Kunyun Wang, Bohan Li, Kai Yu, Minyi Guo, Jieru Zhao

DRiffusion: Draft-and-Refine Process Parallelizes Diffusion Models with Ease

Diffusion models have achieved remarkable success in generating high-fidelity content but suffer from slow, iterative sampling, resulting in high latency that limits their use in interactive applications. We introduce DRiffusion, a parallel…

Machine Learning · Computer Science · 2026-03-30 · Runsheng Bai, Chengyu Zhang, Yangdong Deng

Accelerating Diffusion via Hybrid Data-Pipeline Parallelism Based on Conditional Guidance Scheduling

Diffusion models have achieved remarkable progress in high-fidelity image, video, and audio generation, yet inference remains computationally expensive. Nevertheless, current diffusion acceleration methods based on distributed parallelism…

Computer Vision and Pattern Recognition · Computer Science · 2026-02-26 · Euisoo Jung, Byunghyun Kim, Hyunjin Kim, Seonghye Cho, Jae-Gil Lee

DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Diffusion models have achieved great success in synthesizing high-quality images. However, generating high-resolution images with diffusion models is still challenging due to the enormous computational costs, resulting in a prohibitive…

Computer Vision and Pattern Recognition · Computer Science · 2024-07-16 · Muyang Li, Tianle Cai, Jiaxin Cao, Qinsheng Zhang, Han Cai, Junjie Bai, Yangqing Jia, Ming-Yu Liu, Kai Li, Song Han

Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

Diffusion models have recently achieved great success in the synthesis of high-quality images and videos. However, the existing denoising techniques in diffusion models are commonly based on step-by-step noise predictions, which suffers…

Computer Vision and Pattern Recognition · Computer Science · 2024-10-15 · Hancheng Ye, Jiakang Yuan, Renqiu Xia, Xiangchao Yan, Tao Chen, Junchi Yan, Botian Shi, Bo Zhang

Asynchronous Denoising Diffusion Models for Aligning Text-to-Image Generation

Diffusion models have achieved impressive results in generating high-quality images. Yet, they often struggle to faithfully align the generated images with the input prompts. This limitation is associated with synchronous denoising, where…

Computer Vision and Pattern Recognition · Computer Science · 2026-02-27 · Zijing Hu, Yunze Tong, Fengda Zhang, Junkun Yuan, Jun Xiao, Kun Kuang

Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion

Denoising Diffusion Probabilistic Models have shown extraordinary ability on various generative tasks. However, their slow inference speed renders them impractical in speech synthesis. This paper proposes a linear diffusion model (LinDiff)…

Sound · Computer Science · 2023-06-13 · Haogeng Liu, Tao Wang, Jie Cao, Ran He, Jianhua Tao

DisagFusion: Asynchronous Pipeline Parallelism and Elastic Scheduling for Disaggregated Diffusion Serving

Diffusion-based generation is increasingly powering production content pipelines; however, deploying these models at scale remains a significant challenge. Model weights frequently exceed the memory capacity of commodity GPUs, while the…

Distributed, Parallel, and Cluster Computing · Computer Science · 2026-05-26 · Hantian Zha, Teng Ma, Yang Yong, Haiwen Fu, Ruiyang Ma, Wei Gao, Ruihao Gong, Xianglong Liu, Wei Wang, Yunpeng Chai

AdaDiff: Adaptive Step Selection for Fast Diffusion Models

Diffusion models, as a type of generative model, have achieved impressive results in generating images and videos conditioned on textual conditions. However, the generation process of diffusion models involves denoising dozens of steps to…

Computer Vision and Pattern Recognition · Computer Science · 2024-12-31 · Hui Zhang, Zuxuan Wu, Zhen Xing, Jie Shao, Yu-Gang Jiang

Improving Efficiency of Diffusion Models via Multi-Stage Framework and Tailored Multi-Decoder Architectures

Diffusion models, emerging as powerful deep generative tools, excel in various applications. They operate through a two-step process: introducing noise into training samples and then employing a model to convert random noise into new…

Computer Vision and Pattern Recognition · Computer Science · 2026-02-13 · Huijie Zhang, Yifu Lu, Ismail Alkhouri, Saiprasad Ravishankar, Dogyoon Song, Qing Qu

AsyncDiff: Asynchronous Timestep Conditioning for Enhanced Text-to-Image Diffusion Inference

Text-to-image diffusion inference typically follows synchronized schedules, where the numerical integrator advances the latent state to the same timestep at which the denoiser is conditioned. We propose an asynchronous inference mechanism…

Computer Vision and Pattern Recognition · Computer Science · 2025-12-23 · Longhuan Xu, Feng Yin, Cunjian Chen

STADI: Fine-Grained Step-Patch Diffusion Parallelism for Heterogeneous GPUs

The escalating adoption of diffusion models for applications such as image generation demands efficient parallel inference techniques to manage their substantial computational cost. However, existing diffusion parallelism inference schemes…

Distributed, Parallel, and Cluster Computing · Computer Science · 2025-09-16 · Han Liang, Jiahui Zhou, Zicheng Zhou, Xiaoxi Zhang, Xu Chen

ADiff4TPP: Asynchronous Diffusion Models for Temporal Point Processes

This work introduces a novel approach to modeling temporal point processes using diffusion models with an asynchronous noise schedule. At each step of the diffusion process, the noise schedule injects noise of varying scales into different…

Machine Learning · Computer Science · 2025-04-30 · Amartya Mukherjee, Ruizhi Deng, He Zhao, Yuzhen Mao, Leonid Sigal, Frederick Tung

FastFlow: Accelerating The Generative Flow Matching Models with Bandit Inference

Flow-matching models deliver state-of-the-art fidelity in image and video generation, but the inherent sequential denoising process renders them slower. Existing acceleration methods like distillation, trajectory truncation, and consistency…

Computer Vision and Pattern Recognition · Computer Science · 2026-02-12 · Divya Jyoti Bajpai, Dhruv Bhardwaj, Soumya Roy, Tejas Duseja, Harsh Agarwal, Aashay Sandansing, Manjesh Kumar Hanawal

SynerDiff: Synergetic Continuous Batching for Fast and Parallel Diffusion Model Inference

The expansion of Artificial Intelligence-generated content service requires diffusion model serving to simultaneously achieve high throughput and low task end-to-end (E2E) latency. However, existing continuous batching methods suffer from…

Artificial Intelligence · Computer Science · 2026-05-12 · Ziqi Zhou, Peng Yang, Yuxin Liang, Mingliu Liu, Jia Lu

AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation

Diffusion models achieve great success in generating diverse and high-fidelity images, yet their widespread application, especially in real-time scenarios, is hampered by their inherently slow generation speed. The slow generation stems…

Computer Vision and Pattern Recognition · Computer Science · 2024-08-19 · Shengkun Tang, Yaqing Wang, Caiwen Ding, Yi Liang, Yao Li, Dongkuan Xu

Parallel Sampling of Diffusion Models

Diffusion models are powerful generative models but suffer from slow sampling, often taking 1000 sequential denoising steps for one sample. As a result, considerable efforts have been directed toward reducing the number of denoising steps,…

Machine Learning · Computer Science · 2023-10-17 · Andy Shih, Suneel Belkhale, Stefano Ermon, Dorsa Sadigh, Nima Anari

DiffusionAD: Norm-guided One-step Denoising Diffusion for Anomaly Detection

Anomaly detection has garnered extensive applications in real industrial manufacturing due to its remarkable effectiveness and efficiency. However, previous generative-based models have been limited by suboptimal reconstruction quality,…

Computer Vision and Pattern Recognition · Computer Science · 2025-05-20 · Hui Zhang, Zheng Wang, Dan Zeng, Zuxuan Wu, Yu-Gang Jiang

BADiff: Bandwidth Adaptive Diffusion Model

In this work, we propose a novel framework to enable diffusion models to adapt their generation quality based on real-time network bandwidth constraints. Traditional diffusion models produce high-fidelity images by performing a fixed number…

Computer Vision and Pattern Recognition · Computer Science · 2026-04-10 · Xi Zhang, Hanwei Zhu, Yan Zhong, Jiamang Wang, Weisi Lin

DuoDiff: Accelerating Diffusion Models with a Dual-Backbone Approach

Diffusion models have achieved unprecedented performance in image generation, yet they suffer from slow inference due to their iterative sampling process. To address this, early-exiting has recently been proposed, where the depth of the…

Computer Vision and Pattern Recognition · Computer Science · 2024-12-06 · Daniel Gallo Fernández, Răzvan-Andrei Matişan, Alejandro Monroy Muñoz, Ana-Maria Vasilcoiu, Janusz Partyka, Tin Hadži Veljković, Metod Jazbec