Related papers: Clockwork Diffusion: Efficient Generation With Mod…

Efficient Coarse-to-Fine Diffusion Models with Time Step Sequence Redistribution

Recently, diffusion models (DMs) have made significant strides in high-quality image generation. However, the multi-step denoising process often results in considerable computational overhead, impeding deployment on resource-constrained…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Yu-Shan Tai , An-Yeu Wu

Reusing Computation in Text-to-Image Diffusion for Efficient Generation of Image Sets

Text-to-image diffusion models enable high-quality image generation but are computationally expensive. While prior work optimizes per-inference efficiency, we explore an orthogonal approach: reducing redundancy across correlated prompts.…

Computer Vision and Pattern Recognition · Computer Science 2025-08-29 Dale Decatur , Thibault Groueix , Wang Yifan , Rana Hanocka , Vladimir Kim , Matheus Gadelha

Denoising Diffusion Step-aware Models

Denoising Diffusion Probabilistic Models (DDPMs) have garnered popularity for data generation across various domains. However, a significant bottleneck is the necessity for whole-network computation during every step of the generative…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Shuai Yang , Yukang Chen , Luozhou Wang , Shu Liu , Yingcong Chen

One-step Diffusion with Distribution Matching Distillation

Diffusion models generate high-quality images but require dozens of forward passes. We introduce Distribution Matching Distillation (DMD), a procedure to transform a diffusion model into a one-step image generator with minimal impact on…

Computer Vision and Pattern Recognition · Computer Science 2024-10-08 Tianwei Yin , Michaël Gharbi , Richard Zhang , Eli Shechtman , Fredo Durand , William T. Freeman , Taesung Park

Cost-Aware Routing for Efficient Text-To-Image Generation

Diffusion models are well known for their ability to generate a high-fidelity image for an input prompt through an iterative denoising process. Unfortunately, the high fidelity also comes at a high computational cost due to the inherently…

Computer Vision and Pattern Recognition · Computer Science 2025-06-24 Qinchan Li , Kenneth Chen , Changyue Su , Wittawat Jitkrittum , Qi Sun , Patsorn Sangkloy

BudgetFusion: Perceptually-Guided Adaptive Diffusion Models

Diffusion models have shown unprecedented success in the task of text-to-image generation. While these models are capable of generating high-quality and realistic images, the complexity of sequential denoising has raised societal concerns…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Qinchan Li , Kenneth Chen , Changyue Su , Qi Sun

Accelerating Diffusion Decoders via Multi-Scale Sampling and One-Step Distillation

Image tokenization plays a central role in modern generative modeling by mapping visual inputs into compact representations that serve as an intermediate signal between pixels and generative models. Diffusion-based decoders have recently…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Chuhan Wang , Hao Chen

Timestep-Aware Block Masking for Efficient Diffusion Model Inference

Diffusion Probabilistic Models (DPMs) have achieved great success in image generation but suffer from high inference latency due to their iterative denoising nature. Motivated by the evolving feature dynamics across the denoising…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Haodong He , Yuan Gao , Weizhong Zhang , Gui-Song Xia

Improving Efficiency of Diffusion Models via Multi-Stage Framework and Tailored Multi-Decoder Architectures

Diffusion models, emerging as powerful deep generative tools, excel in various applications. They operate through a two-step process: introducing noise into training samples and then employing a model to convert random noise into new…

Computer Vision and Pattern Recognition · Computer Science 2026-02-13 Huijie Zhang , Yifu Lu , Ismail Alkhouri , Saiprasad Ravishankar , Dogyoon Song , Qing Qu

SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds

Text-to-image diffusion models can create stunning images from natural language descriptions that rival the work of professional artists and photographers. However, these models are large, with complex network architectures and tens of…

Computer Vision and Pattern Recognition · Computer Science 2023-10-17 Yanyu Li , Huan Wang , Qing Jin , Ju Hu , Pavlo Chemerys , Yun Fu , Yanzhi Wang , Sergey Tulyakov , Jian Ren

The Missing U for Efficient Diffusion Models

Diffusion Probabilistic Models stand as a critical tool in generative modelling, enabling the generation of complex data distributions. This family of generative models yields record-breaking performance in tasks such as image synthesis,…

Machine Learning · Computer Science 2024-04-08 Sergio Calvo-Ordonez , Chun-Wun Cheng , Jiahao Huang , Lipei Zhang , Guang Yang , Carola-Bibiane Schonlieb , Angelica I Aviles-Rivero

Towards Consistent and Efficient Dataset Distillation via Diffusion-Driven Selection

Dataset distillation provides an effective approach to reduce memory and computational costs by optimizing a compact dataset that achieves performance comparable to the full original. However, for large-scale datasets and complex deep…

Computer Vision and Pattern Recognition · Computer Science 2025-11-14 Xinhao Zhong , Shuoyang Sun , Xulin Gu , Zhaoyang Xu , Yaowei Wang , Min Zhang , Bin Chen

TMPDiff: Temporal Mixed-Precision for Diffusion Models

Diffusion models are the go-to method for Text-to-Image generation, but their iterative denoising process has high inference latency. Quantization reduces compute time by using lower bitwidths, but applies a fixed precision across all…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Basile Lewandowski , Simon Kurz , Aditya Shankar , Robert Birke , Jian-Jia Chen , Lydia Y. Chen

Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment

Diffusion models have shown remarkable success across generative tasks, yet their high computational demands challenge deployment on resource-limited platforms. This paper investigates a critical question for compute-optimal diffusion model…

Computer Vision and Pattern Recognition · Computer Science 2025-08-11 Zhenbang Du , Yonggan Fu , Lifu Wang , Jiayi Qian , Xiao Luo , Yingyan Lin

CAT Pruning: Cluster-Aware Token Pruning For Text-to-Image Diffusion Models

Diffusion models have revolutionized generative tasks, especially in the domain of text-to-image synthesis; however, their iterative denoising process demands substantial computational resources. In this paper, we present a novel…

Computer Vision and Pattern Recognition · Computer Science 2025-02-04 Xinle Cheng , Zhuoming Chen , Zhihao Jia

Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

In this paper, we propose an efficient, fast, and versatile distillation method to accelerate the generation of pre-trained diffusion models: Flash Diffusion. The method reaches state-of-the-art performance in terms of FID and CLIP-Score…

Computer Vision and Pattern Recognition · Computer Science 2024-12-19 Clément Chadebec , Onur Tasar , Eyal Benaroche , Benjamin Aubin

Step Saver: Predicting Minimum Denoising Steps for Diffusion Model Image Generation

In this paper, we introduce an innovative NLP model specifically fine-tuned to determine the minimal number of denoising steps required for any given text prompt. This advanced model serves as a real-time tool that recommends the ideal…

Computer Vision and Pattern Recognition · Computer Science 2024-08-06 Jean Yu , Haim Barad

Efficient Diffusion Models for Vision: A Survey

Diffusion Models (DMs) have demonstrated state-of-the-art performance in content generation without requiring adversarial training. These models are trained using a two-step process. First, a forward (diffusion) process gradually adds…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Anwaar Ulhaq , Naveed Akhtar

Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner

A diffusion model, which is formulated to produce an image using thousands of denoising steps, usually suffers from a slow inference speed. Existing acceleration algorithms simplify the sampling by skipping most steps yet exhibit…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Mengfei Xia , Yujun Shen , Changsong Lei , Yu Zhou , Ran Yi , Deli Zhao , Wenping Wang , Yong-Jin Liu

Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning

Diffusion models are trained by learning a sequence of models that reverse each step of noise corruption. Typically, the model parameters are fully shared across multiple timesteps to enhance training efficiency. However, since the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-17 Qianli Ma , Xuefei Ning , Dongrui Liu , Li Niu , Linfeng Zhang