Related papers: DisCa: Accelerating Video Diffusion Transformers w…

AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset

Diffusion models have achieved remarkable progress in the field of video generation. However, their iterative denoising nature requires a large number of inference steps to generate a video, which is slow and computationally expensive. In…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Haiyu Zhang , Xinyuan Chen , Yaohui Wang , Xihui Liu , Yunhong Wang , Yu Qiao

Accelerating Diffusion Sampling with Classifier-based Feature Distillation

Although diffusion model has shown great potential for generating higher quality images than GANs, slow sampling speed hinders its wide application in practice. Progressive distillation is thus proposed for fast sampling by progressively…

Computer Vision and Pattern Recognition · Computer Science 2023-03-15 Wujie Sun , Defang Chen , Can Wang , Deshi Ye , Yan Feng , Chun Chen

Accelerating Diffusion Transformers with Token-wise Feature Caching

Diffusion transformers have shown significant effectiveness in both image and video synthesis at the expense of huge computation costs. To address this problem, feature caching methods have been introduced to accelerate diffusion…

Machine Learning · Computer Science 2025-02-20 Chang Zou , Xuyang Liu , Ting Liu , Siteng Huang , Linfeng Zhang

Accelerating Video Diffusion Models via Distribution Matching

Generative models, particularly diffusion models, have made significant success in data synthesis across various modalities, including images, videos, and 3D assets. However, current diffusion models are computationally intensive, often…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Yuanzhi Zhu , Hanshu Yan , Huan Yang , Kai Zhang , Junnan Li

HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration

Diffusion Transformers (DiTs) excel in generative tasks but face practical deployment challenges due to high inference costs. Feature caching, which stores and retrieves redundant computations, offers the potential for acceleration.…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Yushi Huang , Zining Wang , Ruihao Gong , Jing Liu , Xinjie Zhang , Jinyang Guo , Xianglong Liu , Jun Zhang

Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation

Image diffusion distillation achieves high-fidelity generation with very few sampling steps. However, applying these techniques directly to video diffusion often results in unsatisfactory frame quality due to the limited visual quality in…

Computer Vision and Pattern Recognition · Computer Science 2024-10-29 Yuanhao Zhai , Kevin Lin , Zhengyuan Yang , Linjie Li , Jianfeng Wang , Chung-Ching Lin , David Doermann , Junsong Yuan , Lijuan Wang

Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

In this paper, we propose an efficient, fast, and versatile distillation method to accelerate the generation of pre-trained diffusion models: Flash Diffusion. The method reaches state-of-the-art performances in terms of FID and CLIP-Score…

Computer Vision and Pattern Recognition · Computer Science 2024-12-19 Clément Chadebec , Onur Tasar , Eyal Benaroche , Benjamin Aubin

Compute Only 16 Tokens in One Timestep: Accelerating Diffusion Transformers with Cluster-Driven Feature Caching

Diffusion transformers have gained significant attention in recent years for their ability to generate high-quality images and videos, yet still suffer from a huge computational cost due to their iterative denoising process. Recently,…

Computer Vision and Pattern Recognition · Computer Science 2025-09-15 Zhixin Zheng , Xinyu Wang , Chang Zou , Shaobo Wang , Linfeng Zhang

SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching

Diffusion models achieve state-of-the-art video generation quality, but their inference remains expensive due to the large number of sequential denoising steps. This has motivated a growing line of research on accelerating diffusion…

Computer Vision and Pattern Recognition · Computer Science 2026-03-02 Yasaman Haghighi , Alexandre Alahi

LESA: Learnable Stage-Aware Predictors for Diffusion Model Acceleration

Diffusion models have achieved remarkable success in image and video generation tasks. However, the high computational demands of Diffusion Transformers (DiTs) pose a significant challenge to their practical deployment. While feature…

Computer Vision and Pattern Recognition · Computer Science 2026-05-28 Peiliang Cai , Jiacheng Liu , Haowen Xu , Xinyu Wang , Chang Zou , Linfeng Zhang

DiffSparse: Accelerating Diffusion Transformers with Learned Token Sparsity

Diffusion models demonstrate outstanding performance in image generation, but their multi-step inference mechanism requires immense computational cost. Previous works accelerate inference by leveraging layer or token cache techniques to…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Haowei Zhu , Ji Liu , Ziqiong Liu , Dong Li , Junhai Yong , Bin Wang , Emad Barsoum

Progressive Distillation for Fast Sampling of Diffusion Models

Diffusion models have recently shown great promise for generative modeling, outperforming GANs on perceptual quality and autoregressive models at density estimation. A remaining downside is their slow sampling time: generating high quality…

Machine Learning · Computer Science 2022-06-08 Tim Salimans , Jonathan Ho

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

In this paper, we present \textbf{\textit{FasterCache}}, a novel training-free strategy designed to accelerate the inference of video diffusion models with high-quality generation. By analyzing existing cache-based methods, we observe that…

Computer Vision and Pattern Recognition · Computer Science 2025-03-13 Zhengyao Lv , Chenyang Si , Junhao Song , Zhenyu Yang , Yu Qiao , Ziwei Liu , Kwan-Yee K. Wong

Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching

Video generation models have demonstrated remarkable performance, yet their broader adoption remains constrained by slow inference speeds and substantial computational costs, primarily due to the iterative nature of the denoising process.…

Computer Vision and Pattern Recognition · Computer Science 2025-07-04 Xin Zhou , Dingkang Liang , Kaijin Chen , Tianrui Feng , Xiwu Chen , Hongkai Lin , Yikang Ding , Feiyang Tan , Hengshuang Zhao , Xiang Bai

LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation

Training-free acceleration has emerged as an advanced research area in video generation based on diffusion models. The redundancy of latents in diffusion model inference provides a natural entry point for acceleration. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2025-10-08 Yang Xiao , Gen Li , Kaiyuan Deng , Yushu Wu , Zheng Zhan , Yanzhi Wang , Xiaolong Ma , Bo Hui

SpeCa: Accelerating Diffusion Transformers with Speculative Feature Caching

Diffusion models have revolutionized high-fidelity image and video synthesis, yet their computational demands remain prohibitive for real-time applications. These models face two fundamental challenges: strict temporal dependencies…

Machine Learning · Computer Science 2025-09-16 Jiacheng Liu , Chang Zou , Yuanhuiyi Lyu , Fei Ren , Shaobo Wang , Kaixin Li , Linfeng Zhang

Learnable Sampler Distillation for Discrete Diffusion Models

Discrete diffusion models (DDMs) have shown powerful generation ability for discrete data modalities like text and molecules. However, their practical application is hindered by inefficient sampling, requiring a large number of sampling…

Machine Learning · Computer Science 2025-09-25 Feiyang Fu , Tongxian Guo , Zhaoqiang Liu

Revisiting Diffusion Models: From Generative Pre-training to One-Step Generation

Diffusion distillation is a widely used technique to reduce the sampling cost of diffusion models, yet it often requires extensive training, and the student performance tends to be degraded. Recent studies show that incorporating a GAN…

Machine Learning · Computer Science 2025-06-12 Bowen Zheng , Tianming Yang

Continual Learning of Diffusion Models with Generative Distillation

Diffusion models are powerful generative models that achieve state-of-the-art performance in image synthesis. However, training them demands substantial amounts of data and computational resources. Continual learning would allow for…

Machine Learning · Computer Science 2025-03-05 Sergi Masip , Pau Rodriguez , Tinne Tuytelaars , Gido M. van de Ven

DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization

Diffusion probabilistic models have shown significant progress in video generation; however, their computational efficiency is limited by the large number of sampling steps required. Reducing sampling steps often compromises video quality…

Computer Vision and Pattern Recognition · Computer Science 2026-05-08 Zihan Ding , Chi Jin , Difan Liu , Haitian Zheng , Krishna Kumar Singh , Qiang Zhang , Yan Kang , Zhe Lin , Yuchen Liu