English
Related papers

Related papers: CacheQuant: Comprehensively Accelerated Diffusion …

200 papers

Diffusion models have recently gained unprecedented attention in the field of image synthesis due to their remarkable generative capabilities. Notwithstanding their prowess, these models often incur substantial computational costs,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-11 Xinyin Ma , Gongfan Fang , Xinchao Wang

Efficient video generation models are increasingly vital for multimedia synthetic content generation. Leveraging the Transformer architecture and the diffusion process, video DiT models have emerged as a dominant approach for high-quality…

Graphics · Computer Science 2026-02-27 Yuanxin Wei , Lansong Diao , Bujiao Chen , Shenggan Cheng , Zhengping Qian , Wenyuan Yu , Nong Xiao , Wei Lin , Jiangsu Du

Recent years have witnessed the rapid development of acceleration techniques for diffusion models, especially caching-based acceleration methods. These studies seek to answer two fundamental questions: "When to cache" and "How to use…

Computer Vision and Pattern Recognition · Computer Science 2025-10-03 Jiazi Bu , Pengyang Ling , Yujie Zhou , Yibin Wang , Yuhang Zang , Dahua Lin , Jiaqi Wang

Diffusion Models have become a cornerstone of modern generative AI for their exceptional generation quality and controllability. However, their inherent \textit{multi-step iterations} and \textit{complex backbone networks} lead to…

Diffusion models achieve state-of-the-art video generation quality, but their inference remains expensive due to the large number of sequential denoising steps. This has motivated a growing line of research on accelerating diffusion…

Computer Vision and Pattern Recognition · Computer Science 2026-03-02 Yasaman Haghighi , Alexandre Alahi

Diffusion-based image generation models excel at producing high-quality synthetic content, but suffer from slow and computationally expensive inference. Prior work has attempted to mitigate this by caching and reusing features within…

Computer Vision and Pattern Recognition · Computer Science 2026-03-04 Anirud Aggarwal , Abhinav Shrivastava , Matthew Gwilliam

Diffusion models demonstrate outstanding performance in image generation, but their multi-step inference mechanism requires immense computational cost. Previous works accelerate inference by leveraging layer or token cache techniques to…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Haowei Zhu , Ji Liu , Ziqiong Liu , Dong Li , Junhai Yong , Bin Wang , Emad Barsoum

Video generation models have demonstrated remarkable performance, yet their broader adoption remains constrained by slow inference speeds and substantial computational costs, primarily due to the iterative nature of the denoising process.…

Computer Vision and Pattern Recognition · Computer Science 2025-07-04 Xin Zhou , Dingkang Liang , Kaijin Chen , Tianrui Feng , Xiwu Chen , Hongkai Lin , Yikang Ding , Feiyang Tan , Hengshuang Zhao , Xiang Bai

Diffusion transformers have shown significant effectiveness in both image and video synthesis at the expense of huge computation costs. To address this problem, feature caching methods have been introduced to accelerate diffusion…

Machine Learning · Computer Science 2025-02-20 Chang Zou , Xuyang Liu , Ting Liu , Siteng Huang , Linfeng Zhang

Recently, Diffusion Transformers (DiTs) have emerged as a dominant architecture in video generation, surpassing U-Net-based models in terms of performance. However, the enhanced capabilities of DiTs come with significant drawbacks,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Junyi Wu , Zhiteng Li , Zheng Hui , Yulun Zhang , Linghe Kong , Xiaokang Yang

Diffusion-based video editing has emerged as an important paradigm for high-quality and flexible content generation. However, despite their generality and strong modeling capacity, Diffusion Transformers (DiT) remain computationally…

Computer Vision and Pattern Recognition · Computer Science 2026-03-26 Tianyi Liu , Ye Lu , Linfeng Zhang , Chen Cai , Jianjun Gao , Yi Wang , Kim-Hui Yap , Lap-Pui Chau

Diffusion models achieve remarkable generative quality, but computational overhead scales with step count, model depth, and sequence length. Feature caching is effective since adjacent timesteps yield highly similar features. However, an…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Tingyan Wen , Haoyu Li , Yihuang Chen , Xing Zhou , Lifei Zhu , Xueqian Wang

Diffusion models have achieved impressive generative quality across modalities like 2D images, videos, and 3D shapes, but their inference remains computationally expensive due to the iterative denoising process. While recent caching-based…

Computer Vision and Pattern Recognition · Computer Science 2026-03-26 Mengyu Yang , Yanming Yang , Chenyi Xu , Chenxi Song , Yufan Zuo , Tong Zhao , Ruibo Li , Chi Zhang

Diffusion models have emerged as powerful generative models, but their high computation cost in iterative sampling remains a significant bottleneck. In this work, we present an in-depth and insightful study of state-of-the-art acceleration…

Computer Vision and Pattern Recognition · Computer Science 2025-11-19 Weizhi Gao , Zhichao Hou , Junqi Yin , Feiyi Wang , Linyu Peng , Xiaorui Liu

Diffusion Transformers (DiT) are powerful generative models but remain computationally intensive due to their iterative structure and deep transformer stacks. To alleviate this inefficiency, we propose \textbf{FastCache}, a…

Machine Learning · Computer Science 2026-03-30 Dong Liu , Yanxuan Yu , Jiayi Zhang , Yifan Li , Ben Lengerich , Ying Nian Wu

Diffusion models have recently revolutionized the field of image synthesis due to their ability to generate photorealistic images. However, one of the major drawbacks of diffusion models is that the image generation process is costly. A…

As a fundamental backbone for video generation, diffusion models are challenged by low inference speed due to the sequential nature of denoising. Previous methods speed up the models by caching and reusing model outputs at uniformly…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Feng Liu , Shiwei Zhang , Xiaofeng Wang , Yujie Wei , Haonan Qiu , Yuzhong Zhao , Yingya Zhang , Qixiang Ye , Fang Wan

Training-free acceleration has emerged as an advanced research area in video generation based on diffusion models. The redundancy of latents in diffusion model inference provides a natural entry point for acceleration. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2025-10-08 Yang Xiao , Gen Li , Kaiyuan Deng , Yushu Wu , Zheng Zhan , Yanzhi Wang , Xiaolong Ma , Bo Hui

Diffusion models produce realistic images and videos but require substantial computational resources, necessitating multi-accelerator parallelism for real-time deployment. However, parallel inference introduces significant communication…

Computer Vision and Pattern Recognition · Computer Science 2025-12-01 Jiajun Luo , Yicheng Xiao , Jianru Xu , Yangxiu You , Rongwei Lu , Chen Tang , Jingyan Jiang , Zhi Wang

While diffusion models have achieved great success in the field of video generation, this progress is accompanied by a rapidly escalating computational burden. Among the existing acceleration methods, Feature Caching is popular due to its…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Chang Zou , Changlin Li , Yang Li , Patrol Li , Jianbing Wu , Xiao He , Songtao Liu , Zhao Zhong , Kailin Huang , Linfeng Zhang
‹ Prev 1 2 3 10 Next ›