English
Related papers

Related papers: Accelerating Frequency Domain Diffusion Models wit…

200 papers

Feature caching approaches accelerate diffusion transformers (DiTs) by storing the output features of computationally expensive modules at certain timesteps, and exploiting them for subsequent steps to reduce redundant computations. Recent…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Byunggwan Son , Jeimin Jeon , Jeongwoo Choi , Bumsub Ham

The application of diffusion transformers is suffering from their significant inference costs. Recently, feature caching has been proposed to solve this problem by reusing features from previous timesteps, thereby skipping computation in…

Diffusion models suffer from substantial computational overhead due to their inherently iterative inference process. While feature caching offers a promising acceleration strategy by reusing intermediate outputs across timesteps, naive…

Computer Vision and Pattern Recognition · Computer Science 2026-02-11 Xurui Peng , Chenqian Yan , Hong Liu , Rui Ma , Fangmin Chen , Xing Wang , Zhihua Wu , Songwei Liu , Mingbao Lin

Diffusion-based image generation models excel at producing high-quality synthetic content, but suffer from slow and computationally expensive inference. Prior work has attempted to mitigate this by caching and reusing features within…

Computer Vision and Pattern Recognition · Computer Science 2026-03-04 Anirud Aggarwal , Abhinav Shrivastava , Matthew Gwilliam

Diffusion Transformer (DiT) is a crucial method for content generation. However, it needs a lot of time to sample. Many studies have attempted to use caching to reduce the time consumption of sampling. Existing caching methods accelerate…

Computer Vision and Pattern Recognition · Computer Science 2025-07-21 Junxiang Qiu , Shuo Wang , Jinda Lu , Lin Liu , Houcheng Jiang , Xingyu Zhu , Yanbin Hao

Diffusion Transformer (DiT) has exhibited impressive generation capabilities but faces great challenges due to its high computational complexity. To address this issue, various methods, notably feature caching, have been introduced.…

Computer Vision and Pattern Recognition · Computer Science 2025-10-07 Zhen Zou , Feng Zhao

Diffusion models have gradually gained prominence in the field of image synthesis, showcasing remarkable generative capabilities. Nevertheless, the slow inference and complex networks, resulting from redundancy at both temporal and…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Xuewen Liu , Zhikai Li , Qingyi Gu

In this paper, we present \textbf{\textit{FasterCache}}, a novel training-free strategy designed to accelerate the inference of video diffusion models with high-quality generation. By analyzing existing cache-based methods, we observe that…

Computer Vision and Pattern Recognition · Computer Science 2025-03-13 Zhengyao Lv , Chenyang Si , Junhao Song , Zhenyu Yang , Yu Qiao , Ziwei Liu , Kwan-Yee K. Wong

Diffusion transformers (DiT) have become the de facto choice for generating high-quality images and videos, largely due to their scalability, which enables the construction of larger models for enhanced performance. However, the increased…

Computer Vision and Pattern Recognition · Computer Science 2024-07-02 Pratheba Selvaraju , Tianyu Ding , Tianyi Chen , Ilya Zharkov , Luming Liang

Diffusion models deliver high-fidelity synthesis but remain slow due to iterative sampling. We empirically observe there exists feature invariance in deterministic sampling, and present InvarDiff, a training-free acceleration method that…

Computer Vision and Pattern Recognition · Computer Science 2025-12-08 Zihao Wu

Recent advances in diffusion models have demonstrated remarkable capabilities in video generation. However, the computational intensity remains a significant challenge for practical applications. While feature caching has been proposed to…

Computer Vision and Pattern Recognition · Computer Science 2025-07-29 Xuran Ma , Yexin Liu , Yaofu Liu , Xianfeng Wu , Mingzhe Zheng , Zihao Wang , Ser-Nam Lim , Harry Yang

Diffusion Transformers (DiTs) have emerged as the dominant architecture for high-quality image and video generation, yet their iterative denoising process incurs substantial computational cost during inference. Existing caching methods…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Guandong Li

Diffusion transformers have shown significant effectiveness in both image and video synthesis at the expense of huge computation costs. To address this problem, feature caching methods have been introduced to accelerate diffusion…

Machine Learning · Computer Science 2025-02-20 Chang Zou , Xuyang Liu , Ting Liu , Siteng Huang , Linfeng Zhang

With the advance of diffusion models, today's video generation has achieved impressive quality. To extend the generation length and facilitate real-world applications, a majority of video diffusion models (VDMs) generate videos in an…

Computer Vision and Pattern Recognition · Computer Science 2025-05-22 Kaifeng Gao , Jiaxin Shi , Hanwang Zhang , Chunping Wang , Jun Xiao , Long Chen

Diffusion Transformers (DiT) are powerful generative models but remain computationally intensive due to their iterative structure and deep transformer stacks. To alleviate this inefficiency, we propose \textbf{FastCache}, a…

Machine Learning · Computer Science 2026-03-30 Dong Liu , Yanxuan Yu , Jiayi Zhang , Yifan Li , Ben Lengerich , Ying Nian Wu

Diffusion models have recently attained significant interest within the community owing to their strong performance as generative models. Furthermore, its application to inverse problems have demonstrated state-of-the-art performance.…

Image and Video Processing · Electrical Eng. & Systems 2022-03-22 Hyungjin Chung , Byeongsu Sim , Jong Chul Ye

Diffusion and rectified flow (RF) models generate high-fidelity images and videos, but their iterative velocity-field evaluations are computationally expensive. Existing caching methods accelerate sampling by skipping timesteps, yet their…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Xiao Liu , Kai Liu , Naiyang Guan , Hongliang Lu , Zhixin Wang , Zhikai Chen , Renjing Pei , Yulun Zhang

Diffusion models have emerged as state-of-the-art in image generation, but their practical deployment is hindered by the significant computational cost of their iterative denoising process. While existing caching techniques can accelerate…

Computer Vision and Pattern Recognition · Computer Science 2025-11-06 Mingyu Sung , Il-Min Kim , Sangseok Yun , Jae-Mo Kang

Diffusion transformers have gained significant attention in recent years for their ability to generate high-quality images and videos, yet still suffer from a huge computational cost due to their iterative denoising process. Recently,…

Computer Vision and Pattern Recognition · Computer Science 2025-09-15 Zhixin Zheng , Xinyu Wang , Chang Zou , Shaobo Wang , Linfeng Zhang

Latent Diffusion Models (LDMs) produce high-quality, photo-realistic images, however, the latency incurred by multiple costly inference iterations can restrict their applicability. We introduce LatentCRF, a continuous Conditional Random…

Computer Vision and Pattern Recognition · Computer Science 2024-12-25 Kanchana Ranasinghe , Sadeep Jayasumana , Andreas Veit , Ayan Chakrabarti , Daniel Glasner , Michael S Ryoo , Srikumar Ramalingam , Sanjiv Kumar
‹ Prev 1 2 3 10 Next ›