Related papers: Accelerating Frequency Domain Diffusion Models wit…

Relational Feature Caching for Accelerating Diffusion Transformers

Feature caching approaches accelerate diffusion transformers (DiTs) by storing the output features of computationally expensive modules at certain timesteps, and exploiting them for subsequent steps to reduce redundant computations. Recent…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Byunggwan Son, Jeimin Jeon, Jeongwoo Choi, Bumsub Ham

FreqCa: Accelerating Diffusion Models via Frequency-Aware Caching

The application of diffusion transformers is suffering from their significant inference costs. Recently, feature caching has been proposed to solve this problem by reusing features from previous timesteps, thereby skipping computation in…

Machine Learning · Computer Science 2025-10-13 Jiacheng Liu, Peiliang Cai, Qinming Zhou, Yuqi Lin, Deyang Kong, Benhao Huang, Yupei Pan, Haowen Xu, Chang Zou, Junshu Tang, Shikang Zheng, Linfeng Zhang

ERTACache: Error Rectification and Timesteps Adjustment for Efficient Diffusion

Diffusion models suffer from substantial computational overhead due to their inherently iterative inference process. While feature caching offers a promising acceleration strategy by reusing intermediate outputs across timesteps, naive…

Computer Vision and Pattern Recognition · Computer Science 2026-02-11 Xurui Peng, Chenqian Yan, Hong Liu, Rui Ma, Fangmin Chen, Xing Wang, Zhihua Wu, Songwei Liu, Mingbao Lin

Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model

Diffusion-based image generation models excel at producing high-quality synthetic content, but suffer from slow and computationally expensive inference. Prior work has attempted to mitigate this by caching and reusing features within…

Computer Vision and Pattern Recognition · Computer Science 2026-03-04 Anirud Aggarwal, Abhinav Shrivastava, Matthew Gwilliam

Accelerating Diffusion Transformer via Error-Optimized Cache

Diffusion Transformer (DiT) is a crucial method for content generation. However, it needs a lot of time to sample. Many studies have attempted to use caching to reduce the time consumption of sampling. Existing caching methods accelerate…

Computer Vision and Pattern Recognition · Computer Science 2025-07-21 Junxiang Qiu, Shuo Wang, Jinda Lu, Lin Liu, Houcheng Jiang, Xingyu Zhu, Yanbin Hao

FEB-Cache: Frequency-Guided Exposure Bias Reduction for Enhancing Diffusion Transformer Caching

Diffusion Transformer (DiT) has exhibited impressive generation capabilities but faces great challenges due to its high computational complexity. To address this issue, various methods, notably feature caching, have been introduced.…

Computer Vision and Pattern Recognition · Computer Science 2025-10-07 Zhen Zou, Feng Zhao

CacheQuant: Comprehensively Accelerated Diffusion Models

Diffusion models have gradually gained prominence in the field of image synthesis, showcasing remarkable generative capabilities. Nevertheless, the slow inference and complex networks, resulting from redundancy at both temporal and…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Xuewen Liu, Zhikai Li, Qingyi Gu

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

In this paper, we present FasterCache, a novel training-free strategy designed to accelerate the inference of video diffusion models with high-quality generation. By analyzing existing cache-based methods, we observe that…

Computer Vision and Pattern Recognition · Computer Science 2025-03-13 Zhengyao Lv, Chenyang Si, Junhao Song, Zhenyu Yang, Yu Qiao, Ziwei Liu, Kwan-Yee K. Wong

FORA: Fast-Forward Caching in Diffusion Transformer Acceleration

Diffusion transformers (DiT) have become the de facto choice for generating high-quality images and videos, largely due to their scalability, which enables the construction of larger models for enhanced performance. However, the increased…

Computer Vision and Pattern Recognition · Computer Science 2024-07-02 Pratheba Selvaraju, Tianyu Ding, Tianyi Chen, Ilya Zharkov, Luming Liang

InvarDiff: Cross-Scale Invariance Caching for Accelerated Diffusion Models

Diffusion models deliver high-fidelity synthesis but remain slow due to iterative sampling. We empirically observe there exists feature invariance in deterministic sampling, and present InvarDiff, a training-free acceleration method that…

Computer Vision and Pattern Recognition · Computer Science 2025-12-08 Zihao Wu

Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models

Recent advances in diffusion models have demonstrated remarkable capabilities in video generation. However, the computational intensity remains a significant challenge for practical applications. While feature caching has been proposed to…

Computer Vision and Pattern Recognition · Computer Science 2025-07-29 Xuran Ma, Yexin Liu, Yaofu Liu, Xianfeng Wu, Mingzhe Zheng, Zihao Wang, Ser-Nam Lim, Harry Yang

Frequency-Aware Error-Bounded Caching for Accelerating Diffusion Transformers

Diffusion Transformers (DiTs) have emerged as the dominant architecture for high-quality image and video generation, yet their iterative denoising process incurs substantial computational cost during inference. Existing caching methods…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Guandong Li

Accelerating Diffusion Transformers with Token-wise Feature Caching

Diffusion transformers have shown significant effectiveness in both image and video synthesis at the expense of huge computation costs. To address this problem, feature caching methods have been introduced to accelerate diffusion…

Machine Learning · Computer Science 2025-02-20 Chang Zou, Xuyang Liu, Ting Liu, Siteng Huang, Linfeng Zhang

Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing

With the advance of diffusion models, today's video generation has achieved impressive quality. To extend the generation length and facilitate real-world applications, a majority of video diffusion models (VDMs) generate videos in an…

Computer Vision and Pattern Recognition · Computer Science 2025-05-22 Kaifeng Gao, Jiaxin Shi, Hanwang Zhang, Chunping Wang, Jun Xiao, Long Chen

FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation

Diffusion Transformers (DiT) are powerful generative models but remain computationally intensive due to their iterative structure and deep transformer stacks. To alleviate this inefficiency, we propose FastCache, a…

Machine Learning · Computer Science 2026-03-30 Dong Liu, Yanxuan Yu, Jiayi Zhang, Yifan Li, Ben Lengerich, Ying Nian Wu

Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction

Diffusion models have recently attained significant interest within the community owing to their strong performance as generative models. Furthermore, their application to inverse problems has demonstrated state-of-the-art performance.…

Image and Video Processing · Electrical Eng. & Systems 2022-03-22 Hyungjin Chung, Byeongsu Sim, Jong Chul Ye

Accelerating Rectified Flow Models via Trajectory-Aware Caching

Diffusion and rectified flow (RF) models generate high-fidelity images and videos, but their iterative velocity-field evaluations are computationally expensive. Existing caching methods accelerate sampling by skipping timesteps, yet their…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Xiao Liu, Kai Liu, Naiyang Guan, Hongliang Lu, Zhixin Wang, Zhikai Chen, Renjing Pei, Yulun Zhang

H2-Cache: A Novel Hierarchical Dual-Stage Cache for High-Performance Acceleration of Generative Diffusion Models

Diffusion models have emerged as state-of-the-art in image generation, but their practical deployment is hindered by the significant computational cost of their iterative denoising process. While existing caching techniques can accelerate…

Computer Vision and Pattern Recognition · Computer Science 2025-11-06 Mingyu Sung, Il-Min Kim, Sangseok Yun, Jae-Mo Kang

Compute Only 16 Tokens in One Timestep: Accelerating Diffusion Transformers with Cluster-Driven Feature Caching

Diffusion transformers have gained significant attention in recent years for their ability to generate high-quality images and videos, yet still suffer from a huge computational cost due to their iterative denoising process. Recently,…

Computer Vision and Pattern Recognition · Computer Science 2025-09-15 Zhixin Zheng, Xinyu Wang, Chang Zou, Shaobo Wang, Linfeng Zhang

LatentCRF: Continuous CRF for Efficient Latent Diffusion

Latent Diffusion Models (LDMs) produce high-quality, photo-realistic images; however, the latency incurred by multiple costly inference iterations can restrict their applicability. We introduce LatentCRF, a continuous Conditional Random…

Computer Vision and Pattern Recognition · Computer Science 2024-12-25 Kanchana Ranasinghe, Sadeep Jayasumana, Andreas Veit, Ayan Chakrabarti, Daniel Glasner, Michael S Ryoo, Srikumar Ramalingam, Sanjiv Kumar