Related papers: Relational Feature Caching for Accelerating Diffus…

ProCache: Constraint-Aware Feature Caching with Selective Computation for Diffusion Transformer Acceleration

Diffusion Transformers (DiTs) have achieved state-of-the-art performance in generative modeling, yet their high computational cost hinders real-time deployment. While feature caching offers a promising training-free acceleration solution by…

Computer Vision and Pattern Recognition · Computer Science 2026-02-16 Fanpu Cao , Yaofo Chen , Zeng You , Wei Luo

Forecast then Calibrate: Feature Caching as ODE for Efficient Diffusion Transformers

Diffusion Transformers (DiTs) have demonstrated exceptional performance in high-fidelity image and video generation. To reduce their substantial computational costs, feature caching techniques have been proposed to accelerate inference by…

Computer Vision and Pattern Recognition · Computer Science 2025-08-25 Shikang Zheng , Liang Feng , Xinyu Wang , Qinming Zhou , Peiliang Cai , Chang Zou , Jiacheng Liu , Yuqi Lin , Junjie Chen , Yue Ma , Linfeng Zhang

Accelerating Frequency Domain Diffusion Models with Error-Feedback Event-Driven Caching

Diffusion models achieve remarkable success in time series generation. However, slow inference limits their practical deployment. We propose E$^2$-CRF (Error-Feedback Event-Driven Cumulative Residual Feature caching) to accelerate frequency…

Machine Learning · Computer Science 2026-04-28 Dong Liu , Haisheng Wang , Yanxuan Yu

FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation

Diffusion Transformers (DiT) are powerful generative models but remain computationally intensive due to their iterative structure and deep transformer stacks. To alleviate this inefficiency, we propose \textbf{FastCache}, a…

Machine Learning · Computer Science 2026-03-30 Dong Liu , Yanxuan Yu , Jiayi Zhang , Yifan Li , Ben Lengerich , Ying Nian Wu

FEB-Cache: Frequency-Guided Exposure Bias Reduction for Enhancing Diffusion Transformer Caching

Diffusion Transformer (DiT) has exhibited impressive generation capabilities but faces great challenges due to its high computational complexity. To address this issue, various methods, notably feature caching, have been introduced.…

Computer Vision and Pattern Recognition · Computer Science 2025-10-07 Zhen Zou , Feng Zhao

Accelerating Diffusion Transformer via Error-Optimized Cache

Diffusion Transformer (DiT) is a crucial method for content generation. However, it needs a lot of time to sample. Many studies have attempted to use caching to reduce the time consumption of sampling. Existing caching methods accelerate…

Computer Vision and Pattern Recognition · Computer Science 2025-07-21 Junxiang Qiu , Shuo Wang , Jinda Lu , Lin Liu , Houcheng Jiang , Xingyu Zhu , Yanbin Hao

Forecasting When to Forecast: Accelerating Diffusion Models with Confidence-Gated Taylor

Diffusion Transformers (DiTs) have demonstrated remarkable performance in visual generation tasks. However, their low inference speed limits their deployment in low-resource applications. Recent training-free approaches exploit the…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Xiaoliu Guan , Lielin Jiang , Hanqi Chen , Xu Zhang , Jiaxing Yan , Guanzhong Wang , Yi Liu , Zetao Zhang , Yu Wu

Accelerating Diffusion Transformers with Token-wise Feature Caching

Diffusion transformers have shown significant effectiveness in both image and video synthesis at the expense of huge computation costs. To address this problem, feature caching methods have been introduced to accelerate diffusion…

Machine Learning · Computer Science 2025-02-20 Chang Zou , Xuyang Liu , Ting Liu , Siteng Huang , Linfeng Zhang

FreqCa: Accelerating Diffusion Models via Frequency-Aware Caching

The application of diffusion transformers is suffering from their significant inference costs. Recently, feature caching has been proposed to solve this problem by reusing features from previous timesteps, thereby skipping computation in…

Machine Learning · Computer Science 2025-10-13 Jiacheng Liu , Peiliang Cai , Qinming Zhou , Yuqi Lin , Deyang Kong , Benhao Huang , Yupei Pan , Haowen Xu , Chang Zou , Junshu Tang , Shikang Zheng , Linfeng Zhang

Forecast the Principal, Stabilize the Residual: Subspace-Aware Feature Caching for Efficient Diffusion Transformers

Diffusion Transformer (DiT) models have achieved unprecedented quality in image and video generation, yet their iterative sampling process remains computationally prohibitive. To accelerate inference, feature caching methods have emerged by…

Computer Vision and Pattern Recognition · Computer Science 2026-01-13 Guantao Chen , Shikang Zheng , Yuqi Lin , Linfeng Zhang

Frequency-Aware Error-Bounded Caching for Accelerating Diffusion Transformers

Diffusion Transformers (DiTs) have emerged as the dominant architecture for high-quality image and video generation, yet their iterative denoising process incurs substantial computational cost during inference. Existing caching methods…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Guandong Li

From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers

Diffusion Transformers (DiT) have revolutionized high-fidelity image and video synthesis, yet their computational demands remain prohibitive for real-time applications. To solve this problem, feature caching has been proposed to accelerate…

Computer Vision and Pattern Recognition · Computer Science 2025-08-12 Jiacheng Liu , Chang Zou , Yuanhuiyi Lyu , Junjie Chen , Linfeng Zhang

Accelerating Diffusion Transformer via Gradient-Optimized Cache

Feature caching has emerged as an effective strategy to accelerate diffusion transformer (DiT) sampling through temporal feature reuse. It is a challenging problem since (1) Progressive error accumulation from cached blocks significantly…

Computer Vision and Pattern Recognition · Computer Science 2025-07-21 Junxiang Qiu , Lin Liu , Shuo Wang , Jinda Lu , Kezhou Chen , Yanbin Hao

DiCache: Let Diffusion Model Determine Its Own Cache

Recent years have witnessed the rapid development of acceleration techniques for diffusion models, especially caching-based acceleration methods. These studies seek to answer two fundamental questions: "When to cache" and "How to use…

Computer Vision and Pattern Recognition · Computer Science 2025-10-03 Jiazi Bu , Pengyang Ling , Yujie Zhou , Yibin Wang , Yuhang Zang , Dahua Lin , Jiaqi Wang

FORA: Fast-Forward Caching in Diffusion Transformer Acceleration

Diffusion transformers (DiT) have become the de facto choice for generating high-quality images and videos, largely due to their scalability, which enables the construction of larger models for enhanced performance. However, the increased…

Computer Vision and Pattern Recognition · Computer Science 2024-07-02 Pratheba Selvaraju , Tianyu Ding , Tianyi Chen , Ilya Zharkov , Luming Liang

Rethinking Token-wise Feature Caching: Accelerating Diffusion Transformers with Dual Feature Caching

Diffusion Transformers (DiT) have become the dominant methods in image and video generation yet still suffer substantial computational costs. As an effective approach for DiT acceleration, feature caching methods are designed to cache the…

Machine Learning · Computer Science 2025-11-19 Chang Zou , Evelyn Zhang , Runlin Guo , Haohang Xu , Conghui He , Xuming Hu , Linfeng Zhang

AdaCorrection: Adaptive Offset Cache Correction for Accurate Diffusion Transformers

Diffusion Transformers (DiTs) achieve state-of-the-art performance in high-fidelity image and video generation but suffer from expensive inference due to their iterative denoising structure. While prior methods accelerate sampling by…

Computer Vision and Pattern Recognition · Computer Science 2026-05-11 Dong Liu , Yanxuan Yu , Ben Lengerich , Ying Nian Wu

Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models

Recent advances in diffusion models have demonstrated remarkable capabilities in video generation. However, the computational intensity remains a significant challenge for practical applications. While feature caching has been proposed to…

Computer Vision and Pattern Recognition · Computer Science 2025-07-29 Xuran Ma , Yexin Liu , Yaofu Liu , Xianfeng Wu , Mingzhe Zheng , Zihao Wang , Ser-Nam Lim , Harry Yang

ERTACache: Error Rectification and Timesteps Adjustment for Efficient Diffusion

Diffusion models suffer from substantial computational overhead due to their inherently iterative inference process. While feature caching offers a promising acceleration strategy by reusing intermediate outputs across timesteps, naive…

Computer Vision and Pattern Recognition · Computer Science 2026-02-11 Xurui Peng , Chenqian Yan , Hong Liu , Rui Ma , Fangmin Chen , Xing Wang , Zhihua Wu , Songwei Liu , Mingbao Lin

HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration

Diffusion Transformers (DiTs) excel in generative tasks but face practical deployment challenges due to high inference costs. Feature caching, which stores and retrieves redundant computations, offers the potential for acceleration.…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Yushi Huang , Zining Wang , Ruihao Gong , Jing Liu , Xinjie Zhang , Jinyang Guo , Xianglong Liu , Jun Zhang