Related papers: TMPDiff: Temporal Mixed-Precision for Diffusion Mo…

TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models

Diffusion models have emerged as preeminent contenders in the realm of generative models. Distinguished by their distinctive sequential generative processes, characterized by hundreds or even thousands of timesteps, diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Haojun Sun , Chen Tang , Zhi Wang , Yuan Meng , Jingyan jiang , Xinzhu Ma , Wenwu Zhu

AsyncDiff: Asynchronous Timestep Conditioning for Enhanced Text-to-Image Diffusion Inference

Text-to-image diffusion inference typically follows synchronized schedules, where the numerical integrator advances the latent state to the same timestep at which the denoiser is conditioned. We propose an asynchronous inference mechanism…

Computer Vision and Pattern Recognition · Computer Science 2025-12-23 Longhuan Xu , Feng Yin , Cunjian Chen

SimDiff: Simpler Yet Better Diffusion Model for Time Series Point Forecasting

Diffusion models have recently shown promise in time series forecasting, particularly for probabilistic predictions. However, they often fail to achieve state-of-the-art point estimation performance compared to regression-based methods.…

Artificial Intelligence · Computer Science 2025-11-25 Hang Ding , Xue Wang , Tian Zhou , Tao Yao

ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech

Denoising diffusion probabilistic models (DDPMs) have recently achieved leading performances in many generative tasks. However, the inherited iterative sampling process costs hinder their applications to text-to-speech deployment. Through…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-14 Rongjie Huang , Zhou Zhao , Huadai Liu , Jinglin Liu , Chenye Cui , Yi Ren

Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner

A diffusion model, which is formulated to produce an image using thousands of denoising steps, usually suffers from a slow inference speed. Existing acceleration algorithms simplify the sampling by skipping most steps yet exhibit…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Mengfei Xia , Yujun Shen , Changsong Lei , Yu Zhou , Ran Yi , Deli Zhao , Wenping Wang , Yong-Jin Liu

Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation

Diffusion and flow matching models have achieved remarkable success in text-to-image generation. However, these models typically rely on the predetermined denoising schedules for all prompts. The multi-step reverse diffusion process can be…

Computer Vision and Pattern Recognition · Computer Science 2025-03-06 Zilyu Ye , Zhiyang Chen , Tiancheng Li , Zemin Huang , Weijian Luo , Guo-Jun Qi

MPQ-Diff: Mixed Precision Quantization for Diffusion Models

Diffusion models (DMs) generate remarkable high quality images via the stochastic denoising process, which unfortunately incurs high sampling time. Post-quantizing the trained diffusion models in fixed bit-widths, e.g., 4 bits on weights…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Rocco Manz Maruzzelli , Basile Lewandowski , Lydia Y. Chen

Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis

Diffusion-based models have achieved state-of-the-art performance on text-to-image synthesis tasks. However, one critical limitation of these models is the low fidelity of generated images with respect to the text description, such as…

Computer Vision and Pattern Recognition · Computer Science 2023-04-11 Qiucheng Wu , Yujian Liu , Handong Zhao , Trung Bui , Zhe Lin , Yang Zhang , Shiyu Chang

UniDiff: A Unified Diffusion Framework for Multimodal Time Series Forecasting

As multimodal data proliferates across diverse real-world applications, leveraging heterogeneous information such as texts and timestamps for accurate time series forecasting (TSF) has become a critical challenge. While diffusion models…

Machine Learning · Computer Science 2025-12-09 Da Zhang , Bingyu Li , Zhuyuan Zhao , Junyu Gao , Feiping Nie , Xuelong Li

BADiff: Bandwidth Adaptive Diffusion Model

In this work, we propose a novel framework to enable diffusion models to adapt their generation quality based on real-time network bandwidth constraints. Traditional diffusion models produce high-fidelity images by performing a fixed number…

Computer Vision and Pattern Recognition · Computer Science 2026-04-10 Xi Zhang , Hanwei Zhu , Yan Zhong , Jiamang Wang , Weisi Lin

Unifying Continuous and Discrete Text Diffusion with Non-simultaneous Diffusion Processes

Diffusion models have emerged as a promising approach for text generation, with recent works falling into two main categories: discrete and continuous diffusion models. Discrete diffusion models apply token corruption independently using…

Computation and Language · Computer Science 2025-05-29 Bocheng Li , Zhujin Gao , Linli Xu

TPDiff: Temporal Pyramid Video Diffusion Model

The development of video diffusion models unveils a significant challenge: the substantial computational demands. To mitigate this challenge, we note that the reverse process of diffusion exhibits an inherent entropy-reducing nature. Given…

Computer Vision and Pattern Recognition · Computer Science 2025-03-13 Lingmin Ran , Mike Zheng Shou

Timestep-Aware Block Masking for Efficient Diffusion Model Inference

Diffusion Probabilistic Models (DPMs) have achieved great success in image generation but suffer from high inference latency due to their iterative denoising nature. Motivated by the evolving feature dynamics across the denoising…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Haodong He , Yuan Gao , Weizhong Zhang , Gui-Song Xia

TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models

Diffusion models have opened the path to a wide range of text-based image editing frameworks. However, these typically build on the multi-step nature of the diffusion backwards process, and adapting them to distilled, fast-sampling methods…

Computer Vision and Pattern Recognition · Computer Science 2024-08-02 Gilad Deutch , Rinon Gal , Daniel Garibi , Or Patashnik , Daniel Cohen-Or

MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization

Diffusion models have achieved significant visual generation quality. However, their significant computational and memory costs pose challenge for their application on resource-constrained mobile devices or even desktop GPUs. Recent…

Computer Vision and Pattern Recognition · Computer Science 2024-05-31 Tianchen Zhao , Xuefei Ning , Tongcheng Fang , Enshu Liu , Guyue Huang , Zinan Lin , Shengen Yan , Guohao Dai , Yu Wang

Timestep-Aware Correction for Quantized Diffusion Models

Diffusion models have marked a significant breakthrough in the synthesis of semantically coherent images. However, their extensive noise estimation networks and the iterative generation process limit their wider application, particularly on…

Computer Vision and Pattern Recognition · Computer Science 2024-07-08 Yuzhe Yao , Feng Tian , Jun Chen , Haonan Lin , Guang Dai , Yong Liu , Jingdong Wang

DRiffusion: Draft-and-Refine Process Parallelizes Diffusion Models with Ease

Diffusion models have achieved remarkable success in generating high-fidelity content but suffer from slow, iterative sampling, resulting in high latency that limits their use in interactive applications. We introduce DRiffusion, a parallel…

Machine Learning · Computer Science 2026-03-30 Runsheng Bai , Chengyu Zhang , Yangdong Deng

Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

Diffusion models have recently achieved great success in the synthesis of high-quality images and videos. However, the existing denoising techniques in diffusion models are commonly based on step-by-step noise predictions, which suffers…

Computer Vision and Pattern Recognition · Computer Science 2024-10-15 Hancheng Ye , Jiakang Yuan , Renqiu Xia , Xiangchao Yan , Tao Chen , Junchi Yan , Botian Shi , Bo Zhang

Fast Inference in Denoising Diffusion Models via MMD Finetuning

Denoising Diffusion Models (DDMs) have become a popular tool for generating high-quality samples from complex data distributions. These models are able to capture sophisticated patterns and structures in the data, and can generate samples…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Emanuele Aiello , Diego Valsesia , Enrico Magli

Enhancing Label-efficient Medical Image Segmentation with Text-guided Diffusion Models

Aside from offering state-of-the-art performance in medical image generation, denoising diffusion probabilistic models (DPM) can also serve as a representation learner to capture semantic information and potentially be used as an image…

Image and Video Processing · Electrical Eng. & Systems 2024-07-09 Chun-Mei Feng