Related papers: Patch Diffusion: Faster and More Data-Efficient Tr…

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning

Diffusion models have proven to be highly effective in generating high-quality images. However, adapting large pre-trained diffusion models to new domains remains an open challenge, which is critical for real-world applications. This paper…

Computer Vision and Pattern Recognition · Computer Science 2023-07-28 Enze Xie, Lewei Yao, Han Shi, Zhili Liu, Daquan Zhou, Zhaoqiang Liu, Jiawei Li, Zhenguo Li

Masked Diffusion Models Are Fast Distribution Learners

Diffusion model has emerged as the de-facto model for image generation, yet the heavy training overhead hinders its broader adoption in the research community. We observe that diffusion models are commonly trained to learn all…

Computer Vision and Pattern Recognition · Computer Science 2023-11-29 Jiachen Lei, Qinglong Wang, Peng Cheng, Zhongjie Ba, Zhan Qin, Zhibo Wang, Zhenguang Liu, Kui Ren

Fast Training of Diffusion Models with Masked Transformers

We propose an efficient approach to train large diffusion models with masked transformers. While masked transformers have been extensively explored for representation learning, their application to generative learning is less explored in…

Computer Vision and Pattern Recognition · Computer Science 2024-03-06 Hongkai Zheng, Weili Nie, Arash Vahdat, Anima Anandkumar

AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration

Diffusion models are emerging expressive generative models, in which a large number of time steps (inference steps) are required for a single image generation. To accelerate such tedious process, reducing steps uniformly is considered as an…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Lijiang Li, Huixia Li, Xiawu Zheng, Jie Wu, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan, Fei Chao, Rongrong Ji

Hierarchical Patch Diffusion Models for High-Resolution Video Generation

Diffusion models have demonstrated remarkable performance in image and video synthesis. However, scaling them to high-resolution inputs is challenging and requires restructuring the diffusion pipeline into multiple independent components,…

Computer Vision and Pattern Recognition · Computer Science 2024-06-13 Ivan Skorokhodov, Willi Menapace, Aliaksandr Siarohin, Sergey Tulyakov

SinDiffusion: Learning a Diffusion Model from a Single Natural Image

We present SinDiffusion, leveraging denoising diffusion models to capture internal distribution of patches from a single natural image. SinDiffusion significantly improves the quality and diversity of generated samples compared with…

Computer Vision and Pattern Recognition · Computer Science 2022-11-23 Weilun Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Dong Chen, Lu Yuan, Houqiang Li

Improving Diffusion Model Efficiency Through Patching

Diffusion models are a powerful class of generative models that iteratively denoise samples to produce data. While many works have focused on the number of iterations in this sampling procedure, few have focused on the cost of each…

Machine Learning · Computer Science 2022-07-12 Troy Luhman, Eric Luhman

Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability

Detectors often suffer from performance drop due to domain gap between training and testing data. Recent methods explore diffusion models applied to domain generalization (DG) and adaptation (DA) tasks, but still struggle with large…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Boyong He, Yuxiang Ji, Zhuoyue Tan, Liaoni Wu

DeepCache: Accelerating Diffusion Models for Free

Diffusion models have recently gained unprecedented attention in the field of image synthesis due to their remarkable generative capabilities. Notwithstanding their prowess, these models often incur substantial computational costs,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-11 Xinyin Ma, Gongfan Fang, Xinchao Wang

Decentralized Diffusion Models

Large-scale AI model training divides work across thousands of GPUs, then synchronizes gradients across them at each step. This incurs a significant network burden that only centralized, monolithic clusters can support, driving up…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 David McAllister, Matthew Tancik, Jiaming Song, Angjoo Kanazawa

Patch-Based Diffusion Models Beat Whole-Image Models for Mismatched Distribution Inverse Problems

Diffusion models have achieved excellent success in solving inverse problems due to their ability to learn strong image priors, but existing approaches require a large training dataset of images that should come from the same distribution…

Computer Vision and Pattern Recognition · Computer Science 2024-10-16 Jason Hu, Bowen Song, Jeffrey A. Fessler, Liyue Shen

Faster Training of Diffusion Models and Improved Density Estimation via Parallel Score Matching

In Diffusion Probabilistic Models (DPMs), the task of modeling the score evolution via a single time-dependent neural network necessitates extended training periods and may potentially impede modeling flexibility and capacity. To counteract…

Machine Learning · Computer Science 2023-06-06 Etrit Haxholli, Marco Lorenzi

Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment

In this paper, we point out that suboptimal noise-data mapping leads to slow training of diffusion models. During diffusion training, current methods diffuse each image across the entire noise space, resulting in a mixture of all images at…

Computer Vision and Pattern Recognition · Computer Science 2024-11-01 Yiheng Li, Heyang Jiang, Akio Kodaira, Masayoshi Tomizuka, Kurt Keutzer, Chenfeng Xu

Learning Image Priors through Patch-based Diffusion Models for Solving Inverse Problems

Diffusion models can learn strong image priors from underlying data distribution and use them to solve inverse problems, but the training process is computationally expensive and requires lots of data. Such bottlenecks prevent most existing…

Computer Vision and Pattern Recognition · Computer Science 2025-09-24 Jason Hu, Bowen Song, Xiaojian Xu, Liyue Shen, Jeffrey A. Fessler

A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training

Training diffusion models is always a computation-intensive task. In this paper, we introduce a novel speed-up method for diffusion model training, which is based on a closer look at time steps. Our key findings are: i) Time steps…

Machine Learning · Computer Science 2025-03-26 Kai Wang, Mingjia Shi, Yukun Zhou, Zekai Li, Zhihang Yuan, Yuzhang Shang, Xiaojiang Peng, Hanwang Zhang, Yang You

Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting

Diffusion models have significantly advanced the field of generative modeling. However, training a diffusion model is computationally expensive, creating a pressing need to adapt off-the-shelf diffusion models for downstream generation…

Machine Learning · Computer Science 2024-06-07 Jincheng Zhong, Xingzhuo Guo, Jiaxiang Dong, Mingsheng Long

PQD: Post-training Quantization for Efficient Diffusion Models

Diffusion models (DMs) have demonstrated remarkable achievements in synthesizing images of high fidelity and diversity. However, the extensive computational requirements and slow generative speed of diffusion models have limited their widespread…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Jiaojiao Ye, Zhen Wang, Linnan Jiang

Nonparametric Data Attribution for Diffusion Models

Data attribution for generative models seeks to quantify the influence of individual training examples on model outputs. Existing methods for diffusion models typically require access to model gradients or retraining, limiting their…

Machine Learning · Computer Science 2025-10-17 Yutian Zhao, Chao Du, Xiaosen Zheng, Tianyu Pang, Min Lin

Prompt Diffusion Robustifies Any-Modality Prompt Learning

Foundation models enable prompt-based classifiers for zero-shot and few-shot learning. Nonetheless, the conventional method of employing fixed prompts suffers from distributional shifts that negatively impact generalizability to unseen…

Machine Learning · Computer Science 2024-10-29 Yingjun Du, Gaowen Liu, Yuzhang Shang, Yuguang Yao, Ramana Kompella, Cees G. M. Snoek

Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models

Diffusion models have emerged as powerful generative frameworks by progressively adding noise to data through a forward process and then reversing this process to generate realistic samples. While these models have achieved strong…

Machine Learning · Computer Science 2025-03-04 Xingzhuo Guo, Yu Zhang, Baixu Chen, Haoran Xu, Jianmin Wang, Mingsheng Long