Related papers: Denoising Task Difficulty-based Curriculum for Tra…

Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

With the success of image generation, generative diffusion models are increasingly adopted for discriminative tasks, as pixel generation provides a unified perception interface. However, directly repurposing the generative denoising process…

Computer Vision and Pattern Recognition · Computer Science 2025-04-16 Ziqi Pang , Xin Xu , Yu-Xiong Wang

Addressing Negative Transfer in Diffusion Models

Diffusion-based generative models have achieved remarkable success in various domains. It trains a shared model on denoising tasks that encompass different noise levels simultaneously, representing a form of multi-task learning (MTL).…

Computer Vision and Pattern Recognition · Computer Science 2024-01-02 Hyojun Go , JinYoung Kim , Yunsung Lee , Seunghyun Lee , Shinhyeok Oh , Hyeongdon Moon , Seungtaek Choi

Adaptive Training Meets Progressive Scaling: Elevating Efficiency in Diffusion Models

Diffusion models have demonstrated remarkable efficacy in various generative tasks with the predictive prowess of denoising model. Currently, diffusion models employ a uniform denoising model across all timesteps. However, the inherent…

Machine Learning · Computer Science 2024-12-30 Wenhao Li , Xiu Su , Yu Han , Shan You , Tao Huang , Chang Xu

Empowering Diffusion Models on the Embedding Space for Text Generation

Diffusion models have achieved state-of-the-art synthesis quality on both visual and audio tasks, and recent works further adapt them to textual data by diffusing on the embedding space. In this paper, we conduct systematic studies of the…

Computation and Language · Computer Science 2024-04-23 Zhujin Gao , Junliang Guo , Xu Tan , Yongxin Zhu , Fang Zhang , Jiang Bian , Linli Xu

Denoising Task Routing for Diffusion Models

Diffusion models generate highly realistic images by learning a multi-step denoising process, naturally embodying the principles of multi-task learning (MTL). Despite the inherent connection between diffusion models and MTL, there remains…

Computer Vision and Pattern Recognition · Computer Science 2024-02-22 Byeongjun Park , Sangmin Woo , Hyojun Go , Jin-Young Kim , Changick Kim

Diffusion Models in Vision: A Survey

Denoising diffusion models represent a recent emerging topic in computer vision, demonstrating remarkable results in the area of generative modeling. A diffusion model is a deep generative model that is based on two stages, a forward…

Computer Vision and Pattern Recognition · Computer Science 2025-01-17 Florinel-Alin Croitoru , Vlad Hondru , Radu Tudor Ionescu , Mubarak Shah

Memorization Control in Diffusion Models from Denoising-centric Perspective

Controlling memorization in diffusion models is critical for applications that require generated data to closely match the training distribution. Existing approaches mainly focus on data centric or model centric modifications, treating the…

Machine Learning · Computer Science 2026-01-30 Thuy Phuong Vu , Mai Viet Hoang Do , Minhhuy Le , Dinh-Cuong Hoang , Phan Xuan Tan

Towards Faster Training of Diffusion Models: An Inspiration of A Consistency Phenomenon

Diffusion models (DMs) are a powerful generative framework that have attracted significant attention in recent years. However, the high computational cost of training DMs limits their practical applications. In this paper, we start with a…

Machine Learning · Computer Science 2024-04-12 Tianshuo Xu , Peng Mi , Ruilin Wang , Yingcong Chen

Efficient Diffusion Training via Min-SNR Weighting Strategy

Denoising diffusion models have been a mainstream approach for image generation, however, training these models often suffers from slow convergence. In this paper, we discovered that the slow convergence is partly due to conflicting…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Tiankai Hang , Shuyang Gu , Chen Li , Jianmin Bao , Dong Chen , Han Hu , Xin Geng , Baining Guo

Diffusion Model for Generative Image Denoising

In supervised learning for image denoising, usually the paired clean images and noisy images are collected or synthesised to train a denoising model. L2 norm loss or other distance functions are used as the objective function for training.…

Computer Vision and Pattern Recognition · Computer Science 2023-02-07 Yutong Xie , Minne Yuan , Bin Dong , Quanzheng Li

Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation

Diffusion- and flow-based models usually allocate compute uniformly across space, updating all patches with the same timestep and number of function evaluations. While convenient, this ignores the heterogeneity of natural images: some…

Computer Vision and Pattern Recognition · Computer Science 2026-04-22 Johannes Schusterbauer , Ming Gui , Yusong Li , Pingchuan Ma , Felix Krause , Björn Ommer

Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning

Diffusion models are trained by learning a sequence of models that reverse each step of noise corruption. Typically, the model parameters are fully shared across multiple timesteps to enhance training efficiency. However, since the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-17 Qianli Ma , Xuefei Ning , Dongrui Liu , Li Niu , Linfeng Zhang

Noise Estimation for Generative Diffusion Models

Generative diffusion models have emerged as leading models in speech and image generation. However, in order to perform well with a small number of denoising steps, a costly tuning of the set of noise parameters is needed. In this work, we…

Machine Learning · Computer Science 2021-09-14 Robin San-Roman , Eliya Nachmani , Lior Wolf

Tackling the Generative Learning Trilemma with Denoising Diffusion GANs

A wide variety of deep generative models has been developed in the past decade. Yet, these models often struggle with simultaneously addressing three key requirements including: high sample quality, mode coverage, and fast sampling. We call…

Machine Learning · Computer Science 2022-04-06 Zhisheng Xiao , Karsten Kreis , Arash Vahdat

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Generative models have made significant impacts across various domains, largely due to their ability to scale during training by increasing data, computational resources, and model size, a phenomenon characterized by the scaling laws.…

Computer Vision and Pattern Recognition · Computer Science 2025-01-17 Nanye Ma , Shangyuan Tong , Haolin Jia , Hexiang Hu , Yu-Chuan Su , Mingda Zhang , Xuan Yang , Yandong Li , Tommi Jaakkola , Xuhui Jia , Saining Xie

Explaining generative diffusion models via visual analysis for interpretable decision-making process

Diffusion models have demonstrated remarkable performance in generation tasks. Nevertheless, explaining the diffusion process remains challenging due to it being a sequence of denoising noisy images that are difficult for experts to…

Computer Vision and Pattern Recognition · Computer Science 2024-02-19 Ji-Hoon Park , Yeong-Joon Ju , Seong-Whan Lee

Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment

Diffusion models have shown remarkable success across generative tasks, yet their high computational demands challenge deployment on resource-limited platforms. This paper investigates a critical question for compute-optimal diffusion model…

Computer Vision and Pattern Recognition · Computer Science 2025-08-11 Zhenbang Du , Yonggan Fu , Lifu Wang , Jiayi Qian , Xiao Luo , Yingyan , Lin

Diffusion models under low-noise regime

Recent work on diffusion models proposed that they operate in two regimes: memorization, in which models reproduce their training data, and generalization, in which they generate novel samples. While this has been tested in high-noise…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Elizabeth Pavlova , Xue-Xin Wei

Diffusion Models in Low-Level Vision: A Survey

Deep generative models have garnered significant attention in low-level vision tasks due to their generative capabilities. Among them, diffusion model-based solutions, characterized by a forward diffusion process and a reverse denoising…

Computer Vision and Pattern Recognition · Computer Science 2025-02-26 Chunming He , Yuqi Shen , Chengyu Fang , Fengyang Xiao , Longxiang Tang , Yulun Zhang , Wangmeng Zuo , Zhenhua Guo , Xiu Li

Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion

There is strong empirical evidence that the state-of-the-art diffusion modeling paradigm leads to models that memorize the training set, especially when the training set is small. Prior methods to mitigate the memorization problem often…

Machine Learning · Computer Science 2026-03-03 Kulin Shah , Alkis Kalavasis , Adam R. Klivans , Giannis Daras