Related papers: Upsample Guidance: Scale Up Diffusion Models witho…

DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance

Large-scale generative models, such as text-to-image diffusion models, have garnered widespread attention across diverse domains due to their creative and high-fidelity image generation. Nonetheless, existing large-scale diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-08-28 Younghyun Kim , Geunmin Hwang , Junyu Zhang , Eunbyung Park

Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

Diffusion models have proven to be highly effective in image and video generation; however, they encounter challenges in the correct composition of objects when generating images of varying sizes due to single-scale training data. Adapting…

Computer Vision and Pattern Recognition · Computer Science 2024-09-23 Lanqing Guo , Yingqing He , Haoxin Chen , Menghan Xia , Xiaodong Cun , Yufei Wang , Siyu Huang , Yong Zhang , Xintao Wang , Qifeng Chen , Ying Shan , Bihan Wen

Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models

Guidance is a crucial technique for extracting the best performance out of image-generating diffusion models. Traditionally, a constant guidance weight has been applied throughout the sampling chain of an image. We show that guidance is…

Computer Vision and Pattern Recognition · Computer Science 2024-11-07 Tuomas Kynkäänniemi , Miika Aittala , Tero Karras , Samuli Laine , Timo Aila , Jaakko Lehtinen

FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes

The class-conditional image generation based on diffusion models is renowned for generating high-quality and diverse images. However, most prior efforts focus on generating images for general categories, e.g., 1000 classes in ImageNet-1k. A…

Computer Vision and Pattern Recognition · Computer Science 2024-06-05 Ziying Pan , Kun Wang , Gang Li , Feihong He , Yongxuan Lai

Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models

In light of the widespread success of generative models, a significant amount of research has gone into speeding up their sampling time. However, generative models are often sampled multiple times to obtain a diverse set incurring a cost…

Machine Learning · Computer Science 2023-11-27 Gabriele Corso , Yilun Xu , Valentin de Bortoli , Regina Barzilay , Tommi Jaakkola

Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance

Masked generative models (MGMs) have shown impressive generative ability while providing an order of magnitude efficient sampling steps compared to continuous diffusion models. However, MGMs still underperform in image synthesis compared to…

Computer Vision and Pattern Recognition · Computer Science 2024-10-18 Jiwan Hur , Dong-Jae Lee , Gyojin Han , Jaehyun Choi , Yunho Jeon , Junmo Kim

On Distillation of Guided Diffusion Models

Classifier-free guided diffusion models have recently been shown to be highly effective at high-resolution image generation, and they have been widely used in large-scale diffusion frameworks including DALLE-2, Stable Diffusion and Imagen.…

Computer Vision and Pattern Recognition · Computer Science 2023-04-14 Chenlin Meng , Robin Rombach , Ruiqi Gao , Diederik P. Kingma , Stefano Ermon , Jonathan Ho , Tim Salimans

Diffusion Self-Guidance for Controllable Image Generation

Large-scale generative models are capable of producing high-quality images from detailed text descriptions. However, many aspects of an image are difficult or impossible to convey through text. We introduce self-guidance, a method that…

Computer Vision and Pattern Recognition · Computer Science 2023-06-13 Dave Epstein , Allan Jabri , Ben Poole , Alexei A. Efros , Aleksander Holynski

Image retrieval outperforms diffusion models on data augmentation

Many approaches have been proposed to use diffusion models to augment training datasets for downstream tasks, such as classification. However, diffusion models are themselves trained on large datasets, often with noisy annotations, and it…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Max F. Burg , Florian Wenzel , Dominik Zietlow , Max Horn , Osama Makansi , Francesco Locatello , Chris Russell

Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance

Diffusion models have shown significant progress in image translation tasks recently. However, due to their stochastic nature, there's often a trade-off between style transformation and content preservation. Current strategies aim to…

Computer Vision and Pattern Recognition · Computer Science 2023-06-08 Gihyun Kwon , Jong Chul Ye

Self-Guided Diffusion Models

Diffusion models have demonstrated remarkable progress in image generation quality, especially when guidance is used to control the generative process. However, guidance requires a large amount of image-annotation pairs for training and is…

Computer Vision and Pattern Recognition · Computer Science 2023-11-28 Vincent Tao Hu , David W Zhang , Yuki M. Asano , Gertjan J. Burghouts , Cees G. M. Snoek

Enhancing Diffusion Model Guidance through Calibration and Regularization

Classifier-guided diffusion models have emerged as a powerful approach for conditional image generation, but they suffer from overconfident predictions during early denoising steps, causing the guidance gradient to vanish. This paper…

Computer Vision and Pattern Recognition · Computer Science 2025-12-23 Seyed Alireza Javid , Amirhossein Bagheri , Nuria González-Prelcic

High-Resolution Image Editing via Multi-Stage Blended Diffusion

Diffusion models have shown great results in image generation and in image editing. However, current approaches are limited to low resolutions due to the computational cost of training diffusion models for high-resolution generation. We…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Johannes Ackermann , Minjun Li

Diffusion Models Beat GANs on Image Synthesis

We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For…

Machine Learning · Computer Science 2021-06-02 Prafulla Dhariwal , Alex Nichol

Universal Guidance for Diffusion Models

Typical diffusion models are trained to accept a particular form of conditioning, most commonly text, and cannot be conditioned on other modalities without retraining. In this work, we propose a universal guidance algorithm that enables…

Computer Vision and Pattern Recognition · Computer Science 2023-02-15 Arpit Bansal , Hong-Min Chu , Avi Schwarzschild , Soumyadip Sengupta , Micah Goldblum , Jonas Geiping , Tom Goldstein

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

In this work, we investigate the capability of generating images from pre-trained diffusion models at much higher resolutions than the training image sizes. In addition, the generated images should have arbitrary image aspect ratios. When…

Computer Vision and Pattern Recognition · Computer Science 2023-10-12 Yingqing He , Shaoshu Yang , Haoxin Chen , Xiaodong Cun , Menghan Xia , Yong Zhang , Xintao Wang , Ran He , Qifeng Chen , Ying Shan

Accelerating Guided Diffusion Sampling with Splitting Numerical Methods

Guided diffusion is a technique for conditioning the output of a diffusion model at sampling time without retraining the network for each specific task. One drawback of diffusion models, however, is their slow sampling process. Recent…

Computer Vision and Pattern Recognition · Computer Science 2023-01-30 Suttisak Wizadwongsa , Supasorn Suwajanakorn

Fine color guidance in diffusion models and its application to image compression at extremely low bitrates

This study addresses the challenge of, without training or fine-tuning, controlling the global color aspect of images generated with a diffusion model. We rewrite the guidance equations to ensure that the outputs are closer to a known color…

Computer Vision and Pattern Recognition · Computer Science 2024-04-11 Tom Bordin , Thomas Maugey

Guiding a Diffusion Model with a Bad Version of Itself

The primary axes of interest in image-generating diffusion models are image quality, the amount of variation in the results, and how well the results align with a given condition, e.g., a class label or a text prompt. The popular…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Tero Karras , Miika Aittala , Tuomas Kynkäänniemi , Jaakko Lehtinen , Timo Aila , Samuli Laine

Diffusion Posterior Proximal Sampling for Image Restoration

Diffusion models have demonstrated remarkable efficacy in generating high-quality samples. Existing diffusion-based image restoration algorithms exploit pre-trained diffusion models to leverage data priors, yet they still preserve elements…

Image and Video Processing · Electrical Eng. & Systems 2024-08-07 Hongjie Wu , Linchao He , Mingqin Zhang , Dongdong Chen , Kunming Luo , Mengting Luo , Ji-Zhe Zhou , Hu Chen , Jiancheng Lv