Related papers: Universal Guidance for Diffusion Models

Guiding a Diffusion Model with a Bad Version of Itself

The primary axes of interest in image-generating diffusion models are image quality, the amount of variation in the results, and how well the results align with a given condition, e.g., a class label or a text prompt. The popular…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Tero Karras, Miika Aittala, Tuomas Kynkäänniemi, Jaakko Lehtinen, Timo Aila, Samuli Laine

Simple Guidance Mechanisms for Discrete Diffusion Models

Diffusion models for continuous data gained widespread adoption owing to their high quality generation and control mechanisms. However, controllable diffusion on discrete data faces challenges given that continuous guidance methods do not…

Machine Learning · Computer Science 2025-05-29 Yair Schiff, Subham Sekhar Sahoo, Hao Phung, Guanghan Wang, Sam Boshar, Hugo Dalla-torre, Bernardo P. de Almeida, Alexander Rush, Thomas Pierrot, Volodymyr Kuleshov

Self-Guided Diffusion Models

Diffusion models have demonstrated remarkable progress in image generation quality, especially when guidance is used to control the generative process. However, guidance requires a large amount of image-annotation pairs for training and is…

Computer Vision and Pattern Recognition · Computer Science 2023-11-28 Vincent Tao Hu, David W Zhang, Yuki M. Asano, Gertjan J. Burghouts, Cees G. M. Snoek

Fine color guidance in diffusion models and its application to image compression at extremely low bitrates

This study addresses the challenge of, without training or fine-tuning, controlling the global color aspect of images generated with a diffusion model. We rewrite the guidance equations to ensure that the outputs are closer to a known color…

Computer Vision and Pattern Recognition · Computer Science 2024-04-11 Tom Bordin, Thomas Maugey

Gradient Guidance for Diffusion Models: An Optimization Perspective

Diffusion models have demonstrated empirical successes in various applications and can be adapted to task-specific needs via guidance. This paper studies a form of gradient guidance for adapting a pre-trained diffusion model towards…

Machine Learning · Statistics 2024-10-17 Yingqing Guo, Hui Yuan, Yukang Yang, Minshuo Chen, Mengdi Wang

Fine-grained Image Editing by Pixel-wise Guidance Using Diffusion Models

Our goal is to develop fine-grained real-image editing methods suitable for real-world applications. In this paper, we first summarize four requirements for these methods and propose a novel diffusion-based image editing framework with…

Computer Vision and Pattern Recognition · Computer Science 2023-06-01 Naoki Matsunaga, Masato Ishii, Akio Hayakawa, Kenji Suzuki, Takuya Narihira

General and Efficient Steering of Unconditional Diffusion

Guiding unconditional diffusion models typically requires either retraining with conditional inputs or per-step gradient computations (e.g., classifier-based guidance), both of which incur substantial computational overhead. We present a…

Machine Learning · Computer Science 2026-02-13 Qingsong Wang, Mikhail Belkin, Yusu Wang

Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators

Diffusion models are capable of generating impressive images conditioned on text descriptions, and extensions of these models allow users to edit images at a relatively coarse scale. However, the ability to precisely edit the layout,…

Computer Vision and Pattern Recognition · Computer Science 2024-02-01 Daniel Geng, Andrew Owens

Diffusion Self-Guidance for Controllable Image Generation

Large-scale generative models are capable of producing high-quality images from detailed text descriptions. However, many aspects of an image are difficult or impossible to convey through text. We introduce self-guidance, a method that…

Computer Vision and Pattern Recognition · Computer Science 2023-06-13 Dave Epstein, Allan Jabri, Ben Poole, Alexei A. Efros, Aleksander Holynski

Upsample Guidance: Scale Up Diffusion Models without Training

Diffusion models have demonstrated superior performance across various generative tasks including images, videos, and audio. However, they encounter difficulties in directly generating high-resolution samples. Previously proposed solutions…

Computer Vision and Pattern Recognition · Computer Science 2024-04-03 Juno Hwang, Yong-Hyun Park, Junghyo Jo

Guided Diffusion from Self-Supervised Diffusion Features

Guidance serves as a key concept in diffusion models, yet its effectiveness is often limited by the need for extra data annotation or classifier pretraining. That is why guidance was harnessed from self-supervised learning backbones, like…

Computer Vision and Pattern Recognition · Computer Science 2023-12-15 Vincent Tao Hu, Yunlu Chen, Mathilde Caron, Yuki M. Asano, Cees G. M. Snoek, Bjorn Ommer

Understanding and Improving Training-free Loss-based Diffusion Guidance

Adding additional control to pretrained diffusion models has become an increasingly popular research area, with extensive applications in computer vision, reinforcement learning, and AI for science. Recently, several studies have proposed…

Machine Learning · Computer Science 2024-05-30 Yifei Shen, Xinyang Jiang, Yezhen Wang, Yifan Yang, Dongqi Han, Dongsheng Li

Variational Control for Guidance in Diffusion Models

Diffusion models exhibit excellent sample quality, but existing guidance methods often require additional model training or are limited to specific tasks. We revisit guidance in diffusion models from the perspective of variational inference…

Machine Learning · Computer Science 2025-05-27 Kushagra Pandey, Farrin Marouf Sofian, Felix Draxler, Theofanis Karaletsos, Stephan Mandt

Classifier-Free Diffusion Guidance

Classifier guidance is a recently introduced method to trade off mode coverage and sample fidelity in conditional diffusion models post training, in the same spirit as low temperature sampling or truncation in other types of generative…

Machine Learning · Computer Science 2022-07-27 Jonathan Ho, Tim Salimans

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

We present InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions. Unlike existing approaches that integrate prior knowledge and pre-define the output space (e.g., categories and…

Computer Vision and Pattern Recognition · Computer Science 2023-09-08 Zigang Geng, Binxin Yang, Tiankai Hang, Chen Li, Shuyang Gu, Ting Zhang, Jianmin Bao, Zheng Zhang, Han Hu, Dong Chen, Baining Guo

Readout Guidance: Learning Control from Diffusion Features

We present Readout Guidance, a method for controlling text-to-image diffusion models with learned signals. Readout Guidance uses readout heads, lightweight networks trained to extract signals from the features of a pre-trained, frozen…

Computer Vision and Pattern Recognition · Computer Science 2024-04-04 Grace Luo, Trevor Darrell, Oliver Wang, Dan B Goldman, Aleksander Holynski

DiffUTE: Universal Text Editing Diffusion Model

Diffusion model based language-guided image editing has achieved great success recently. However, existing state-of-the-art diffusion models struggle with rendering correct text and text style during generation. To tackle this problem, we…

Computer Vision and Pattern Recognition · Computer Science 2023-10-19 Haoxing Chen, Zhuoer Xu, Zhangxuan Gu, Jun Lan, Xing Zheng, Yaohui Li, Changhua Meng, Huijia Zhu, Weiqiang Wang

Diffusion Models without Classifier-free Guidance

This paper presents Model-guidance (MG), a novel objective for training diffusion models that addresses and removes the commonly used Classifier-free guidance (CFG). Our innovative approach transcends the standard modeling of solely data…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Zhicong Tang, Jianmin Bao, Dong Chen, Baining Guo

Conditional Image Generation with Pretrained Generative Model

In recent years, diffusion models have gained popularity for their ability to generate higher-quality images in comparison to GAN models. However, like any other large generative models, these models require a huge amount of data,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Rajesh Shrestha, Bowen Xie

Spectral Guidance for Flexible and Efficient Control of Diffusion Models

We introduce Spectral Guidance, a framework for controlling diffusion models by leveraging the intrinsic geometry of the generative process. As data is progressively corrupted by noise, only a small number of features remain informative for…

Machine Learning · Computer Science 2026-05-29 Gabriel Moreira, Manuel Marques, João Paulo Costeira, Chenyan Xiong