Related papers: Filter-Guided Diffusion for Controllable Image Gen…

MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation

Recent advances in text-to-image generation with diffusion models present transformative capabilities in image quality. However, user controllability of the generated image, and fast adaptation to new tasks still remains an open challenge,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-17 Omer Bar-Tal , Lior Yariv , Yaron Lipman , Tali Dekel

Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation

This paper introduces a novel approach to leverage the generalizability of Diffusion Models for Source-Free Domain Adaptation (DM-SFDA). Our proposed DMSFDA method involves fine-tuning a pre-trained text-to-image diffusion model to generate…

Computer Vision and Pattern Recognition · Computer Science 2024-06-28 Shivang Chopra , Suraj Kothawade , Houda Aynaou , Aman Chadha

Fast Direct: Query-Efficient Online Black-box Guidance for Diffusion-model Target Generation

Guided diffusion-model generation is a promising direction for customizing the generation process of a pre-trained diffusion model to address specific downstream tasks. Existing guided diffusion models either rely on training the guidance…

Machine Learning · Computer Science 2025-04-01 Kim Yong Tan , Yueming Lyu , Ivor Tsang , Yew-Soon Ong

LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion Models

Research in vision-language models has seen rapid developments off-late, enabling natural language-based interfaces for image generation and manipulation. Many existing text guided manipulation techniques are restricted to specific classes…

Computer Vision and Pattern Recognition · Computer Science 2024-05-07 Paramanand Chandramouli , Kanchana Vaishnavi Gandikota

Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance

Diffusion models have shown significant progress in image translation tasks recently. However, due to their stochastic nature, there's often a trade-off between style transformation and content preservation. Current strategies aim to…

Computer Vision and Pattern Recognition · Computer Science 2023-06-08 Gihyun Kwon , Jong Chul Ye

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

Conditional generative models typically demand large annotated training sets to achieve high-quality synthesis. As a result, there has been significant interest in designing models that perform plug-and-play generation, i.e., to use a…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 Nithin Gopalakrishnan Nair , Anoop Cherian , Suhas Lohit , Ye Wang , Toshiaki Koike-Akino , Vishal M. Patel , Tim K. Marks

Diffusion Guided Domain Adaptation of Image Generators

Can a text-to-image diffusion model be used as a training objective for adapting a GAN generator to another domain? In this paper, we show that the classifier-free guidance can be leveraged as a critic and enable generators to distill…

Computer Vision and Pattern Recognition · Computer Science 2022-12-12 Kunpeng Song , Ligong Han , Bingchen Liu , Dimitris Metaxas , Ahmed Elgammal

Diffusion Self-Guidance for Controllable Image Generation

Large-scale generative models are capable of producing high-quality images from detailed text descriptions. However, many aspects of an image are difficult or impossible to convey through text. We introduce self-guidance, a method that…

Computer Vision and Pattern Recognition · Computer Science 2023-06-13 Dave Epstein , Allan Jabri , Ben Poole , Alexei A. Efros , Aleksander Holynski

Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design

Generative models have the potential to accelerate key steps in the discovery of novel molecular therapeutics and materials. Diffusion models have recently emerged as a powerful approach, excelling at unconditional sample generation and,…

Biomolecules · Quantitative Biology 2024-07-17 Leo Klarner , Tim G. J. Rudner , Garrett M. Morris , Charlotte M. Deane , Yee Whye Teh

Diffusion-Based Conditional Image Editing through Optimized Inference with Guidance

We present a simple but effective training-free approach for text-driven image-to-image translation based on a pretrained text-to-image diffusion model. Our goal is to generate an image that aligns with the target task while preserving the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-23 Hyunsoo Lee , Minsoo Kang , Bohyung Han

Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation

Diffusion-based text-to-image generation models like GLIDE and DALLE-2 have gained wide success recently for their superior performance in turning complex text inputs into images of high quality and wide diversity. In particular, they are…

Computer Vision and Pattern Recognition · Computer Science 2022-11-16 Zhihong Pan , Xin Zhou , Hao Tian

LayerDiffusion: Layered Controlled Image Editing with Diffusion Models

Text-guided image editing has recently experienced rapid development. However, simultaneously performing multiple editing actions on a single image, such as background replacement and specific subject attribute changes, while maintaining…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Pengzhi Li , QInxuan Huang , Yikang Ding , Zhiheng Li

Forward-only Diffusion Probabilistic Models

This work presents a forward-only diffusion (FoD) approach for generative modelling. In contrast to traditional diffusion models that rely on a coupled forward-backward diffusion scheme, FoD directly learns data generation through a single…

Machine Learning · Computer Science 2025-09-29 Ziwei Luo , Fredrik K. Gustafsson , Jens Sjölund , Thomas B. Schön

Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators

Diffusion models are capable of generating impressive images conditioned on text descriptions, and extensions of these models allow users to edit images at a relatively coarse scale. However, the ability to precisely edit the layout,…

Computer Vision and Pattern Recognition · Computer Science 2024-02-01 Daniel Geng , Andrew Owens

Self-Guidance: Boosting Flow and Diffusion Generation on Their Own

Proper guidance strategies are essential to achieve high-quality generation results without retraining diffusion and flow-based text-to-image models. Existing guidance either requires specific training or strong inductive biases of…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Tiancheng Li , Weijian Luo , Zhiyang Chen , Liyuan Ma , Guo-Jun Qi

Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance

Masked generative models (MGMs) have shown impressive generative ability while providing an order of magnitude efficient sampling steps compared to continuous diffusion models. However, MGMs still underperform in image synthesis compared to…

Computer Vision and Pattern Recognition · Computer Science 2024-10-18 Jiwan Hur , Dong-Jae Lee , Gyojin Han , Jaehyun Choi , Yunho Jeon , Junmo Kim

MedDiff-FT: Data-Efficient Diffusion Model Fine-tuning with Structural Guidance for Controllable Medical Image Synthesis

Recent advancements in deep learning for medical image segmentation are often limited by the scarcity of high-quality training data.While diffusion models provide a potential solution by generating synthetic images, their effectiveness in…

Computer Vision and Pattern Recognition · Computer Science 2025-07-02 Jianhao Xie , Ziang Zhang , Zhenyu Weng , Yuesheng Zhu , Guibo Luo

Guiding a Diffusion Model with a Bad Version of Itself

The primary axes of interest in image-generating diffusion models are image quality, the amount of variation in the results, and how well the results align with a given condition, e.g., a class label or a text prompt. The popular…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Tero Karras , Miika Aittala , Tuomas Kynkäänniemi , Jaakko Lehtinen , Timo Aila , Samuli Laine

DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance

Large-scale generative models, such as text-to-image diffusion models, have garnered widespread attention across diverse domains due to their creative and high-fidelity image generation. Nonetheless, existing large-scale diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-08-28 Younghyun Kim , Geunmin Hwang , Junyu Zhang , Eunbyung Park

Improving Diffusion-Based Image Editing Faithfulness via Guidance and Scheduling

Text-guided diffusion models have become essential for high-quality image synthesis, enabling dynamic image editing. In image editing, two crucial aspects are editability, which determines the extent of modification, and faithfulness, which…

Computer Vision and Pattern Recognition · Computer Science 2025-06-27 Hansam Cho , Seoung Bum Kim