Related papers: Conditional Image Generation with Pretrained Gener…

On Conditioning the Input Noise for Controlled Image Generation with Diffusion Models

Conditional image generation has paved the way for several breakthroughs in image editing, generating stock photos and 3-D object generation. This continues to be a significant area of interest with the rise of new state-of-the-art methods…

Computer Vision and Pattern Recognition · Computer Science 2022-05-10 Vedant Singh , Surgan Jandial , Ayush Chopra , Siddharth Ramesh , Balaji Krishnamurthy , Vineeth N. Balasubramanian

Towards Controllable Image Generation through Representation-Conditioned Diffusion Models

Diffusion models have emerged as powerful tools for high-quality image generation and editing, but guiding these models to produce specific outputs remains a challenge. Conventional approaches rely on conditioning mechanisms, such as text…

Computer Vision and Pattern Recognition · Computer Science 2026-05-27 Nithesh Chandher Karthikeyan , Jonas Unger , Gabriel Eilertsen

A Simple Approach to Unifying Diffusion-based Conditional Generation

Recent progress in image generation has sparked research into controlling these models through condition signals, with various methods addressing specific challenges in conditional generation. Instead of proposing another specialized…

Computer Vision and Pattern Recognition · Computer Science 2025-04-08 Xirui Li , Charles Herrmann , Kelvin C. K. Chan , Yinxiao Li , Deqing Sun , Chao Ma , Ming-Hsuan Yang

Conditional Image Synthesis with Diffusion Models: A Survey

Conditional image synthesis based on user-specified requirements is a key component in creating complex visual content. In recent years, diffusion-based generative modeling has become a highly effective way for conditional image synthesis,…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Zheyuan Zhan , Defang Chen , Jian-Ping Mei , Zhenghe Zhao , Jiawei Chen , Chun Chen , Siwei Lyu , Can Wang

Is Conditional Generative Modeling all you need for Decision-Making?

Recent improvements in conditional generative modeling have made it possible to generate high-quality images from language descriptions alone. We investigate whether these methods can directly address the problem of sequential…

Machine Learning · Computer Science 2023-07-11 Anurag Ajay , Yilun Du , Abhi Gupta , Joshua Tenenbaum , Tommi Jaakkola , Pulkit Agrawal

Visual Chain-of-Thought Diffusion Models

Recent progress with conditional image diffusion models has been stunning, and this holds true whether we are speaking about models conditioned on a text description, a scene layout, or a sketch. Unconditional image diffusion models are…

Computer Vision and Pattern Recognition · Computer Science 2023-06-22 William Harvey , Frank Wood

Diffusion Models Beat GANs on Image Synthesis

We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For…

Machine Learning · Computer Science 2021-06-02 Prafulla Dhariwal , Alex Nichol

Conditional Generation from Unconditional Diffusion Models using Denoiser Representations

Denoising diffusion models have gained popularity as a generative modeling technique for producing high-quality and diverse images. Applying these models to downstream tasks requires conditioning, which can take the form of text, class…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Alexandros Graikos , Srikar Yellapragada , Dimitris Samaras

Guiding a Diffusion Model with a Bad Version of Itself

The primary axes of interest in image-generating diffusion models are image quality, the amount of variation in the results, and how well the results align with a given condition, e.g., a class label or a text prompt. The popular…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Tero Karras , Miika Aittala , Tuomas Kynkäänniemi , Jaakko Lehtinen , Timo Aila , Samuli Laine

Bi-Noising Diffusion: Towards Conditional Diffusion Models with Generative Restoration Priors

Conditional diffusion probabilistic models can model the distribution of natural images and can generate diverse and realistic samples based on given conditions. However, oftentimes their results can be unrealistic with observable color…

Computer Vision and Pattern Recognition · Computer Science 2022-12-15 Kangfu Mei , Nithin Gopalakrishnan Nair , Vishal M. Patel

Exploring Compositional Visual Generation with Latent Classifier Guidance

Diffusion probabilistic models have achieved enormous success in the field of image generation and manipulation. In this paper, we explore a novel paradigm of using the diffusion model and classifier guidance in the latent semantic space…

Computer Vision and Pattern Recognition · Computer Science 2023-05-25 Changhao Shi , Haomiao Ni , Kai Li , Shaobo Han , Mingfu Liang , Martin Renqiang Min

Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis

The success of deep learning in computer vision over the past decade has hinged on large labeled datasets and strong pretrained models. In data-scarce settings, the quality of these pretrained models becomes crucial for effective transfer…

Computer Vision and Pattern Recognition · Computer Science 2025-05-15 Bingxin Ke , Kevin Qu , Tianfu Wang , Nando Metzger , Shengyu Huang , Bo Li , Anton Obukhov , Konrad Schindler

Augmented Conditioning Is Enough For Effective Training Image Generation

Image generation abilities of text-to-image diffusion models have significantly advanced, yielding highly photo-realistic images from descriptive text and increasing the viability of leveraging synthetic images to train computer vision…

Computer Vision and Pattern Recognition · Computer Science 2025-02-10 Jiahui Chen , Amy Zhang , Adriana Romero-Soriano

Manifold Preserving Guided Diffusion

Despite the recent advancements, conditional image generation still faces challenges of cost, generalizability, and the need for task-specific training. In this paper, we propose Manifold Preserving Guided Diffusion (MPGD), a training-free…

Machine Learning · Computer Science 2023-11-29 Yutong He , Naoki Murata , Chieh-Hsin Lai , Yuhta Takida , Toshimitsu Uesaka , Dongjun Kim , Wei-Hsiang Liao , Yuki Mitsufuji , J. Zico Kolter , Ruslan Salakhutdinov , Stefano Ermon

Conditional Video Generation for High-Efficiency Video Compression

Perceptual studies demonstrate that conditional diffusion models excel at reconstructing video content aligned with human visual perception. Building on this insight, we propose a video compression framework that leverages conditional…

Computer Vision and Pattern Recognition · Computer Science 2025-09-26 Fangqiu Yi , Jingyu Xu , Jiawei Shao , Chi Zhang , Xuelong Li

DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance

Large-scale generative models, such as text-to-image diffusion models, have garnered widespread attention across diverse domains due to their creative and high-fidelity image generation. Nonetheless, existing large-scale diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-08-28 Younghyun Kim , Geunmin Hwang , Junyu Zhang , Eunbyung Park

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

Conditional generative models typically demand large annotated training sets to achieve high-quality synthesis. As a result, there has been significant interest in designing models that perform plug-and-play generation, i.e., to use a…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 Nithin Gopalakrishnan Nair , Anoop Cherian , Suhas Lohit , Ye Wang , Toshiaki Koike-Akino , Vishal M. Patel , Tim K. Marks

Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models

Recent advances in diffusion models enable many powerful instruments for image editing. One of these instruments is text-driven image manipulations: editing semantic attributes of an image according to the provided text description. %…

Computer Vision and Pattern Recognition · Computer Science 2023-04-11 Nikita Starodubcev , Dmitry Baranchuk , Valentin Khrulkov , Artem Babenko

CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation

Large generative diffusion models have revolutionized text-to-image generation and offer immense potential for conditional generation tasks such as image enhancement, restoration, editing, and compositing. However, their widespread adoption…

Computer Vision and Pattern Recognition · Computer Science 2024-02-20 Kangfu Mei , Mauricio Delbracio , Hossein Talebi , Zhengzhong Tu , Vishal M. Patel , Peyman Milanfar

Guidance Free Image Editing via Explicit Conditioning

Current sampling mechanisms for conditional diffusion models rely mainly on Classifier Free Guidance (CFG) to generate high-quality images. However, CFG requires several denoising passes in each time step, e.g., up to three passes in image…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Mehdi Noroozi , Alberto Gil Ramos , Luca Morreale , Ruchika Chavhan , Malcolm Chadwick , Abhinav Mehrotra , Sourav Bhattacharya