Universal Guidance for Diffusion Models

Arpit Bansal; Hong-Min Chu; Avi Schwarzschild; Soumyadip Sengupta; Micah Goldblum; Jonas Geiping; Tom Goldstein

Universal Guidance for Diffusion Models

Computer Vision and Pattern Recognition 2023-02-15 v1 Machine Learning

Authors: Arpit Bansal , Hong-Min Chu , Avi Schwarzschild , Soumyadip Sengupta , Micah Goldblum , Jonas Geiping , Tom Goldstein

View on arXiv ↗ PDF ↗

Abstract

Typical diffusion models are trained to accept a particular form of conditioning, most commonly text, and cannot be conditioned on other modalities without retraining. In this work, we propose a universal guidance algorithm that enables diffusion models to be controlled by arbitrary guidance modalities without the need to retrain any use-specific components. We show that our algorithm successfully generates quality images with guidance functions including segmentation, face recognition, object detection, and classifier signals. Code is available at https://github.com/arpitbansal297/Universal-Guided-Diffusion.

Keywords

diffusion model image editing image generation

Cite

@article{arxiv.2302.07121,
  title  = {Universal Guidance for Diffusion Models},
  author = {Arpit Bansal and Hong-Min Chu and Avi Schwarzschild and Soumyadip Sengupta and Micah Goldblum and Jonas Geiping and Tom Goldstein},
  journal= {arXiv preprint arXiv:2302.07121},
  year   = {2023}
}

Universal Guidance for Diffusion Models

Abstract

Keywords

Cite

Related papers