Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion

Seongmin Lee; Benjamin Hoover; Hendrik Strobelt; Zijie J. Wang; ShengYun Peng; Austin Wright; Kevin Li; Haekyu Park; Haoyang Yang; Duen Horng Chau

Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion

Computation and Language 2024-09-04 v3 Artificial Intelligence Human-Computer Interaction Machine Learning

Authors: Seongmin Lee , Benjamin Hoover , Hendrik Strobelt , Zijie J. Wang , ShengYun Peng , Austin Wright , Kevin Li , Haekyu Park , Haoyang Yang , Duen Horng Chau

View on arXiv ↗ PDF ↗

Abstract

Diffusion-based generative models' impressive ability to create convincing images has garnered global attention. However, their complex structures and operations often pose challenges for non-experts to grasp. We present Diffusion Explainer, the first interactive visualization tool that explains how Stable Diffusion transforms text prompts into images. Diffusion Explainer tightly integrates a visual overview of Stable Diffusion's complex structure with explanations of the underlying operations. By comparing image generation of prompt variants, users can discover the impact of keyword changes on image generation. A 56-participant user study demonstrates that Diffusion Explainer offers substantial learning benefits to non-experts. Our tool has been used by over 10,300 users from 124 countries at https://poloclub.github.io/diffusion-explainer/.

Keywords

diffusion model image editing text-to-image generation

Cite

@article{arxiv.2305.03509,
  title  = {Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion},
  author = {Seongmin Lee and Benjamin Hoover and Hendrik Strobelt and Zijie J. Wang and ShengYun Peng and Austin Wright and Kevin Li and Haekyu Park and Haoyang Yang and Duen Horng Chau},
  journal= {arXiv preprint arXiv:2305.03509},
  year   = {2024}
}

Comments

5 pages, 7 figures

Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion

Abstract

Keywords

Cite

Comments

Related papers