English

Heavy-Tailed Diffusion Models

Machine Learning 2024-10-30 v2 Machine Learning

Abstract

Diffusion models achieve state-of-the-art generation quality across many applications, but their ability to capture rare or extreme events in heavy-tailed distributions remains unclear. In this work, we show that traditional diffusion and flow-matching models with standard Gaussian priors fail to capture heavy-tailed behavior. We address this by repurposing the diffusion framework for heavy-tail estimation using multivariate Student-t distributions. We develop a tailored perturbation kernel and derive the denoising posterior based on the conditional Student-t distribution for the backward process. Inspired by γ\gamma-divergence for heavy-tailed distributions, we derive a training objective for heavy-tailed denoisers. The resulting framework introduces controllable tail generation using only a single scalar hyperparameter, making it easily tunable for diverse real-world distributions. As specific instantiations of our framework, we introduce t-EDM and t-Flow, extensions of existing diffusion and flow models that employ a Student-t prior. Remarkably, our approach is readily compatible with standard Gaussian diffusion models and requires only minimal code changes. Empirically, we show that our t-EDM and t-Flow outperform standard diffusion models in heavy-tail estimation on high-resolution weather datasets in which generating rare and extreme events is crucial.

Keywords

Cite

@article{arxiv.2410.14171,
  title  = {Heavy-Tailed Diffusion Models},
  author = {Kushagra Pandey and Jaideep Pathak and Yilun Xu and Stephan Mandt and Michael Pritchard and Arash Vahdat and Morteza Mardani},
  journal= {arXiv preprint arXiv:2410.14171},
  year   = {2024}
}

Comments

51 pages, v2 Fixes typo and adds some more recent work

R2 v1 2026-06-28T19:26:50.153Z