English

Computational bottlenecks for denoising diffusions

Machine Learning 2026-04-09 v3 Machine Learning

Abstract

Denoising diffusions sample from a probability distribution μ\mu in Rd\mathbb{R}^d by constructing a stochastic process (x^t:t0)({\hat{\boldsymbol x}}_t:t\ge 0) in Rd\mathbb{R}^d such that x^0{\hat{\boldsymbol x}}_0 is easy to sample, but the distribution of x^T\hat{\boldsymbol x}_T at large TT approximates μ\mu. The drift m:Rd×RRd{\boldsymbol m}:\mathbb{R}^d\times\mathbb{R}\to\mathbb{R}^d of this diffusion process is learned my minimizing a score-matching objective. Is every probability distribution μ\mu, for which sampling is tractable, also amenable to sampling via diffusions? We provide evidence to the contrary by studying a probability distribution μ\mu for which sampling is easy, but the drift of the diffusion process is intractable -- under a popular conjecture on information-computation gaps in statistical estimation. We show that there exist drifts that are superpolynomially close to the optimum value (among polynomial time drifts) and yet yield samples with distribution that is very far from the target one.

Keywords

Cite

@article{arxiv.2503.08028,
  title  = {Computational bottlenecks for denoising diffusions},
  author = {Andrea Montanari and Viet Vu},
  journal= {arXiv preprint arXiv:2503.08028},
  year   = {2026}
}

Comments

51 pages; 2 figures

R2 v1 2026-06-28T22:15:12.327Z