Related papers: Implicit Regularisation in Diffusion Models: An Al…

Denoising Score Matching with Random Features: Insights on Diffusion Models from Precise Learning Curves

We theoretically investigate the phenomena of generalization and memorization in diffusion models. Empirical studies suggest that these phenomena are influenced by model complexity and the size of the training dataset. In our experiments,…

Machine Learning · Computer Science 2025-10-09 Anand Jerry George , Rodrigo Veiga , Nicolas Macris

Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization

Diffusion models have become a leading paradigm in generative AI, with score estimation via denoising score matching as a central component. While recent theory provides strong statistical guarantees, it typically relies on…

Machine Learning · Computer Science 2026-04-21 Yinbin Han , Meisam Razaviyayn , Renyuan Xu

Generalization through variance: how noise shapes inductive biases in diffusion models

How diffusion models generalize beyond their training set is not known, and is somewhat mysterious given two facts: the optimum of the denoising score matching (DSM) objective usually used to train diffusion models is the score function of…

Machine Learning · Computer Science 2025-04-18 John J. Vastola

Diffusion Models Memorize in Training -- and Generalize in Inference

Diffusion models generalize well in practice. However, an optimal diffusion model fully memorizes the training data and therefore fails to generalize, raising the question of what induces generalization in a real diffusion model. We show…

Machine Learning · Computer Science 2026-05-21 Tim Kaiser , Markus Kollmann

Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure

In this work, we study the generalizability of diffusion models by looking into the hidden properties of the learned score functions, which are essentially a series of deep denoisers trained on various noise levels. We observe that as…

Machine Learning · Computer Science 2024-12-03 Xiang Li , Yixiang Dai , Qing Qu

Deep Networks as Denoising Algorithms: Sample-Efficient Learning of Diffusion Models in High-Dimensional Graphical Models

We investigate the approximation efficiency of score functions by deep neural networks in diffusion-based generative modeling. While existing approximation theories utilize the smoothness of score functions, they suffer from the curse of…

Machine Learning · Computer Science 2023-09-21 Song Mei , Yuchen Wu

The Emergence of Reproducibility and Generalizability in Diffusion Models

In this work, we investigate an intriguing and prevalent phenomenon of diffusion models which we term as "consistent model reproducibility": given the same starting noise input and a deterministic sampler, different diffusion models often…

Machine Learning · Computer Science 2024-06-11 Huijie Zhang , Jinfan Zhou , Yifu Lu , Minzhe Guo , Peng Wang , Liyue Shen , Qing Qu

Algorithm- and Data-Dependent Generalization Bounds for Diffusion Models

Score-based generative models (SGMs) have emerged as one of the most popular classes of generative models. A substantial body of work now exists on the analysis of SGMs, focusing either on discretization aspects or on their statistical…

Machine Learning · Statistics 2026-02-10 Benjamin Dupuis , Dario Shariatian , Maxime Haddouche , Alain Durmus , Umut Simsekli

Towards a Mechanistic Explanation of Diffusion Model Generalization

We propose a simple, training-free mechanism which explains the generalization behaviour of diffusion models. By comparing pre-trained diffusion models to their theoretically optimal empirical counterparts, we identify a shared local…

Machine Learning · Computer Science 2025-06-11 Matthew Niedoba , Berend Zwartsenberg , Kevin Murphy , Frank Wood

Memorization and Regularization in Generative Diffusion Models

Diffusion models have emerged as a powerful framework for generative modeling. At the heart of the methodology is score matching: learning gradients of families of log-densities for noisy versions of the data distribution at different…

Machine Learning · Computer Science 2025-03-19 Ricardo Baptista , Agnimitra Dasgupta , Nikola B. Kovachki , Assad Oberai , Andrew M. Stuart

Generalization in diffusion models arises from geometry-adaptive harmonic representations

Deep neural networks (DNNs) trained for image denoising are able to generate high-quality samples with score-based reverse diffusion algorithms. These impressive capabilities seem to imply an escape from the curse of dimensionality, but…

Computer Vision and Pattern Recognition · Computer Science 2025-03-03 Zahra Kadkhodaie , Florentin Guth , Eero P. Simoncelli , Stéphane Mallat

Smoothing the Score Function for Generalization in Diffusion Models: An Optimization-based Explanation Framework

Diffusion models achieve remarkable generation quality, yet face a fundamental challenge known as memorization, where generated samples can replicate training samples exactly. We develop a theoretical framework to explain this phenomenon by…

Machine Learning · Computer Science 2026-03-31 Xinyu Zhou , Jiawei Zhang , Stephen J. Wright

Optimizing Input of Denoising Score Matching is Biased Towards Higher Score Norm

Many recent works utilize denoising score matching to optimize the conditional input of diffusion models. In this workshop paper, we demonstrate that such optimization breaks the equivalence between denoising score matching and exact score…

Machine Learning · Computer Science 2025-11-18 Tongda Xu

Stability and Generalization of Adversarial Diffusion Training

Algorithmic stability is an established tool for analyzing generalization. While adversarial training enhances model robustness, it often suffers from robust overfitting and an enlarged generalization gap. Although recent work has…

Machine Learning · Computer Science 2025-09-24 Hesam Hosseini , Ying Cao , Ali H. Sayed

Conflicting Biases at the Edge of Stability: Norm versus Sharpness Regularization

A widely believed explanation for the remarkable generalization capacities of overparameterized neural networks is that the optimization algorithms used for training induce an implicit bias towards benign solutions. To grasp this…

Machine Learning · Computer Science 2025-12-19 Maria Matveev , Vit Fojtik , Hung-Hsu Chou , Gitta Kutyniok , Johannes Maly

Analyzing Neural Network-Based Generative Diffusion Models through Convex Optimization

Diffusion models are gaining widespread use in cutting-edge image, video, and audio generation. Score-based diffusion models stand out among these methods, necessitating the estimation of score function of the input data distribution. In…

Machine Learning · Computer Science 2024-05-24 Fangzhao Zhang , Mert Pilanci

Is Your Conditional Diffusion Model Actually Denoising?

We study the inductive biases of diffusion models with a conditioning-variable, which have seen widespread application as both text-conditioned generative image models and observation-conditioned continuous control policies. We observe that…

Machine Learning · Computer Science 2025-12-23 Daniel Pfrommer , Zehao Dou , Christopher Scarvelis , Max Simchowitz , Ali Jadbabaie

Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization

Denoising score matching plays a pivotal role in the performance of diffusion-based generative models. However, the empirical optimal score--the exact solution to the denoising score matching--leads to memorization, where generated samples…

Machine Learning · Statistics 2025-05-07 Yu-Han Wu , Pierre Marion , Gérard Biau , Claire Boyer

Random Walks with Tweedie: A Unified View of Score-Based Diffusion Models

We present a concise derivation for several influential score-based diffusion models that relies on only a few textbook results. Diffusion models have recently emerged as powerful tools for generating realistic, synthetic signals --…

Computer Vision and Pattern Recognition · Computer Science 2025-10-06 Chicago Y. Park , Michael T. McCann , Cristina Garcia-Cardona , Brendt Wohlberg , Ulugbek S. Kamilov

How Diffusion Models Memorize

Despite their success in image generation, diffusion models can memorize training data, raising serious privacy and copyright concerns. Although prior work has sought to characterize, detect, and mitigate memorization, the fundamental…

Computer Vision and Pattern Recognition · Computer Science 2025-10-01 Juyeop Kim , Songkuk Kim , Jong-Seok Lee