Related papers: A Mathematical Framework for Learning Probability …

Generalization and Memorization: The Bias Potential Model

Models for learning probability distributions such as generative models and density estimators behave quite differently from models for learning functions. One example is found in the memorization phenomenon, namely the ultimate convergence…

Machine Learning · Statistics 2021-03-03 Hongkang Yang , Weinan E

The Emergence of Reproducibility and Generalizability in Diffusion Models

In this work, we investigate an intriguing and prevalent phenomenon of diffusion models which we term as "consistent model reproducibility": given the same starting noise input and a deterministic sampler, different diffusion models often…

Machine Learning · Computer Science 2024-06-11 Huijie Zhang , Jinfan Zhou , Yifu Lu , Minzhe Guo , Peng Wang , Liyue Shen , Qing Qu

Training Deep Normalizing Flow Models in Highly Incomplete Data Scenarios with Prior Regularization

Deep generative frameworks including GANs and normalizing flow models have proven successful at filling in missing values in partially observed data samples by effectively learning -- either explicitly or implicitly -- complex,…

Machine Learning · Computer Science 2021-04-06 Edgar A. Bernal

Generalization Dynamics of Linear Diffusion Models

Diffusion models are powerful generative models that produce high-quality samples from complex data. While their infinite-data behavior is well understood, their generalization with finite data remains less clear. Classical learning theory…

Machine Learning · Statistics 2026-02-02 Claudia Merger , Sebastian Goldt

On the Generalization of Diffusion Model

The diffusion probabilistic generative models are widely used to generate high-quality data. Though they can synthetic data that does not exist in the training set, the rationale behind such generalization is still unexplored. In this…

Machine Learning · Computer Science 2023-05-25 Mingyang Yi , Jiacheng Sun , Zhenguo Li

Generalization Error of Generalized Linear Models in High Dimensions

At the heart of machine learning lies the question of generalizability of learned rules over previously unseen data. While over-parameterized models based on neural networks are now ubiquitous in machine learning applications, our…

Machine Learning · Computer Science 2020-05-04 Melikasadat Emami , Mojtaba Sahraee-Ardakan , Parthe Pandit , Sundeep Rangan , Alyson K. Fletcher

Learn to Expect the Unexpected: Probably Approximately Correct Domain Generalization

Domain generalization is the problem of machine learning when the training data and the test data come from different data domains. We present a simple theoretical model of learning to generalize across domains in which there is a…

Machine Learning · Computer Science 2020-02-14 Vikas K. Garg , Adam Kalai , Katrina Ligett , Zhiwei Steven Wu

Out of Distribution Generalization in Machine Learning

Machine learning has achieved tremendous success in a variety of domains in recent years. However, a lot of these success stories have been in places where the training and the testing distributions are extremely similar to each other. In…

Machine Learning · Statistics 2021-03-05 Martin Arjovsky

Introduction to probability and statistics: a computational framework of randomness

This text presents an unified approach of probability and statistics in the pursuit of understanding and computation of randomness in engineering or physical or social system with prediction with generalizability. Starting from elementary…

History and Overview · Mathematics 2024-01-19 Lakshman Mahto

Learning Weighted Representations for Generalization Across Designs

Predictive models that generalize well under distributional shift are often desirable and sometimes crucial to building robust and reliable machine learning applications. We focus on distributional shift that arises in causal inference from…

Machine Learning · Statistics 2018-02-27 Fredrik D. Johansson , Nathan Kallus , Uri Shalit , David Sontag

Normalizing Flows for Bayesian Posteriors: Reproducibility and Deployment

We present a computational framework for efficient learning, sampling, and distribution of general Bayesian posterior distributions. The framework leverages a machine learning approach for the construction of normalizing flows for the…

Nuclear Theory · Physics 2023-10-10 Yukari Yamauchi , Landon Buskirk , Pablo Giuliani , Kyle Godbey

Probability Distribution Learning and Its Application in Deep Learning

Despite its empirical success, deep learning still lacks a comprehensive theoretical understanding of model fitting and generalization. This paper proposes the probability distribution (PD) learning framework to analyze the optimization and…

Machine Learning · Computer Science 2025-10-09 Binchuan Qi , Wei Gong , Li Li

Principled Out-of-Distribution Generalization via Simplicity

Modern foundation models exhibit remarkable out-of-distribution (OOD) generalization, solving tasks far beyond the support of their training data. However, the theoretical principles underpinning this phenomenon remain elusive. This paper…

Machine Learning · Statistics 2025-05-29 Jiawei Ge , Amanda Wang , Shange Tang , Chi Jin

Biased Generalization in Diffusion Models

Generalization in generative modeling is defined as the ability to learn an underlying distribution from a finite dataset and produce novel samples, with evaluation largely driven by held-out performance and perceived sample quality. In…

Machine Learning · Computer Science 2026-03-05 Jerome Garnier-Brun , Luca Biggio , Davide Beltrame , Marc Mézard , Luca Saglietti

Bias and Generalization in Deep Generative Models: An Empirical Study

In high dimensional settings, density estimation algorithms rely crucially on their inductive bias. Despite recent empirical success, the inductive bias of deep generative models is not well understood. In this paper we propose a framework…

Machine Learning · Computer Science 2018-11-09 Shengjia Zhao , Hongyu Ren , Arianna Yuan , Jiaming Song , Noah Goodman , Stefano Ermon

Improved probabilistic regression using diffusion models

Probabilistic regression models the entire predictive distribution of a response variable, offering richer insights than classical point estimates and directly allowing for uncertainty quantification. While diffusion-based generative models…

Machine Learning · Computer Science 2025-10-07 Carlo Kneissl , Christopher Bülte , Philipp Scholl , Gitta Kutyniok

A Probabilistic Theory of Deep Learning

A grand challenge in machine learning is the development of computational algorithms that match or outperform humans in perceptual inference tasks that are complicated by nuisance variation. For instance, visual object recognition involves…

Machine Learning · Statistics 2015-04-03 Ankit B. Patel , Tan Nguyen , Richard G. Baraniuk

On the Generalization Properties of Diffusion Models

Diffusion models are a class of generative models that serve to establish a stochastic transport map between an empirically observed, yet unknown, target distribution and a known prior. Despite their remarkable success in real-world…

Machine Learning · Computer Science 2025-03-13 Puheng Li , Zhong Li , Huishuai Zhang , Jiang Bian

Probabilistic Graphical Models: A Concise Tutorial

Probabilistic graphical modeling is a branch of machine learning that uses probability distributions to describe the world, make predictions, and support decision-making under uncertainty. Underlying this modeling framework is an elegant…

Machine Learning · Computer Science 2025-07-24 Jacqueline Maasch , Willie Neiswanger , Stefano Ermon , Volodymyr Kuleshov

Rate of Model Collapse in Recursive Training

Given the ease of creating synthetic data from machine learning models, new models can be potentially trained on synthetic data generated by previous models. This recursive training process raises concerns about the long-term impact on…

Machine Learning · Computer Science 2024-12-24 Ananda Theertha Suresh , Andrew Thangaraj , Aditya Nanda Kishore Khandavally