English
Related papers

Related papers: Biased Generalization in Diffusion Models

200 papers

The diffusion probabilistic generative models are widely used to generate high-quality data. Though they can synthetic data that does not exist in the training set, the rationale behind such generalization is still unexplored. In this…

Machine Learning · Computer Science 2023-05-25 Mingyang Yi , Jiacheng Sun , Zhenguo Li

Diffusion models are powerful generative models that produce high-quality samples from complex data. While their infinite-data behavior is well understood, their generalization with finite data remains less clear. Classical learning theory…

Machine Learning · Statistics 2026-02-02 Claudia Merger , Sebastian Goldt

Generalization is at the core of machine learning models. However, the definition of generalization is not entirely clear. We employ set theory to introduce the concepts of algorithms, hypotheses, and dataset generalization. We analyze the…

Machine Learning · Computer Science 2023-11-14 Shiqi Liu

In high dimensional settings, density estimation algorithms rely crucially on their inductive bias. Despite recent empirical success, the inductive bias of deep generative models is not well understood. In this paper we propose a framework…

Machine Learning · Computer Science 2018-11-09 Shengjia Zhao , Hongyu Ren , Arianna Yuan , Jiaming Song , Noah Goodman , Stefano Ermon

Diffusion models are a class of generative models that serve to establish a stochastic transport map between an empirically observed, yet unknown, target distribution and a known prior. Despite their remarkable success in real-world…

Machine Learning · Computer Science 2025-03-13 Puheng Li , Zhong Li , Huishuai Zhang , Jiang Bian

Exposure bias refers to the train-test discrepancy that seemingly arises when an autoregressive generative model uses only ground-truth contexts at training time but generated ones at test time. We separate the contributions of the model…

Machine Learning · Computer Science 2019-11-11 Florian Schmidt

Diffusion models generalize well in practice. However, an optimal diffusion model fully memorizes the training data and therefore fails to generalize, raising the question of what induces generalization in a real diffusion model. We show…

Machine Learning · Computer Science 2026-05-21 Tim Kaiser , Markus Kollmann

Models for learning probability distributions such as generative models and density estimators behave quite differently from models for learning functions. One example is found in the memorization phenomenon, namely the ultimate convergence…

Machine Learning · Statistics 2021-03-03 Hongkang Yang , Weinan E

How diffusion models generalize beyond their training set is not known, and is somewhat mysterious given two facts: the optimum of the denoising score matching (DSM) objective usually used to train diffusion models is the score function of…

Machine Learning · Computer Science 2025-04-18 John J. Vastola

The generalization of machine learning models has a complex dependence on the data, model and learning algorithm. We study train and test performance, as well as the generalization gap given by the mean of their difference over different…

Machine Learning · Statistics 2022-06-29 Carlos A. Gomez-Uribe

The robust generalization of models to rare, in-distribution (ID) samples drawn from the long tail of the training distribution and to out-of-training-distribution (OOD) samples is one of the major challenges of current deep learning…

Computer Vision and Pattern Recognition · Computer Science 2024-04-03 Paul Gavrikov , Janis Keuper

In this work, we investigate an intriguing and prevalent phenomenon of diffusion models which we term as "consistent model reproducibility": given the same starting noise input and a deterministic sampler, different diffusion models often…

Machine Learning · Computer Science 2024-06-11 Huijie Zhang , Jinfan Zhou , Yifu Lu , Minzhe Guo , Peng Wang , Liyue Shen , Qing Qu

Distributed learning provides an attractive framework for scaling the learning task by sharing the computational load over multiple nodes in a network. Here, we investigate the performance of distributed learning for large-scale linear…

Machine Learning · Statistics 2021-11-03 Martin Hellkvist , Ayça Özçelikkale , Anders Ahlén

Learned systems in the domain of visual recognition and cognition impress in part because even though they are trained with datasets many orders of magnitude smaller than the full population of possible images, they exhibit sufficient…

Computer Vision and Pattern Recognition · Computer Science 2022-05-03 John K. Tsotsos , Jun Luo

Generalization beyond a training dataset is a main goal of machine learning, but theoretical understanding of generalization remains an open problem for many models. The need for a new theory is exacerbated by recent observations in deep…

Machine Learning · Statistics 2022-02-08 Abdulkadir Canatar , Blake Bordelon , Cengiz Pehlevan

Diffusion probabilistic models have been successfully used to generate data from noise. However, most diffusion models are computationally expensive and difficult to interpret with a lack of theoretical justification. Random feature models…

Machine Learning · Statistics 2025-08-11 Esha Saha , Giang Tran

The widespread adoption of generative AI models has raised growing concerns about representational harm and potential discriminatory outcomes. Yet, despite growing literature on this topic, the mechanisms by which bias emerges - especially…

Computer Vision and Pattern Recognition · Computer Science 2025-06-12 Xiaofeng Zhang , Michelle Lin , Simon Lacoste-Julien , Aaron Courville , Yash Goyal

Humans have an impressive ability to reason about new concepts and experiences from just a single example. In particular, humans have an ability for one-shot generalization: an ability to encounter a new concept, understand its structure,…

Machine Learning · Statistics 2016-05-26 Danilo Jimenez Rezende , Shakir Mohamed , Ivo Danihelka , Karol Gregor , Daan Wierstra

Recent studies show that transformer-based architectures emulate gradient descent during a forward pass, contributing to in-context learning capabilities - an ability where the model adapts to new tasks based on a sequence of prompt…

Statistics Theory · Mathematics 2024-05-13 Karthik Duraisamy

Background: Deep learning models are typically trained using stochastic gradient descent or one of its variants. These methods update the weights using their gradient, estimated from a small fraction of the training data. It has been…

Machine Learning · Statistics 2018-01-03 Elad Hoffer , Itay Hubara , Daniel Soudry
‹ Prev 1 2 3 10 Next ›