English
Related papers

Related papers: On Memorization in Diffusion Models

200 papers

When do diffusion models reproduce their training data, and when are they able to generate samples beyond it? A practically relevant theoretical understanding of this interplay between memorization and generalization may significantly…

Machine Learning · Computer Science 2025-08-26 Sam Buchanan , Druv Pai , Yi Ma , Valentin De Bortoli

Despite their success in image generation, diffusion models can memorize training data, raising serious privacy and copyright concerns. Although prior work has sought to characterize, detect, and mitigate memorization, the fundamental…

Computer Vision and Pattern Recognition · Computer Science 2025-10-01 Juyeop Kim , Songkuk Kim , Jong-Seok Lee

Diffusion models, widely used for image and video generation, face a significant limitation: the risk of memorizing and reproducing training data during inference, potentially generating unauthorized copyrighted content. While prior…

Computer Vision and Pattern Recognition · Computer Science 2025-04-28 Chen Chen , Enhuai Liu , Daochang Liu , Mubarak Shah , Chang Xu

Diffusion models (DMs) memorize training images and can reproduce near-duplicates during generation. Current detection methods identify verbatim memorization but fail to capture two critical aspects: quantifying partial memorization…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Jimmy Z. Di , Yiwei Lu , Yaoliang Yu , Gautam Kamath , Adam Dziedzic , Franziska Boenisch

Recent breakthroughs in diffusion models have exhibited exceptional image-generation capabilities. However, studies show that some outputs are merely replications of training data. Such replications present potential legal challenges for…

Computer Vision and Pattern Recognition · Computer Science 2024-08-01 Yuxin Wen , Yuchen Liu , Chen Chen , Lingjuan Lyu

There is strong empirical evidence that the state-of-the-art diffusion modeling paradigm leads to models that memorize the training set, especially when the training set is small. Prior methods to mitigate the memorization problem often…

Machine Learning · Computer Science 2026-03-03 Kulin Shah , Alkis Kalavasis , Adam R. Klivans , Giannis Daras

Diffusion models are central to modern generative modeling, and understanding how they balance memorization and generalization is critical for reliable deployment. Recent work has shown that memorization in diffusion models is shaped by…

Machine Learning · Computer Science 2026-04-28 Bingqing Jiang , Difan Zou

In this work, we investigate an intriguing and prevalent phenomenon of diffusion models which we term as "consistent model reproducibility": given the same starting noise input and a deterministic sampler, different diffusion models often…

Machine Learning · Computer Science 2024-06-11 Huijie Zhang , Jinfan Zhou , Yifu Lu , Minzhe Guo , Peng Wang , Liyue Shen , Qing Qu

Diffusion models have achieved remarkable success across a wide range of generative tasks. A key challenge is understanding the mechanisms that prevent their memorization of training data and allow generalization. In this work, we…

Machine Learning · Computer Science 2025-10-29 Tony Bonnaire , Raphaël Urfin , Giulio Biroli , Marc Mézard

Diffusion models excel at generating high-quality, diverse samples, yet they risk memorizing training data when overfit to the training objective. We analyze the distinctions between memorization and generalization in diffusion models…

Machine Learning · Computer Science 2026-02-12 Zekai Zhang , Xiao Li , Xiang Li , Lianghe Shi , Meng Wu , Molei Tao , Qing Qu

Multimodal machine learning, especially text-to-image models like Stable Diffusion and DALL-E 3, has gained significance for transforming text into detailed images. Despite their growing use and remarkable generative capabilities, there is…

Computer Vision and Pattern Recognition · Computer Science 2023-12-14 Ali Naseh , Jaechul Roh , Amir Houmansadr

Diffusion-based models, such as the Stable Diffusion model, have revolutionized text-to-image synthesis with their ability to produce high-quality, high-resolution images. These advancements have prompted significant progress in image…

Cryptography and Security · Computer Science 2023-12-07 Ali Naseh , Jaechul Roh , Amir Houmansadr

Diffusion models (DMs) produce very detailed and high-quality images. Their power results from extensive training on large amounts of data, usually scraped from the internet without proper attribution or consent from content creators.…

Machine Learning · Computer Science 2024-11-05 Dominik Hintersdorf , Lukas Struppek , Kristian Kersting , Adam Dziedzic , Franziska Boenisch

As diffusion probabilistic models (DPMs) are being employed as mainstream models for generative artificial intelligence (AI), the study of their memorization of the raw training data has attracted growing attention. Existing works in this…

Cryptography and Security · Computer Science 2024-10-15 Yunhao Chen , Xingjun Ma , Difan Zou , Yu-Gang Jiang

The past few years have witnessed substantial advances in image generation powered by diffusion models. However, it was shown that diffusion models are susceptible to training data memorization, raising significant concerns regarding…

Cryptography and Security · Computer Science 2025-08-01 Zhe Ma , Qingming Li , Xuhong Zhang , Tianyu Du , Ruixiao Lin , Zonghui Wang , Shouling Ji , Wenzhi Chen

Large-scale text-to-image diffusion models excel in generating high-quality images from textual inputs, yet concerns arise as research indicates their tendency to memorize and replicate training data, raising We also addressed the issue of…

Computer Vision and Pattern Recognition · Computer Science 2024-06-28 Ruchika Chavhan , Ondrej Bohdal , Yongshuo Zong , Da Li , Timothy Hospedales

This position paper argues that understanding generalization in diffusion models requires fundamentally new theoretical frameworks that go beyond both classical statistical learning theory and the benign overfitting paradigm developed for…

Machine Learning · Computer Science 2026-05-08 Pierre Marion , Yu-Han Wu

We theoretically investigate the phenomena of generalization and memorization in diffusion models. Empirical studies suggest that these phenomena are influenced by model complexity and the size of the training dataset. In our experiments,…

Machine Learning · Computer Science 2025-10-09 Anand Jerry George , Rodrigo Veiga , Nicolas Macris

Diffusion models excel in generating images that closely resemble their training data but are also susceptible to data memorization, raising privacy, ethical, and legal concerns, particularly in sensitive domains such as medical imaging. We…

Computer Vision and Pattern Recognition · Computer Science 2025-02-12 Raman Dutt , Ondrej Bohdal , Pedro Sanchez , Sotirios A. Tsaftaris , Timothy Hospedales

Diffusion models have emerged as a powerful framework for generative modeling. At the heart of the methodology is score matching: learning gradients of families of log-densities for noisy versions of the data distribution at different…

Machine Learning · Computer Science 2025-03-19 Ricardo Baptista , Agnimitra Dasgupta , Nikola B. Kovachki , Assad Oberai , Andrew M. Stuart
‹ Prev 1 2 3 10 Next ›