English
Related papers

Related papers: Detecting, Explaining, and Mitigating Memorization…

200 papers

Large-scale text-to-image diffusion models excel in generating high-quality images from textual inputs, yet concerns arise as research indicates their tendency to memorize and replicate training data, raising We also addressed the issue of…

Computer Vision and Pattern Recognition · Computer Science 2024-06-28 Ruchika Chavhan , Ondrej Bohdal , Yongshuo Zong , Da Li , Timothy Hospedales

Diffusion models have achieved remarkable success in Text-to-Image generation tasks, leading to the development of many commercial models. However, recent studies have reported that diffusion models often generate replicated images in train…

Computer Vision and Pattern Recognition · Computer Science 2024-10-01 Chunsan Hong , Tae-Hyun Oh , Minhyuk Sung

Diffusion-based models, such as the Stable Diffusion model, have revolutionized text-to-image synthesis with their ability to produce high-quality, high-resolution images. These advancements have prompted significant progress in image…

Cryptography and Security · Computer Science 2023-12-07 Ali Naseh , Jaechul Roh , Amir Houmansadr

While diffusion models excel at generating high-quality images, their tendency to memorize training data poses significant privacy and copyright risks. In this work, we for the first time identify that memorization induces internal…

Computer Vision and Pattern Recognition · Computer Science 2026-05-27 Yuanmin Huang , Mi Zhang , Chen Chen , Feifei Li , Geng Hong , Xiaoyu You , Min Yang

Recent advancements in text-to-image diffusion models have demonstrated their remarkable capability to generate high-quality images from textual prompts. However, increasing research indicates that these models memorize and replicate images…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Jie Ren , Yaxin Li , Shenglai Zeng , Han Xu , Lingjuan Lyu , Yue Xing , Jiliang Tang

Diffusion models, widely used for image and video generation, face a significant limitation: the risk of memorizing and reproducing training data during inference, potentially generating unauthorized copyrighted content. While prior…

Computer Vision and Pattern Recognition · Computer Science 2025-04-28 Chen Chen , Enhuai Liu , Daochang Liu , Mubarak Shah , Chang Xu

Visual Generative AI models have demonstrated remarkable capability in generating high-quality images from user inputs like text prompts. However, because these models have billions of parameters, they risk memorizing certain parts of the…

Computer Vision and Pattern Recognition · Computer Science 2025-12-18 Lena Reissinger , Yuanyuan Li , Anna-Carolina Haensch , Neeraj Sarna

Diffusion models (DMs) memorize training images and can reproduce near-duplicates during generation. Current detection methods identify verbatim memorization but fail to capture two critical aspects: quantifying partial memorization…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Jimmy Z. Di , Yiwei Lu , Yaoliang Yu , Gautam Kamath , Adam Dziedzic , Franziska Boenisch

There is strong empirical evidence that the state-of-the-art diffusion modeling paradigm leads to models that memorize the training set, especially when the training set is small. Prior methods to mitigate the memorization problem often…

Machine Learning · Computer Science 2026-03-03 Kulin Shah , Alkis Kalavasis , Adam R. Klivans , Giannis Daras

Recent works have shown that diffusion models are able to memorize training images and emit them at generation time. However, the metrics used to evaluate memorization and its mitigation techniques suffer from dataset-dependent biases and…

Machine Learning · Computer Science 2025-03-04 Nicky Kriplani , Minh Pham , Gowthami Somepalli , Chinmay Hegde , Niv Cohen

In this paper, we introduce a geometric framework to analyze memorization in diffusion models through the sharpness of the log probability density. We mathematically justify a previously proposed score-difference-based memorization metric…

Machine Learning · Computer Science 2025-08-20 Dongjae Jeon , Dueun Kim , Albert No

Pretrained diffusion models and their outputs are widely accessible due to their exceptional capacity for synthesizing high-quality images and their open-source nature. The users, however, may face litigation risks owing to the models'…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Chen Chen , Daochang Liu , Chang Xu

Images generated by diffusion models like Stable Diffusion are increasingly widespread. Recent works and even lawsuits have shown that these models are prone to replicating their training data, unbeknownst to the user. In this paper, we…

Machine Learning · Computer Science 2023-06-01 Gowthami Somepalli , Vasu Singla , Micah Goldblum , Jonas Geiping , Tom Goldstein

Diffusion-based image generative models produce high-fidelity images through iterative denoising but remain vulnerable to memorization, where they unintentionally reproduce exact copies or parts of training images. Recent memorization…

Machine Learning · Computer Science 2026-02-11 Rohan Asthana , Vasileios Belagiannis

Due to their capacity to generate novel and high-quality samples, diffusion models have attracted significant research interest in recent years. Notably, the typical training objective of diffusion models, i.e., denoising score matching,…

Machine Learning · Computer Science 2025-02-21 Xiangming Gu , Chao Du , Tianyu Pang , Chongxuan Li , Min Lin , Ye Wang

Despite their success in image generation, diffusion models can memorize training data, raising serious privacy and copyright concerns. Although prior work has sought to characterize, detect, and mitigate memorization, the fundamental…

Computer Vision and Pattern Recognition · Computer Science 2025-10-01 Juyeop Kim , Songkuk Kim , Jong-Seok Lee

Diffusion models are central to modern generative modeling, and understanding how they balance memorization and generalization is critical for reliable deployment. Recent work has shown that memorization in diffusion models is shaped by…

Machine Learning · Computer Science 2026-04-28 Bingqing Jiang , Difan Zou

Diffusion models have achieved remarkable success across diverse domains, but they remain vulnerable to memorization -- reproducing training data rather than generating novel outputs. This not only limits their creative potential but also…

Machine Learning · Statistics 2025-11-10 Zeqi Ye , Qijie Zhu , Molei Tao , Minshuo Chen

Diffusion models have demonstrated remarkable capability in generating high-quality visual content from textual descriptions. However, since these models are trained on large-scale internet data, they inevitably learn undesirable concepts,…

Machine Learning · Computer Science 2025-02-18 Anh Bui , Khanh Doan , Trung Le , Paul Montague , Tamas Abraham , Dinh Phung

Pretrained large language models (LLMs) have revolutionized natural language processing (NLP) tasks such as summarization, question answering, and translation. However, LLMs pose significant security risks due to their tendency to memorize…

Computation and Language · Computer Science 2024-09-24 Zhepeng Wang , Runxue Bao , Yawen Wu , Jackson Taylor , Cao Xiao , Feng Zheng , Weiwen Jiang , Shangqian Gao , Yanfu Zhang
‹ Prev 1 2 3 10 Next ›