Related papers: Pruning for Robust Concept Erasing in Diffusion Mo…

ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning

While large-scale text-to-image diffusion models have demonstrated impressive image-generation capabilities, there are significant concerns about their potential misuse for generating unsafe content, violating copyright, and perpetuating…

Computer Vision and Pattern Recognition · Computer Science 2024-05-30 Ruchika Chavhan , Da Li , Timothy Hospedales

Robust Concept Erasure in Diffusion Models: A Theoretical Perspective on Security and Robustness

Diffusion models have achieved unprecedented success in image generation but pose increasing risks in terms of privacy, fairness, and security. A growing demand exists to \emph{erase} sensitive or harmful concepts (e.g., NSFW content,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-08 Zixuan Fu , Yan Ren , Finn Carter , Chenyue Wen , Le Ku , Daheng Yu , Emily Davis , Bo Zhang

Erased or Dormant? Rethinking Concept Erasure Through Reversibility

To what extent does concept erasure eliminate generative capacity in diffusion models? While prior evaluations have primarily focused on measuring concept suppression under specific textual prompts, we explore a complementary and…

Computer Vision and Pattern Recognition · Computer Science 2025-09-19 Ping Liu , Chi Zhang

Rethinking the Vulnerability of Concept Erasure and a New Method

The proliferation of text-to-image diffusion models has raised significant privacy and security concerns, particularly regarding the generation of copyrighted or harmful images. In response, concept erasure (defense) methods have been…

Machine Learning · Computer Science 2025-10-06 Alex D. Richardson , Kaicheng Zhang , Lucas Beerens , Dongdong Chen

Erasing Concepts from Diffusion Models

Motivated by recent advancements in text-to-image diffusion, we study erasure of specific concepts from the model's weights. While Stable Diffusion has shown promise in producing explicit or realistic artwork, it has raised concerns…

Computer Vision and Pattern Recognition · Computer Science 2023-06-22 Rohit Gandikota , Joanna Materzynska , Jaden Fiotto-Kaufman , David Bau

Comprehensive Evaluation and Analysis for NSFW Concept Erasure in Text-to-Image Diffusion Models

Text-to-image diffusion models have gained widespread application across various domains, demonstrating remarkable creative potential. However, the strong generalization capabilities of diffusion models can inadvertently lead to the…

Computer Vision and Pattern Recognition · Computer Science 2025-10-24 Die Chen , Zhiwen Li , Cen Chen , Yuexiang Xie , Xiaodan Li , Jinyan Ye , Yingda Chen , Yaliang Li

Growth Inhibitors for Suppressing Inappropriate Image Concepts in Diffusion Models

Despite their remarkable image generation capabilities, text-to-image diffusion models inadvertently learn inappropriate concepts from vast and unfiltered training data, which leads to various ethical and business risks. Specifically,…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Die Chen , Zhiwen Li , Mingyuan Fan , Cen Chen , Wenmeng Zhou , Yanhao Wang , Yaliang Li

Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation

Diffusion models excel at generating visually striking content from text but can inadvertently produce undesirable or harmful content when trained on unfiltered internet data. A practical solution is to selectively removing target concepts…

Machine Learning · Computer Science 2025-05-26 Anh Bui , Long Vuong , Khanh Doan , Trung Le , Paul Montague , Tamas Abraham , Dinh Phung

Minimalist Concept Erasure in Generative Models

Recent advances in generative models have demonstrated remarkable capabilities in producing high-quality images, but their reliance on large-scale unlabeled data has raised significant safety and copyright concerns. Efforts to address these…

Computer Vision and Pattern Recognition · Computer Science 2025-07-21 Yang Zhang , Er Jin , Yanfei Dong , Yixuan Wu , Philip Torr , Ashkan Khakzar , Johannes Stegmaier , Kenji Kawaguchi

Hiding and Recovering Knowledge in Text-to-Image Diffusion Models via Learnable Prompts

Diffusion models have demonstrated remarkable capability in generating high-quality visual content from textual descriptions. However, since these models are trained on large-scale internet data, they inevitably learn undesirable concepts,…

Machine Learning · Computer Science 2025-02-18 Anh Bui , Khanh Doan , Trung Le , Paul Montague , Tamas Abraham , Dinh Phung

Roots Beneath the Cut: Uncovering the Risk of Concept Revival in Pruning-Based Unlearning for Diffusion Models

Pruning-based unlearning has recently emerged as a fast, training-free, and data-independent approach to remove undesired concepts from diffusion models. It promises high efficiency and robustness, offering an attractive alternative to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Ci Zhang , Zhaojun Ding , Chence Yang , Jun Liu , Xiaoming Zhai , Shaoyi Huang , Beiwen Li , Xiaolong Ma , Jin Lu , Geng Yuan

All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models

Text-to-Image models such as Stable Diffusion have shown impressive image generation synthesis, thanks to the utilization of large-scale datasets. However, these datasets may contain sexually explicit, copyrighted, or undesirable content,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Seunghoo Hong , Juhun Lee , Simon S. Woo

Rethinking Robust Adversarial Concept Erasure in Diffusion Models

Concept erasure aims to selectively unlearning undesirable content in diffusion models (DMs) to reduce the risk of sensitive content generation. As a novel paradigm in concept erasure, most existing methods employ adversarial training to…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Qinghong Yin , Yu Tian , Heming Yang , Xiang Chen , Xianlin Zhang , Xueming Li , Yue Zhan

Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models

Recent advances in diffusion generative models have yielded remarkable progress. While the quality of generated content continues to improve, these models have grown considerably in size and complexity. This increasing computational burden…

Machine Learning · Computer Science 2025-03-13 Reza Shirkavand , Peiran Yu , Shangqian Gao , Gowthami Somepalli , Tom Goldstein , Heng Huang

Prototype-Guided Concept Erasure in Diffusion Models

Concept erasure is extensively utilized in image generation to prevent text-to-image models from generating undesired content. Existing methods can effectively erase narrow concepts that are specific and concrete, such as distinct…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Yuze Cai , Jiahao Lu , Hongxiang Shi , Yichao Zhou , Hong Lu

TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models

Recent advances in text-to-image diffusion models enable photorealistic image generation, but they also risk producing malicious content, such as NSFW images. To mitigate risk, concept erasure methods are studied to facilitate the model to…

Computer Vision and Pattern Recognition · Computer Science 2025-10-07 Ruidong Chen , Honglin Guo , Lanjun Wang , Chenyu Zhang , Weizhi Nie , An-An Liu

When Are Concepts Erased From Diffusion Models?

In concept erasure, a model is modified to selectively prevent it from generating a target concept. Despite the rapid development of new methods, it remains unclear how thoroughly these approaches remove the target concept from the model.…

Machine Learning · Computer Science 2025-11-10 Kevin Lu , Nicky Kriplani , Rohit Gandikota , Minh Pham , David Bau , Chinmay Hegde , Niv Cohen

Concept Corrector: Erase concepts on the fly for text-to-image diffusion models

Text-to-image diffusion models have demonstrated the underlying risk of generating various unwanted content, such as sexual elements. To address this issue, the task of concept erasure has been introduced, aiming to erase any undesired…

Computer Vision and Pattern Recognition · Computer Science 2025-06-04 Zheling Meng , Bo Peng , Xiaochuan Jin , Yueming Lyu , Wei Wang , Jing Dong , Tieniu Tan

Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models

Diffusion models (DMs) have achieved remarkable success in text-to-image generation, but they also pose safety risks, such as the potential generation of harmful content and copyright violations. The techniques of machine unlearning, also…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Yimeng Zhang , Xin Chen , Jinghan Jia , Yihua Zhang , Chongyu Fan , Jiancheng Liu , Mingyi Hong , Ke Ding , Sijia Liu

Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers

Concept erasure in text-to-image diffusion models aims to disable pre-trained diffusion models from generating images related to a target concept. To perform reliable concept erasure, the properties of robustness and locality are desirable.…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Chi-Pin Huang , Kai-Po Chang , Chung-Ting Tsai , Yung-Hsuan Lai , Fu-En Yang , Yu-Chiang Frank Wang