Related papers: Erasing Concepts from Diffusion Models

When Are Concepts Erased From Diffusion Models?

In concept erasure, a model is modified to selectively prevent it from generating a target concept. Despite the rapid development of new methods, it remains unclear how thoroughly these approaches remove the target concept from the model.…

Machine Learning · Computer Science 2025-11-10 Kevin Lu, Nicky Kriplani, Rohit Gandikota, Minh Pham, David Bau, Chinmay Hegde, Niv Cohen

All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models

Text-to-Image models such as Stable Diffusion have shown impressive image synthesis, thanks to the utilization of large-scale datasets. However, these datasets may contain sexually explicit, copyrighted, or undesirable content,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Seunghoo Hong, Juhun Lee, Simon S. Woo

Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation

Diffusion models excel at generating visually striking content from text but can inadvertently produce undesirable or harmful content when trained on unfiltered internet data. A practical solution is to selectively remove target concepts…

Machine Learning · Computer Science 2025-05-26 Anh Bui, Long Vuong, Khanh Doan, Trung Le, Paul Montague, Tamas Abraham, Dinh Phung

Concept Corrector: Erase concepts on the fly for text-to-image diffusion models

Text-to-image diffusion models have demonstrated the underlying risk of generating various unwanted content, such as sexual elements. To address this issue, the task of concept erasure has been introduced, aiming to erase any undesired…

Computer Vision and Pattern Recognition · Computer Science 2025-06-04 Zheling Meng, Bo Peng, Xiaochuan Jin, Yueming Lyu, Wei Wang, Jing Dong, Tieniu Tan

Comprehensive Evaluation and Analysis for NSFW Concept Erasure in Text-to-Image Diffusion Models

Text-to-image diffusion models have gained widespread application across various domains, demonstrating remarkable creative potential. However, the strong generalization capabilities of diffusion models can inadvertently lead to the…

Computer Vision and Pattern Recognition · Computer Science 2025-10-24 Die Chen, Zhiwen Li, Cen Chen, Yuexiang Xie, Xiaodan Li, Jinyan Ye, Yingda Chen, Yaliang Li

Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning

Generating images from text has become easier because of the scaling of diffusion models and advancements in the field of vision and language. These models are trained using vast amounts of data from the Internet. Hence, they often contain…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Masane Fuchi, Tomohiro Takagi

Pruning for Robust Concept Erasing in Diffusion Models

Despite the impressive capabilities of generating images, text-to-image diffusion models are susceptible to producing undesirable outputs such as NSFW content and copyrighted artworks. To address this issue, recent studies have focused on…

Computer Vision and Pattern Recognition · Computer Science 2024-05-28 Tianyun Yang, Juan Cao, Chang Xu

Erased or Dormant? Rethinking Concept Erasure Through Reversibility

To what extent does concept erasure eliminate generative capacity in diffusion models? While prior evaluations have primarily focused on measuring concept suppression under specific textual prompts, we explore a complementary and…

Computer Vision and Pattern Recognition · Computer Science 2025-09-19 Ping Liu, Chi Zhang

Robust Concept Erasure in Diffusion Models: A Theoretical Perspective on Security and Robustness

Diffusion models have achieved unprecedented success in image generation but pose increasing risks in terms of privacy, fairness, and security. A growing demand exists to erase sensitive or harmful concepts (e.g., NSFW content,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-08 Zixuan Fu, Yan Ren, Finn Carter, Chenyue Wen, Le Ku, Daheng Yu, Emily Davis, Bo Zhang

Ablating Concepts in Text-to-Image Diffusion Models

Large-scale text-to-image diffusion models can generate high-fidelity images with powerful compositional ability. However, these models are typically trained on an enormous amount of Internet data, often containing copyrighted material,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-17 Nupur Kumari, Bingliang Zhang, Sheng-Yu Wang, Eli Shechtman, Richard Zhang, Jun-Yan Zhu

Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers

Concept erasure in text-to-image diffusion models aims to disable pre-trained diffusion models from generating images related to a target concept. To perform reliable concept erasure, the properties of robustness and locality are desirable.…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Chi-Pin Huang, Kai-Po Chang, Chung-Ting Tsai, Yung-Hsuan Lai, Fu-En Yang, Yu-Chiang Frank Wang

Bi-Erasing: A Bidirectional Framework for Concept Removal in Diffusion Models

Concept erasure, which fine-tunes diffusion models to remove undesired or harmful visual concepts, has become a mainstream approach to mitigating unsafe or illegal image generation in text-to-image models. However, existing removal methods…

Computer Vision and Pattern Recognition · Computer Science 2025-12-17 Hao Chen, Yiwei Wang, Songze Li

T2VUnlearning: A Concept Erasing Method for Text-to-Video Diffusion Models

Recent advances in text-to-video (T2V) diffusion models have significantly enhanced the quality of generated videos. However, their capability to produce explicit or harmful content introduces new challenges related to misuse and potential…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Xiaoyu Ye, Songjie Cheng, Yongtao Wang, Yajiao Xiong, Yishen Li

FADE: Adversarial Concept Erasure in Flow Models

Diffusion models have demonstrated remarkable image generation capabilities, but also pose risks in privacy and fairness by memorizing sensitive concepts or perpetuating biases. We propose a novel concept erasure method for…

Computer Vision and Pattern Recognition · Computer Science 2025-07-17 Zixuan Fu, Yan Ren, Finn Carter, Chenyue Wang, Ze Niu, Dacheng Yu, Emily Davis, Bo Zhang

Erasure or Erosion? Evaluating Compositional Degradation in Unlearned Text-To-Image Diffusion Models

Post-hoc unlearning has emerged as a practical mechanism for removing undesirable concepts from large text-to-image diffusion models. However, prior work primarily evaluates unlearning through erasure success; its impact on broader…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Arian Komaei Koma, Seyed Amir Kasaei, Ali Aghayari, AmirMahdi Sadeghzadeh, Mohammad Hossein Rohban

ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning

While large-scale text-to-image diffusion models have demonstrated impressive image-generation capabilities, there are significant concerns about their potential misuse for generating unsafe content, violating copyright, and perpetuating…

Computer Vision and Pattern Recognition · Computer Science 2024-05-30 Ruchika Chavhan, Da Li, Timothy Hospedales

Circumventing Concept Erasure Methods For Text-to-Image Generative Models

Text-to-image generative models can produce photo-realistic images for an extremely broad range of concepts, and their usage has proliferated widely among the general public. On the flip side, these models have numerous drawbacks, including…

Machine Learning · Computer Science 2023-10-10 Minh Pham, Kelly O. Marshall, Niv Cohen, Govind Mittal, Chinmay Hegde

TRACE: Trajectory-Constrained Concept Erasure in Diffusion Models

Text-to-image diffusion models have shown unprecedented generative capability, but their ability to produce undesirable concepts (e.g. pornographic content, sensitive identities, copyrighted styles) poses serious concerns for privacy,…

Computer Vision and Pattern Recognition · Computer Science 2025-05-30 Finn Carter

Safe and Reliable Diffusion Models via Subspace Projection

Large-scale text-to-image (T2I) diffusion models have revolutionized image generation, enabling the synthesis of highly detailed visuals from textual descriptions. However, these models may inadvertently generate inappropriate content, such…

Computer Vision and Pattern Recognition · Computer Science 2025-03-24 Huiqiang Chen, Tianqing Zhu, Linlin Wang, Xin Yu, Longxiang Gao, Wanlei Zhou

A Comprehensive Survey on Concept Erasure in Text-to-Image Diffusion Models

Text-to-Image (T2I) models have made remarkable progress in generating high-quality, diverse visual content from natural language prompts. However, their ability to reproduce copyrighted styles, sensitive imagery, and harmful content raises…

Computer Vision and Pattern Recognition · Computer Science 2025-06-09 Changhoon Kim, Yanjun Qi