Related papers: Robust Concept Erasure Using Task Vectors

Concept Corrector: Erase concepts on the fly for text-to-image diffusion models

Text-to-image diffusion models have demonstrated the underlying risk of generating various unwanted content, such as sexual elements. To address this issue, the task of concept erasure has been introduced, aiming to erase any undesired…

Computer Vision and Pattern Recognition · Computer Science 2025-06-04 Zheling Meng , Bo Peng , Xiaochuan Jin , Yueming Lyu , Wei Wang , Jing Dong , Tieniu Tan

Circumventing Concept Erasure Methods For Text-to-Image Generative Models

Text-to-image generative models can produce photo-realistic images for an extremely broad range of concepts, and their usage has proliferated widely among the general public. On the flip side, these models have numerous drawbacks, including…

Machine Learning · Computer Science 2023-10-10 Minh Pham , Kelly O. Marshall , Niv Cohen , Govind Mittal , Chinmay Hegde

Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers

Concept erasure in text-to-image diffusion models aims to disable pre-trained diffusion models from generating images related to a target concept. To perform reliable concept erasure, the properties of robustness and locality are desirable.…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Chi-Pin Huang , Kai-Po Chang , Chung-Ting Tsai , Yung-Hsuan Lai , Fu-En Yang , Yu-Chiang Frank Wang

Pruning for Robust Concept Erasing in Diffusion Models

Despite the impressive capabilities of generating images, text-to-image diffusion models are susceptible to producing undesirable outputs such as NSFW content and copyrighted artworks. To address this issue, recent studies have focused on…

Computer Vision and Pattern Recognition · Computer Science 2024-05-28 Tianyun Yang , Juan Cao , Chang Xu

Espresso: Robust Concept Filtering in Text-to-Image Models

Diffusion based text-to-image models are trained on large datasets scraped from the Internet, potentially containing unacceptable concepts (e.g., copyright-infringing or unsafe). We need concept removal techniques (CRTs) which are i)…

Computer Vision and Pattern Recognition · Computer Science 2025-02-27 Anudeep Das , Vasisht Duddu , Rui Zhang , N. Asokan

Erasing with Precision: Evaluating Specific Concept Erasure from Text-to-Image Generative Models

Studies have been conducted to prevent specific concepts from being generated from pretrained text-to-image generative models, achieving concept erasure in various ways. However, the performance evaluation of these studies is still largely…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Masane Fuchi , Tomohiro Takagi

Differential Vector Erasure: Unified Training-Free Concept Erasure for Flow Matching Models

Text-to-image diffusion models have demonstrated remarkable capabilities in generating high-quality images, yet their tendency to reproduce undesirable concepts, such as NSFW content, copyrighted styles, or specific objects, poses growing…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Zhiqi Zhang , Xinhao Zhong , Yi Sun , Shuoyang Sun , Bin Chen , Shu-Tao Xia , Xuan Wang

Erasing More Than Intended? How Concept Erasure Degrades the Generation of Non-Target Concepts

Concept erasure techniques have recently gained significant attention for their potential to remove unwanted concepts from text-to-image models. While these methods often demonstrate promising results in controlled settings, their…

Computer Vision and Pattern Recognition · Computer Science 2025-10-09 Ibtihel Amara , Ahmed Imtiaz Humayun , Ivana Kajic , Zarana Parekh , Natalie Harris , Sarah Young , Chirag Nagpal , Najoung Kim , Junfeng He , Cristina Nader Vasconcelos , Deepak Ramachandran , Golnoosh Farnadi , Katherine Heller , Mohammad Havaei , Negar Rostamzadeh

A Concept is More Than a Word: Diversified Unlearning in Text-to-Image Diffusion Models

Concept unlearning has emerged as a promising direction for reducing the risks of harmful content generation in text-to-image diffusion models by selectively erasing undesirable concepts from a model's parameters. Existing approaches…

Artificial Intelligence · Computer Science 2026-03-20 Duc Hao Pham , Van Duy Truong , Duy Khanh Dinh , Tien Cuong Nguyen , Dien Hy Ngo , Tuan Anh Bui

Erased or Dormant? Rethinking Concept Erasure Through Reversibility

To what extent does concept erasure eliminate generative capacity in diffusion models? While prior evaluations have primarily focused on measuring concept suppression under specific textual prompts, we explore a complementary and…

Computer Vision and Pattern Recognition · Computer Science 2025-09-19 Ping Liu , Chi Zhang

One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework

Concept erasing has recently emerged as an effective paradigm to prevent text-to-image diffusion models from generating visually undesirable or even harmful content. However, current removal methods heavily rely on manually crafted text…

Computer Vision and Pattern Recognition · Computer Science 2025-05-27 Feiran Li , Qianqian Xu , Shilong Bao , Zhiyong Yang , Xiaochun Cao , Qingming Huang

A Comprehensive Survey on Concept Erasure in Text-to-Image Diffusion Models

Text-to-Image (T2I) models have made remarkable progress in generating high-quality, diverse visual content from natural language prompts. However, their ability to reproduce copyrighted styles, sensitive imagery, and harmful content raises…

Computer Vision and Pattern Recognition · Computer Science 2025-06-09 Changhoon Kim , Yanjun Qi

ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning

While large-scale text-to-image diffusion models have demonstrated impressive image-generation capabilities, there are significant concerns about their potential misuse for generating unsafe content, violating copyright, and perpetuating…

Computer Vision and Pattern Recognition · Computer Science 2024-05-30 Ruchika Chavhan , Da Li , Timothy Hospedales

Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model

Concept Erasure, which aims to prevent pretrained text-to-image models from generating content associated with semantic-harmful concepts (i.e., target concepts), is getting increased attention. State-of-the-art methods formulate this task…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Hongxu Chen , Zhen Wang , Taoran Mei , Lin Li , Bowei Zhu , Runshi Li , Long Chen

PROBE: Diagnosing Residual Concept Capacity in Erased Text-to-Video Diffusion Models

Concept erasure techniques for text-to-video (T2V) diffusion models report substantial suppression of sensitive content, yet current evaluation is limited to checking whether the target concept is absent from generated frames, treating…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Yiwei Xie , Zheng Zhang , Ping Liu

Erasing Concepts from Diffusion Models

Motivated by recent advancements in text-to-image diffusion, we study erasure of specific concepts from the model's weights. While Stable Diffusion has shown promise in producing explicit or realistic artwork, it has raised concerns…

Computer Vision and Pattern Recognition · Computer Science 2023-06-22 Rohit Gandikota , Joanna Materzynska , Jaden Fiotto-Kaufman , David Bau

ACE: Anti-Editing Concept Erasure in Text-to-Image Models

Recent advance in text-to-image diffusion models have significantly facilitated the generation of high-quality images, but also raising concerns about the illegal creation of harmful content, such as copyrighted images. Existing concept…

Computer Vision and Pattern Recognition · Computer Science 2025-01-06 Zihao Wang , Yuxiang Wei , Fan Li , Renjing Pei , Hang Xu , Wangmeng Zuo

Beyond Text Prompts: Precise Concept Erasure through Text-Image Collaboration

Text-to-image generative models have achieved impressive fidelity and diversity, but can inadvertently produce unsafe or undesirable content due to implicit biases embedded in large-scale training datasets. Existing concept erasure methods,…

Computer Vision and Pattern Recognition · Computer Science 2026-04-20 Jun Li , Lizhi Xiong , Ziqiang Li , Weiwei Jiang , Zhangjie Fu , Yong Li , Guo-Sen Xie

T2VUnlearning: A Concept Erasing Method for Text-to-Video Diffusion Models

Recent advances in text-to-video (T2V) diffusion models have significantly enhanced the quality of generated videos. However, their capability to produce explicit or harmful content introduces new challenges related to misuse and potential…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Xiaoyu Ye , Songjie Cheng , Yongtao Wang , Yajiao Xiong , Yishen Li

Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Gate

Remarkable progress in text-to-image diffusion models has brought a major concern about potentially generating images on inappropriate or trademarked concepts. Concept erasing has been investigated with the goals of deleting target concepts…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Byung Hyun Lee , Sungjin Lim , Seunggyu Lee , Dong Un Kang , Se Young Chun