English
Related papers

Related papers: Robust Concept Erasure Using Task Vectors

200 papers

Text-to-image diffusion models have demonstrated the underlying risk of generating various unwanted content, such as sexual elements. To address this issue, the task of concept erasure has been introduced, aiming to erase any undesired…

Computer Vision and Pattern Recognition · Computer Science 2025-06-04 Zheling Meng , Bo Peng , Xiaochuan Jin , Yueming Lyu , Wei Wang , Jing Dong , Tieniu Tan

Text-to-image generative models can produce photo-realistic images for an extremely broad range of concepts, and their usage has proliferated widely among the general public. On the flip side, these models have numerous drawbacks, including…

Machine Learning · Computer Science 2023-10-10 Minh Pham , Kelly O. Marshall , Niv Cohen , Govind Mittal , Chinmay Hegde

Concept erasure in text-to-image diffusion models aims to disable pre-trained diffusion models from generating images related to a target concept. To perform reliable concept erasure, the properties of robustness and locality are desirable.…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Chi-Pin Huang , Kai-Po Chang , Chung-Ting Tsai , Yung-Hsuan Lai , Fu-En Yang , Yu-Chiang Frank Wang

Despite the impressive capabilities of generating images, text-to-image diffusion models are susceptible to producing undesirable outputs such as NSFW content and copyrighted artworks. To address this issue, recent studies have focused on…

Computer Vision and Pattern Recognition · Computer Science 2024-05-28 Tianyun Yang , Juan Cao , Chang Xu

Diffusion based text-to-image models are trained on large datasets scraped from the Internet, potentially containing unacceptable concepts (e.g., copyright-infringing or unsafe). We need concept removal techniques (CRTs) which are i)…

Computer Vision and Pattern Recognition · Computer Science 2025-02-27 Anudeep Das , Vasisht Duddu , Rui Zhang , N. Asokan

Studies have been conducted to prevent specific concepts from being generated from pretrained text-to-image generative models, achieving concept erasure in various ways. However, the performance evaluation of these studies is still largely…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Masane Fuchi , Tomohiro Takagi

Text-to-image diffusion models have demonstrated remarkable capabilities in generating high-quality images, yet their tendency to reproduce undesirable concepts, such as NSFW content, copyrighted styles, or specific objects, poses growing…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Zhiqi Zhang , Xinhao Zhong , Yi Sun , Shuoyang Sun , Bin Chen , Shu-Tao Xia , Xuan Wang

Concept erasure techniques have recently gained significant attention for their potential to remove unwanted concepts from text-to-image models. While these methods often demonstrate promising results in controlled settings, their…

Concept unlearning has emerged as a promising direction for reducing the risks of harmful content generation in text-to-image diffusion models by selectively erasing undesirable concepts from a model's parameters. Existing approaches…

Artificial Intelligence · Computer Science 2026-03-20 Duc Hao Pham , Van Duy Truong , Duy Khanh Dinh , Tien Cuong Nguyen , Dien Hy Ngo , Tuan Anh Bui

To what extent does concept erasure eliminate generative capacity in diffusion models? While prior evaluations have primarily focused on measuring concept suppression under specific textual prompts, we explore a complementary and…

Computer Vision and Pattern Recognition · Computer Science 2025-09-19 Ping Liu , Chi Zhang

Concept erasing has recently emerged as an effective paradigm to prevent text-to-image diffusion models from generating visually undesirable or even harmful content. However, current removal methods heavily rely on manually crafted text…

Computer Vision and Pattern Recognition · Computer Science 2025-05-27 Feiran Li , Qianqian Xu , Shilong Bao , Zhiyong Yang , Xiaochun Cao , Qingming Huang

Text-to-Image (T2I) models have made remarkable progress in generating high-quality, diverse visual content from natural language prompts. However, their ability to reproduce copyrighted styles, sensitive imagery, and harmful content raises…

Computer Vision and Pattern Recognition · Computer Science 2025-06-09 Changhoon Kim , Yanjun Qi

While large-scale text-to-image diffusion models have demonstrated impressive image-generation capabilities, there are significant concerns about their potential misuse for generating unsafe content, violating copyright, and perpetuating…

Computer Vision and Pattern Recognition · Computer Science 2024-05-30 Ruchika Chavhan , Da Li , Timothy Hospedales

Concept Erasure, which aims to prevent pretrained text-to-image models from generating content associated with semantic-harmful concepts (i.e., target concepts), is getting increased attention. State-of-the-art methods formulate this task…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Hongxu Chen , Zhen Wang , Taoran Mei , Lin Li , Bowei Zhu , Runshi Li , Long Chen

Concept erasure techniques for text-to-video (T2V) diffusion models report substantial suppression of sensitive content, yet current evaluation is limited to checking whether the target concept is absent from generated frames, treating…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Yiwei Xie , Zheng Zhang , Ping Liu

Motivated by recent advancements in text-to-image diffusion, we study erasure of specific concepts from the model's weights. While Stable Diffusion has shown promise in producing explicit or realistic artwork, it has raised concerns…

Computer Vision and Pattern Recognition · Computer Science 2023-06-22 Rohit Gandikota , Joanna Materzynska , Jaden Fiotto-Kaufman , David Bau

Recent advance in text-to-image diffusion models have significantly facilitated the generation of high-quality images, but also raising concerns about the illegal creation of harmful content, such as copyrighted images. Existing concept…

Computer Vision and Pattern Recognition · Computer Science 2025-01-06 Zihao Wang , Yuxiang Wei , Fan Li , Renjing Pei , Hang Xu , Wangmeng Zuo

Text-to-image generative models have achieved impressive fidelity and diversity, but can inadvertently produce unsafe or undesirable content due to implicit biases embedded in large-scale training datasets. Existing concept erasure methods,…

Computer Vision and Pattern Recognition · Computer Science 2026-04-20 Jun Li , Lizhi Xiong , Ziqiang Li , Weiwei Jiang , Zhangjie Fu , Yong Li , Guo-Sen Xie

Recent advances in text-to-video (T2V) diffusion models have significantly enhanced the quality of generated videos. However, their capability to produce explicit or harmful content introduces new challenges related to misuse and potential…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Xiaoyu Ye , Songjie Cheng , Yongtao Wang , Yajiao Xiong , Yishen Li

Remarkable progress in text-to-image diffusion models has brought a major concern about potentially generating images on inappropriate or trademarked concepts. Concept erasing has been investigated with the goals of deleting target concepts…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Byung Hyun Lee , Sungjin Lim , Seunggyu Lee , Dong Un Kang , Se Young Chun
‹ Prev 1 2 3 10 Next ›