Related papers: CusConcept: Customized Visual Concept Decompositio…

OmniPrism: Learning Disentangled Visual Concept for Image Generation

Creative visual concept generation often draws inspiration from specific concepts in a reference image to produce relevant outcomes. However, existing methods are typically constrained to single-aspect concept generation or are easily…

Computer Vision and Pattern Recognition · Computer Science 2026-04-13 Yangyang Li , Daqing Liu , Wu Liu , Allen He , Xinchen Liu , Yongdong Zhang , Guoqing Jin

Multi-Concept Customization of Text-to-Image Diffusion

While generative models produce high-quality images of concepts learned from a large-scale database, a user often wishes to synthesize instantiations of their own concepts (for example, their family, pets, or items). Can we teach a model to…

Computer Vision and Pattern Recognition · Computer Science 2023-06-21 Nupur Kumari , Bingliang Zhang , Richard Zhang , Eli Shechtman , Jun-Yan Zhu

A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models

Text-to-image diffusion models have made significant advancements in generating high-quality, diverse images from text prompts. However, the inherent limitations of textual signals often prevent these models from fully capturing specific…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Ziqiang Li , Jun Li , Lizhi Xiong , Zhangjie Fu , Zechao Li

Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models

While there has been significant progress in customizing text-to-image generation models, generating images that combine multiple personalized concepts remains challenging. In this work, we introduce Concept Weaver, a method for composing…

Computer Vision and Pattern Recognition · Computer Science 2024-04-08 Gihyun Kwon , Simon Jenni , Dingzeyu Li , Joon-Young Lee , Jong Chul Ye , Fabian Caba Heilbron

TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation

Despite significant advancements in customizing text-to-image and video generation models, generating images and videos that effectively integrate multiple personalized concepts remains a challenging task. To address this, we present…

Computer Vision and Pattern Recognition · Computer Science 2025-03-05 Gihyun Kwon , Jong Chul Ye

CatVersion: Concatenating Embeddings for Diffusion-Based Text-to-Image Personalization

We propose CatVersion, an inversion-based method that learns the personalized concept through a handful of examples. Subsequently, users can utilize text prompts to generate images that embody the personalized concept, thereby achieving…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Ruoyu Zhao , Mingrui Zhu , Shiyin Dong , Nannan Wang , Xinbo Gao

Concept Decomposition for Visual Exploration and Inspiration

A creative idea is often born from transforming, combining, and modifying ideas from existing visual examples capturing various concepts. However, one cannot simply copy the concept as a whole, and inspiration is achieved by examining…

Computer Vision and Pattern Recognition · Computer Science 2023-06-01 Yael Vinker , Andrey Voynov , Daniel Cohen-Or , Ariel Shamir

Visual Concepts Tokenization

Obtaining the human-like perception ability of abstracting visual concepts from concrete pixels has always been a fundamental and important target in machine learning research fields such as disentangled representation learning and scene…

Computer Vision and Pattern Recognition · Computer Science 2022-10-14 Tao Yang , Yuwang Wang , Yan Lu , Nanning Zheng

MultiBooth: Towards Generating All Your Concepts in an Image from Text

This paper introduces MultiBooth, a novel and efficient technique for multi-concept customization in image generation from text. Despite the significant advancements in customized generation methods, particularly with the success of…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Chenyang Zhu , Kai Li , Yue Ma , Chunming He , Xiu Li

A Concept is More Than a Word: Diversified Unlearning in Text-to-Image Diffusion Models

Concept unlearning has emerged as a promising direction for reducing the risks of harmful content generation in text-to-image diffusion models by selectively erasing undesirable concepts from a model's parameters. Existing approaches…

Artificial Intelligence · Computer Science 2026-03-20 Duc Hao Pham , Van Duy Truong , Duy Khanh Dinh , Tien Cuong Nguyen , Dien Hy Ngo , Tuan Anh Bui

ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

While personalized text-to-image generation has enabled the learning of a single concept from multiple images, a more practical yet challenging scenario involves learning multiple concepts within a single image. However, existing works…

Computer Vision and Pattern Recognition · Computer Science 2024-07-10 Shaozhe Hao , Kai Han , Zhengyao Lv , Shihao Zhao , Kwan-Yee K. Wong

The Hidden Language of Diffusion Models

Text-to-image diffusion models have demonstrated an unparalleled ability to generate high-quality, diverse images from a textual prompt. However, the internal representations learned by these models remain an enigma. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2023-10-06 Hila Chefer , Oran Lang , Mor Geva , Volodymyr Polosukhin , Assaf Shocher , Michal Irani , Inbar Mosseri , Lior Wolf

ControlCom: Controllable Image Composition using Diffusion Model

Image composition targets at synthesizing a realistic composite image from a pair of foreground and background images. Recently, generative composition methods are built on large pretrained diffusion models to generate composite images,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Bo Zhang , Yuxuan Duan , Jun Lan , Yan Hong , Huijia Zhu , Weiqiang Wang , Li Niu

ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models

The inherent ambiguity in defining visual concepts poses significant challenges for modern generative models, such as the diffusion-based Text-to-Image (T2I) models, in accurately learning concepts from a single image. Existing methods lack…

Computer Vision and Pattern Recognition · Computer Science 2025-04-22 Fernando Julio Cendra , Kai Han

Canvas-to-Image: Compositional Image Generation with Multimodal Controls

While modern diffusion models excel at generating high-quality and diverse images, they still struggle with high-fidelity compositional and multimodal control, particularly when users simultaneously specify text prompts, subject references,…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Yusuf Dalva , Guocheng Gordon Qian , Maya Goldenberg , Tsai-Shien Chen , Kfir Aberman , Sergey Tulyakov , Pinar Yanardag , Kuan-Chieh Jackson Wang

SegDiscover: Visual Concept Discovery via Unsupervised Semantic Segmentation

Visual concept discovery has long been deemed important to improve interpretability of neural networks, because a bank of semantically meaningful concepts would provide us with a starting point for building machine learning models that…

Computer Vision and Pattern Recognition · Computer Science 2022-04-26 Haiyang Huang , Zhi Chen , Cynthia Rudin

Scaling Concept With Text-Guided Diffusion Models

Text-guided diffusion models have revolutionized generative tasks by producing high-fidelity content from text descriptions. They have also enabled an editing paradigm where concepts can be replaced through text conditioning (e.g., a dog to…

Computer Vision and Pattern Recognition · Computer Science 2024-11-01 Chao Huang , Susan Liang , Yunlong Tang , Yapeng Tian , Anurag Kumar , Chenliang Xu

Non-confusing Generation of Customized Concepts in Diffusion Models

We tackle the common challenge of inter-concept visual confusion in compositional concept generation using text-guided diffusion models (TGDMs). It becomes even more pronounced in the generation of customized concepts, due to the scarcity…

Computer Vision and Pattern Recognition · Computer Science 2024-05-14 Wang Lin , Jingyuan Chen , Jiaxin Shi , Yichen Zhu , Chen Liang , Junzhong Miao , Tao Jin , Zhou Zhao , Fei Wu , Shuicheng Yan , Hanwang Zhang

Concept Conductor: Orchestrating Multiple Personalized Concepts in Text-to-Image Synthesis

The customization of text-to-image models has seen significant advancements, yet generating multiple personalized concepts remains a challenging task. Current methods struggle with attribute leakage and layout confusion when handling…

Computer Vision and Pattern Recognition · Computer Science 2024-09-10 Zebin Yao , Fangxiang Feng , Ruifan Li , Xiaojie Wang

Concept Corrector: Erase concepts on the fly for text-to-image diffusion models

Text-to-image diffusion models have demonstrated the underlying risk of generating various unwanted content, such as sexual elements. To address this issue, the task of concept erasure has been introduced, aiming to erase any undesired…

Computer Vision and Pattern Recognition · Computer Science 2025-06-04 Zheling Meng , Bo Peng , Xiaochuan Jin , Yueming Lyu , Wei Wang , Jing Dong , Tieniu Tan