Related papers: CharacterFactory: Sampling Consistent Characters w…

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

Character Animation aims to generating character videos from still images through driving signals. Currently, diffusion models have become the mainstream in visual generation research, owing to their robust generative capabilities. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Li Hu , Xin Gao , Peng Zhang , Ke Sun , Bang Zhang , Liefeng Bo

MagicNaming: Consistent Identity Generation by Finding a "Name Space" in T2I Diffusion Models

Large-scale text-to-image diffusion models, (e.g., DALL-E, SDXL) are capable of generating famous persons by simply referring to their names. Is it possible to make such models generate generic identities as simple as the famous ones, e.g.,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Jing Zhao , Heliang Zheng , Chaoyue Wang , Long Lan , Wanrong Hunag , Yuhua Tang

Inserting Anybody in Diffusion Models via Celeb Basis

Exquisite demand exists for customizing the pretrained large text-to-image model, $\textit{e.g.}$, Stable Diffusion, to generate innovative concepts, such as the users themselves. However, the newly-added concept from previous customization…

Computer Vision and Pattern Recognition · Computer Science 2023-06-02 Ge Yuan , Xiaodong Cun , Yong Zhang , Maomao Li , Chenyang Qi , Xintao Wang , Ying Shan , Huicheng Zheng

Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization

Customized image generation, which seeks to synthesize images with consistent characters, holds significant relevance for applications such as storytelling, portrait generation, and character design. However, previous approaches have…

Computer Vision and Pattern Recognition · Computer Science 2024-10-01 Yuhang Ma , Wenting Xu , Jiji Tang , Qinfeng Jin , Rongsheng Zhang , Zeng Zhao , Changjie Fan , Zhipeng Hu

When StyleGAN Meets Stable Diffusion: a $\mathscr{W}_+$ Adapter for Personalized Image Generation

Text-to-image diffusion models have remarkably excelled in producing diverse, high-quality, and photo-realistic images. This advancement has spurred a growing interest in incorporating specific identities into generated content. Most…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Xiaoming Li , Xinyu Hou , Chen Change Loy

Limitations of Face Image Generation

Text-to-image diffusion models have achieved widespread popularity due to their unprecedented image generation capability. In particular, their ability to synthesize and modify human faces has spurred research into using generated face…

Computer Vision and Pattern Recognition · Computer Science 2023-12-22 Harrison Rosenberg , Shimaa Ahmed , Guruprasad V Ramesh , Ramya Korlakai Vinayak , Kassem Fawaz

A Method for Training-free Person Image Picture Generation

The current state-of-the-art Diffusion model has demonstrated excellent results in generating images. However, the images are monotonous and are mostly the result of the distribution of images of people in the training set, making it…

Computer Vision and Pattern Recognition · Computer Science 2023-05-18 Tianyu Chen

The Chosen One: Consistent Characters in Text-to-Image Diffusion Models

Recent advances in text-to-image generation models have unlocked vast potential for visual creativity. However, the users that use these models struggle with the generation of consistent characters, a crucial aspect for numerous real-world…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Omri Avrahami , Amir Hertz , Yael Vinker , Moab Arar , Shlomi Fruchter , Ohad Fried , Daniel Cohen-Or , Dani Lischinski

CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization

In the field of digital content creation, generating high-quality 3D characters from single images is challenging, especially given the complexities of various body poses and the issues of self-occlusion and pose ambiguity. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2024-07-11 Hao-Yang Peng , Jia-Peng Zhang , Meng-Hao Guo , Yan-Pei Cao , Shi-Min Hu

Human Machine Co-Creation. A Complementary Cognitive Approach to Creative Character Design Process Using GANs

Recent advances in Generative Adversarial Networks GANs applications continue to attract the attention of researchers in different fields. In such a framework, two neural networks compete adversely to generate new visual contents…

Artificial Intelligence · Computer Science 2023-11-27 Mohammad Lataifeh , Xavier A Carrascoa , Ashraf M Elnagara , Naveed Ahmeda , Imran Junejo

IdentityStory: Taming Your Identity-Preserving Generator for Human-Centric Story Generation

Recent visual generative models enable story generation with consistent characters from text, but human-centric story generation faces additional challenges, such as maintaining detailed and diverse human face consistency and coordinating…

Computer Vision and Pattern Recognition · Computer Science 2025-12-30 Donghao Zhou , Jingyu Lin , Guibao Shen , Quande Liu , Jialin Gao , Lihao Liu , Lan Du , Cunjian Chen , Chi-Wing Fu , Xiaowei Hu , Pheng-Ann Heng

Measuring Style Similarity in Diffusion Models

Generative models are now widely used by graphic designers and artists. Prior works have shown that these models remember and often replicate content from their training data during generation. Hence as their proliferation increases, it has…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Gowthami Somepalli , Anubhav Gupta , Kamal Gupta , Shramay Palta , Micah Goldblum , Jonas Geiping , Abhinav Shrivastava , Tom Goldstein

Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy

Different forms of customized 2D avatars are widely used in gaming applications, virtual communication, education, and content creation. However, existing approaches often fail to capture fine-grained facial expressions and struggle to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-14 Hao Yu , Rupayan Mallick , Margrit Betke , Sarah Adel Bargal

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation

The recent advancements in image-text diffusion models have stimulated research interest in large-scale 3D generative models. Nevertheless, the limited availability of diverse 3D resources presents significant challenges to learning. In…

Computer Vision and Pattern Recognition · Computer Science 2023-06-01 Chi Zhang , Yiwen Chen , Yijun Fu , Zhenglin Zhou , Gang YU , Billzb Wang , Bin Fu , Tao Chen , Guosheng Lin , Chunhua Shen

IntersectGAN: Learning Domain Intersection for Generating Images with Multiple Attributes

Generative adversarial networks (GANs) have demonstrated great success in generating various visual content. However, images generated by existing GANs are often of attributes (e.g., smiling expression) learned from one image domain. As a…

Computer Vision and Pattern Recognition · Computer Science 2019-10-04 Zehui Yao , Boyan Zhang , Zhiyong Wang , Wanli Ouyang , Dong Xu , Dagan Feng

CharacterGAN: Few-Shot Keypoint Character Animation and Reposing

We introduce CharacterGAN, a generative model that can be trained on only a few samples (8 - 15) of a given character. Our model generates novel poses based on keypoint locations, which can be modified in real time while providing…

Computer Vision and Pattern Recognition · Computer Science 2022-01-13 Tobias Hinz , Matthew Fisher , Oliver Wang , Eli Shechtman , Stefan Wermter

TaleDiffusion: Multi-Character Story Generation with Dialogue Rendering

Text-to-story visualization is challenging due to the need for consistent interaction among multiple characters across frames. Existing methods struggle with character consistency, leading to artifact generation and inaccurate dialogue…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Ayan Banerjee , Josep Llados , Umapada Pal , Anjan Dutta

InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework

Current learning-based subject customization approaches, predominantly relying on U-Net architectures, suffer from limited generalization ability and compromised image quality. Meanwhile, optimization-based methods require subject-specific…

Computer Vision and Pattern Recognition · Computer Science 2025-04-18 Jiale Tao , Yanbing Zhang , Qixun Wang , Yiji Cheng , Haofan Wang , Xu Bai , Zhengguang Zhou , Ruihuang Li , Linqing Wang , Chunyu Wang , Qin Lin , Qinglin Lu

ExFaceGAN: Exploring Identity Directions in GAN's Learned Latent Space for Synthetic Identity Generation

Deep generative models have recently presented impressive results in generating realistic face images of random synthetic identities. To generate multiple samples of a certain synthetic identity, previous works proposed to disentangle the…

Computer Vision and Pattern Recognition · Computer Science 2023-07-20 Fadi Boutros , Marcel Klemt , Meiling Fang , Arjan Kuijper , Naser Damer

OneActor: Consistent Character Generation via Cluster-Conditioned Guidance

Text-to-image diffusion models benefit artists with high-quality image generation. Yet their stochastic nature hinders artists from creating consistent images of the same subject. Existing methods try to tackle this challenge and generate…

Computer Vision and Pattern Recognition · Computer Science 2024-10-29 Jiahao Wang , Caixia Yan , Haonan Lin , Weizhan Zhang , Mengmeng Wang , Tieliang Gong , Guang Dai , Hao Sun