English
Related papers

Related papers: CharacterFactory: Sampling Consistent Characters w…

200 papers

Character Animation aims to generating character videos from still images through driving signals. Currently, diffusion models have become the mainstream in visual generation research, owing to their robust generative capabilities. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Li Hu , Xin Gao , Peng Zhang , Ke Sun , Bang Zhang , Liefeng Bo

Large-scale text-to-image diffusion models, (e.g., DALL-E, SDXL) are capable of generating famous persons by simply referring to their names. Is it possible to make such models generate generic identities as simple as the famous ones, e.g.,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Jing Zhao , Heliang Zheng , Chaoyue Wang , Long Lan , Wanrong Hunag , Yuhua Tang

Exquisite demand exists for customizing the pretrained large text-to-image model, $\textit{e.g.}$, Stable Diffusion, to generate innovative concepts, such as the users themselves. However, the newly-added concept from previous customization…

Computer Vision and Pattern Recognition · Computer Science 2023-06-02 Ge Yuan , Xiaodong Cun , Yong Zhang , Maomao Li , Chenyang Qi , Xintao Wang , Ying Shan , Huicheng Zheng

Customized image generation, which seeks to synthesize images with consistent characters, holds significant relevance for applications such as storytelling, portrait generation, and character design. However, previous approaches have…

Computer Vision and Pattern Recognition · Computer Science 2024-10-01 Yuhang Ma , Wenting Xu , Jiji Tang , Qinfeng Jin , Rongsheng Zhang , Zeng Zhao , Changjie Fan , Zhipeng Hu

Text-to-image diffusion models have remarkably excelled in producing diverse, high-quality, and photo-realistic images. This advancement has spurred a growing interest in incorporating specific identities into generated content. Most…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Xiaoming Li , Xinyu Hou , Chen Change Loy

Text-to-image diffusion models have achieved widespread popularity due to their unprecedented image generation capability. In particular, their ability to synthesize and modify human faces has spurred research into using generated face…

Computer Vision and Pattern Recognition · Computer Science 2023-12-22 Harrison Rosenberg , Shimaa Ahmed , Guruprasad V Ramesh , Ramya Korlakai Vinayak , Kassem Fawaz

The current state-of-the-art Diffusion model has demonstrated excellent results in generating images. However, the images are monotonous and are mostly the result of the distribution of images of people in the training set, making it…

Computer Vision and Pattern Recognition · Computer Science 2023-05-18 Tianyu Chen

Recent advances in text-to-image generation models have unlocked vast potential for visual creativity. However, the users that use these models struggle with the generation of consistent characters, a crucial aspect for numerous real-world…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Omri Avrahami , Amir Hertz , Yael Vinker , Moab Arar , Shlomi Fruchter , Ohad Fried , Daniel Cohen-Or , Dani Lischinski

In the field of digital content creation, generating high-quality 3D characters from single images is challenging, especially given the complexities of various body poses and the issues of self-occlusion and pose ambiguity. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2024-07-11 Hao-Yang Peng , Jia-Peng Zhang , Meng-Hao Guo , Yan-Pei Cao , Shi-Min Hu

Recent advances in Generative Adversarial Networks GANs applications continue to attract the attention of researchers in different fields. In such a framework, two neural networks compete adversely to generate new visual contents…

Artificial Intelligence · Computer Science 2023-11-27 Mohammad Lataifeh , Xavier A Carrascoa , Ashraf M Elnagara , Naveed Ahmeda , Imran Junejo

Recent visual generative models enable story generation with consistent characters from text, but human-centric story generation faces additional challenges, such as maintaining detailed and diverse human face consistency and coordinating…

Computer Vision and Pattern Recognition · Computer Science 2025-12-30 Donghao Zhou , Jingyu Lin , Guibao Shen , Quande Liu , Jialin Gao , Lihao Liu , Lan Du , Cunjian Chen , Chi-Wing Fu , Xiaowei Hu , Pheng-Ann Heng

Generative models are now widely used by graphic designers and artists. Prior works have shown that these models remember and often replicate content from their training data during generation. Hence as their proliferation increases, it has…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Gowthami Somepalli , Anubhav Gupta , Kamal Gupta , Shramay Palta , Micah Goldblum , Jonas Geiping , Abhinav Shrivastava , Tom Goldstein

Different forms of customized 2D avatars are widely used in gaming applications, virtual communication, education, and content creation. However, existing approaches often fail to capture fine-grained facial expressions and struggle to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-14 Hao Yu , Rupayan Mallick , Margrit Betke , Sarah Adel Bargal

The recent advancements in image-text diffusion models have stimulated research interest in large-scale 3D generative models. Nevertheless, the limited availability of diverse 3D resources presents significant challenges to learning. In…

Computer Vision and Pattern Recognition · Computer Science 2023-06-01 Chi Zhang , Yiwen Chen , Yijun Fu , Zhenglin Zhou , Gang YU , Billzb Wang , Bin Fu , Tao Chen , Guosheng Lin , Chunhua Shen

Generative adversarial networks (GANs) have demonstrated great success in generating various visual content. However, images generated by existing GANs are often of attributes (e.g., smiling expression) learned from one image domain. As a…

Computer Vision and Pattern Recognition · Computer Science 2019-10-04 Zehui Yao , Boyan Zhang , Zhiyong Wang , Wanli Ouyang , Dong Xu , Dagan Feng

We introduce CharacterGAN, a generative model that can be trained on only a few samples (8 - 15) of a given character. Our model generates novel poses based on keypoint locations, which can be modified in real time while providing…

Computer Vision and Pattern Recognition · Computer Science 2022-01-13 Tobias Hinz , Matthew Fisher , Oliver Wang , Eli Shechtman , Stefan Wermter

Text-to-story visualization is challenging due to the need for consistent interaction among multiple characters across frames. Existing methods struggle with character consistency, leading to artifact generation and inaccurate dialogue…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Ayan Banerjee , Josep Llados , Umapada Pal , Anjan Dutta

Current learning-based subject customization approaches, predominantly relying on U-Net architectures, suffer from limited generalization ability and compromised image quality. Meanwhile, optimization-based methods require subject-specific…

Computer Vision and Pattern Recognition · Computer Science 2025-04-18 Jiale Tao , Yanbing Zhang , Qixun Wang , Yiji Cheng , Haofan Wang , Xu Bai , Zhengguang Zhou , Ruihuang Li , Linqing Wang , Chunyu Wang , Qin Lin , Qinglin Lu

Deep generative models have recently presented impressive results in generating realistic face images of random synthetic identities. To generate multiple samples of a certain synthetic identity, previous works proposed to disentangle the…

Computer Vision and Pattern Recognition · Computer Science 2023-07-20 Fadi Boutros , Marcel Klemt , Meiling Fang , Arjan Kuijper , Naser Damer

Text-to-image diffusion models benefit artists with high-quality image generation. Yet their stochastic nature hinders artists from creating consistent images of the same subject. Existing methods try to tackle this challenge and generate…

Computer Vision and Pattern Recognition · Computer Science 2024-10-29 Jiahao Wang , Caixia Yan , Haonan Lin , Weizhan Zhang , Mengmeng Wang , Tieliang Gong , Guang Dai , Hao Sun
‹ Prev 1 2 3 10 Next ›