Related papers: ID-Consistent, Precise Expression Generation with …

Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation

In human-centric content generation, pre-trained text-to-image models struggle to produce the portrait images users want, which retain the identity of individuals while exhibiting diverse expressions. This paper introduces our efforts…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Renshuai Liu, Bowen Ma, Wei Zhang, Zhipeng Hu, Changjie Fan, Tangjie Lv, Yu Ding, Xuan Cheng

FaceCrafter: Identity-Conditional Diffusion with Disentangled Control over Facial Pose, Expression, and Emotion

Human facial images encode a rich spectrum of information, encompassing both stable identity-related traits and mutable attributes such as pose, expression, and emotion. While recent advances in image generation have enabled high-quality…

Computer Vision and Pattern Recognition · Computer Science 2025-08-26 Kazuaki Mishima, Antoni Bigata Casademunt, Stavros Petridis, Maja Pantic, Kenji Suzuki

ID-Booth: Identity-consistent Face Generation with Diffusion Models

Recent advances in generative modeling have enabled the generation of high-quality synthetic data that is applicable in a variety of domains, including face recognition. Here, state-of-the-art generative models typically rely on…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Darian Tomašević, Fadi Boutros, Chenhao Lin, Naser Damer, Vitomir Štruc, Peter Peer

Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy

Different forms of customized 2D avatars are widely used in gaming applications, virtual communication, education, and content creation. However, existing approaches often fail to capture fine-grained facial expressions and struggle to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-14 Hao Yu, Rupayan Mallick, Margrit Betke, Sarah Adel Bargal

DiffFace: Diffusion-based Face Swapping with Facial Guidance

In this paper, we propose DiffFace, the first diffusion-based face swapping framework, composed of ID-conditional DDPM training, sampling with facial guidance, and target-preserving blending. Specifically, in the training…

Computer Vision and Pattern Recognition · Computer Science 2022-12-29 Kihong Kim, Yunho Kim, Seokju Cho, Junyoung Seo, Jisu Nam, Kychul Lee, Seungryong Kim, KwangHee Lee

FaceDiffuser: Speech-Driven 3D Facial Animation Synthesis Using Diffusion

Speech-driven 3D facial animation synthesis has been a challenging task both in industry and research. Recent methods mostly focus on deterministic deep learning methods meaning that given a speech input, the output is always the same.…

Computer Vision and Pattern Recognition · Computer Science 2023-09-21 Stefan Stan, Kazi Injamamul Haque, Zerrin Yumak

FastFace: Tuning Identity Preservation in Distilled Diffusion via Guidance and Attention

In recent years, a plethora of identity-preserving adapters for personalized generation with diffusion models have been released. Their main disadvantage is that they are predominantly trained jointly with base diffusion models, which suffer…

Computer Vision and Pattern Recognition · Computer Science 2025-05-30 Sergey Karpukhin, Vadim Titov, Andrey Kuznetsov, Aibek Alanov

IdentiFace: Multi-Modal Iterative Diffusion Framework for Identifiable Suspect Face Generation in Crime Investigations

Suspect face generation remains a technical challenge in crime investigations. Traditional sketch-drawing workflows suffer from low efficiency and quality, while diffusion-based approaches still face intrinsic limitations on conditional…

Computer Vision and Pattern Recognition · Computer Science 2026-05-04 Weichen Liu, Yixin Yang, Changsheng Chen, Alex Kot

Learning Disentangled Speech- and Expression-Driven Blendshapes for 3D Talking Face Animation

Expressions are fundamental to conveying human emotions. With the rapid advancement of AI-generated content (AIGC), realistic and expressive 3D facial animation has become increasingly crucial. Despite recent progress in speech-driven…

Computer Vision and Pattern Recognition · Computer Science 2025-10-30 Yuxiang Mao, Zhijie Zhang, Zhiheng Zhang, Jiawei Liu, Chen Zeng, Shihong Xia

Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control

Current face reenactment and swapping methods mainly rely on GAN frameworks, but recent focus has shifted to pre-trained diffusion models for their superior generation capabilities. However, training these models is resource-intensive, and…

Computer Vision and Pattern Recognition · Computer Science 2024-07-10 Yue Han, Junwei Zhu, Keke He, Xu Chen, Yanhao Ge, Wei Li, Xiangtai Li, Jiangning Zhang, Chengjie Wang, Yong Liu

Controllable Expressive 3D Facial Animation via Diffusion in a Unified Multimodal Space

Audio-driven emotional 3D facial animation encounters two significant challenges: (1) reliance on single-modal control signals (videos, text, or emotion labels) without leveraging their complementary strengths for comprehensive emotion…

Multimedia · Computer Science 2025-06-13 Kangwei Liu, Junwu Liu, Xiaowei Yi, Jinlin Guo, Yun Cao

Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation

Recent advances in generative diffusion models have enabled the previously unfeasible capability of generating 3D assets from a single input image or a text prompt. In this work, we aim to enhance the quality and functionality of these…

Computer Vision and Pattern Recognition · Computer Science 2024-04-03 Xiyi Chen, Marko Mihajlovic, Shaofei Wang, Sergey Prokudin, Siyu Tang

High-Fidelity Diffusion Face Swapping with ID-Constrained Facial Conditioning

Face swapping aims to seamlessly transfer a source facial identity onto a target while preserving target attributes such as pose and expression. Diffusion models, known for their superior generative capabilities, have recently shown promise…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Dailan He, Xiahong Wang, Shulun Wang, Guanglu Song, Bingqi Ma, Hao Shao, Yu Liu, Hongsheng Li

ExpPortrait: Expressive Portrait Generation via Personalized Representation

While diffusion models have shown great potential in portrait generation, generating expressive, coherent, and controllable cinematic portrait videos remains a significant challenge. Existing intermediate signals for portrait generation,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-26 Junyi Wang, Yudong Guo, Boyang Guo, Shengming Yang, Juyong Zhang

Model See Model Do: Speech-Driven Facial Animation with Style Control

Speech-driven 3D facial animation plays a key role in applications such as virtual avatars, gaming, and digital content creation. While existing methods have made significant progress in achieving accurate lip synchronization and generating…

Graphics · Computer Science 2025-07-16 Yifang Pan, Karan Singh, Luiz Gustavo Hafemann

Beyond Inserting: Learning Identity Embedding for Semantic-Fidelity Personalized Diffusion Generation

Advanced diffusion-based Text-to-Image (T2I) models, such as the Stable Diffusion Model, have made significant progress in generating diverse and high-quality images using text prompts alone. However, when non-famous users require…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Yang Li, Songlin Yang, Wei Wang, Jing Dong

Supervised makeup transfer with a curated dataset: Decoupling identity and makeup features for enhanced transformation

Diffusion models have recently shown strong progress in generative tasks, offering a more stable alternative to GAN-based approaches for makeup transfer. Existing methods often suffer from limited datasets, poor disentanglement between…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Qihe Pan, Yiming Wu, Xing Zhao, Liang Xie, Guodao Sun, Ronghua Liang

Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Talking face generation has historically struggled to produce head movements and natural facial expressions without guidance from additional reference videos. Recent developments in diffusion-based generative models allow for more realistic…

Computer Vision and Pattern Recognition · Computer Science 2023-08-01 Michał Stypułkowski, Konstantinos Vougioukas, Sen He, Maciej Zięba, Stavros Petridis, Maja Pantic

Face Swap via Diffusion Model

This technical report presents a diffusion model based framework for face swapping between two portrait images. The basic framework consists of three components, i.e., IP-Adapter, ControlNet, and Stable Diffusion's inpainting pipeline, for…

Computer Vision and Pattern Recognition · Computer Science 2024-05-30 Feifei Wang

Stable Video-Driven Portraits

Portrait animation aims to generate photo-realistic videos from a single source image by reenacting the expression and pose from a driving video. While early methods relied on 3D morphable models or feature warping techniques, they often…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Mallikarjun B. R., Fei Yin, Vikram Voleti, Nikita Drobyshev, Maksim Lapin, Aaryaman Vasishta, Varun Jampani