Related papers: ID-Consistent, Precise Expression Generation with …

200 papers

In human-centric content generation, pre-trained text-to-image models struggle to produce the portrait images users want, which retain the identity of individuals while exhibiting diverse expressions. This paper introduces our efforts…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Renshuai Liu, Bowen Ma, Wei Zhang, Zhipeng Hu, Changjie Fan, Tangjie Lv, Yu Ding, Xuan Cheng

Human facial images encode a rich spectrum of information, encompassing both stable identity-related traits and mutable attributes such as pose, expression, and emotion. While recent advances in image generation have enabled high-quality…

Computer Vision and Pattern Recognition · Computer Science 2025-08-26 Kazuaki Mishima, Antoni Bigata Casademunt, Stavros Petridis, Maja Pantic, Kenji Suzuki

Recent advances in generative modeling have enabled the generation of high-quality synthetic data that is applicable in a variety of domains, including face recognition. Here, state-of-the-art generative models typically rely on…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Darian Tomašević, Fadi Boutros, Chenhao Lin, Naser Damer, Vitomir Štruc, Peter Peer

Different forms of customized 2D avatars are widely used in gaming applications, virtual communication, education, and content creation. However, existing approaches often fail to capture fine-grained facial expressions and struggle to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-14 Hao Yu, Rupayan Mallick, Margrit Betke, Sarah Adel Bargal

In this paper, we propose the first diffusion-based face swapping framework, called DiffFace, which consists of training an ID-conditional DDPM, sampling with facial guidance, and target-preserving blending. Specifically, in the training…

Computer Vision and Pattern Recognition · Computer Science 2022-12-29 Kihong Kim, Yunho Kim, Seokju Cho, Junyoung Seo, Jisu Nam, Kychul Lee, Seungryong Kim, KwangHee Lee

Speech-driven 3D facial animation synthesis has been a challenging task both in industry and research. Recent methods mostly focus on deterministic deep learning methods meaning that given a speech input, the output is always the same.…

Computer Vision and Pattern Recognition · Computer Science 2023-09-21 Stefan Stan, Kazi Injamamul Haque, Zerrin Yumak

In recent years, a plethora of identity-preserving adapters for personalized generation with diffusion models have been released. Their main disadvantage is that they are predominantly trained jointly with base diffusion models, which suffer…

Computer Vision and Pattern Recognition · Computer Science 2025-05-30 Sergey Karpukhin, Vadim Titov, Andrey Kuznetsov, Aibek Alanov

Suspect face generation remains a technical challenge in crime investigations. Traditional sketch-drawing workflows suffer from low efficiency and quality, while diffusion-based approaches still face intrinsic limitations on conditional…

Computer Vision and Pattern Recognition · Computer Science 2026-05-04 Weichen Liu, Yixin Yang, Changsheng Chen, Alex Kot

Expressions are fundamental to conveying human emotions. With the rapid advancement of AI-generated content (AIGC), realistic and expressive 3D facial animation has become increasingly crucial. Despite recent progress in speech-driven…

Computer Vision and Pattern Recognition · Computer Science 2025-10-30 Yuxiang Mao, Zhijie Zhang, Zhiheng Zhang, Jiawei Liu, Chen Zeng, Shihong Xia

Current face reenactment and swapping methods mainly rely on GAN frameworks, but recent focus has shifted to pre-trained diffusion models for their superior generation capabilities. However, training these models is resource-intensive, and…

Computer Vision and Pattern Recognition · Computer Science 2024-07-10 Yue Han, Junwei Zhu, Keke He, Xu Chen, Yanhao Ge, Wei Li, Xiangtai Li, Jiangning Zhang, Chengjie Wang, Yong Liu

Audio-driven emotional 3D facial animation encounters two significant challenges: (1) reliance on single-modal control signals (videos, text, or emotion labels) without leveraging their complementary strengths for comprehensive emotion…

Multimedia · Computer Science 2025-06-13 Kangwei Liu, Junwu Liu, Xiaowei Yi, Jinlin Guo, Yun Cao

Recent advances in generative diffusion models have enabled the previously unfeasible capability of generating 3D assets from a single input image or a text prompt. In this work, we aim to enhance the quality and functionality of these…

Computer Vision and Pattern Recognition · Computer Science 2024-04-03 Xiyi Chen, Marko Mihajlovic, Shaofei Wang, Sergey Prokudin, Siyu Tang

Face swapping aims to seamlessly transfer a source facial identity onto a target while preserving target attributes such as pose and expression. Diffusion models, known for their superior generative capabilities, have recently shown promise…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Dailan He, Xiahong Wang, Shulun Wang, Guanglu Song, Bingqi Ma, Hao Shao, Yu Liu, Hongsheng Li

While diffusion models have shown great potential in portrait generation, generating expressive, coherent, and controllable cinematic portrait videos remains a significant challenge. Existing intermediate signals for portrait generation,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-26 Junyi Wang, Yudong Guo, Boyang Guo, Shengming Yang, Juyong Zhang

Speech-driven 3D facial animation plays a key role in applications such as virtual avatars, gaming, and digital content creation. While existing methods have made significant progress in achieving accurate lip synchronization and generating…

Graphics · Computer Science 2025-07-16 Yifang Pan, Karan Singh, Luiz Gustavo Hafemann

Advanced diffusion-based Text-to-Image (T2I) models, such as the Stable Diffusion Model, have made significant progress in generating diverse and high-quality images using text prompts alone. However, when non-famous users require…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Yang Li, Songlin Yang, Wei Wang, Jing Dong

Diffusion models have recently shown strong progress in generative tasks, offering a more stable alternative to GAN-based approaches for makeup transfer. Existing methods often suffer from limited datasets, poor disentanglement between…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Qihe Pan, Yiming Wu, Xing Zhao, Liang Xie, Guodao Sun, Ronghua Liang

Talking face generation has historically struggled to produce head movements and natural facial expressions without guidance from additional reference videos. Recent developments in diffusion-based generative models allow for more realistic…

Computer Vision and Pattern Recognition · Computer Science 2023-08-01 Michał Stypułkowski, Konstantinos Vougioukas, Sen He, Maciej Zięba, Stavros Petridis, Maja Pantic

This technical report presents a diffusion model based framework for face swapping between two portrait images. The basic framework consists of three components, i.e., IP-Adapter, ControlNet, and Stable Diffusion's inpainting pipeline, for…

Computer Vision and Pattern Recognition · Computer Science 2024-05-30 Feifei Wang

Portrait animation aims to generate photo-realistic videos from a single source image by reenacting the expression and pose from a driving video. While early methods relied on 3D morphable models or feature warping techniques, they often…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Mallikarjun B. R., Fei Yin, Vikram Voleti, Nikita Drobyshev, Maksim Lapin, Aaryaman Vasishta, Varun Jampani