Related papers: DiffusionAct: Controllable Diffusion Autoencoder f…

DisControlFace: Adding Disentangled Control to Diffusion Autoencoder for One-shot Explicit Facial Image Editing

In this work, we focus on exploring explicit fine-grained control of generative facial image editing, all while generating faithful facial appearances and consistent semantic details, which however, is quite challenging and has not been…

Computer Vision and Pattern Recognition · Computer Science 2024-07-25 Haozhe Jia , Yan Li , Hengfei Cui , Di Xu , Yuwang Wang , Tao Yu

DiffusionAvatars: Deferred Diffusion for High-fidelity 3D Head Avatars

DiffusionAvatars synthesizes a high-fidelity 3D head avatar of a person, offering intuitive control over both pose and expression. We propose a diffusion-based neural renderer that leverages generic 2D priors to produce compelling images of…

Computer Vision and Pattern Recognition · Computer Science 2024-04-18 Tobias Kirschstein , Simon Giebenhain , Matthias Nießner

DiffuseGAE: Controllable and High-fidelity Image Manipulation from Disentangled Representation

Diffusion probabilistic models (DPMs) have shown remarkable results on various image synthesis tasks such as text-to-image generation and image inpainting. However, compared to other generative methods like VAEs and GANs, DPMs lack a…

Computer Vision and Pattern Recognition · Computer Science 2023-07-13 Yipeng Leng , Qiangjuan Huang , Zhiyuan Wang , Yangyang Liu , Haoyu Zhang

One-shot Neural Face Reenactment via Finding Directions in GAN's Latent Space

In this paper, we present our framework for neural face/head reenactment whose goal is to transfer the 3D head orientation and expression of a target face to a source face. Previous methods focus on learning embedding networks for identity…

Computer Vision and Pattern Recognition · Computer Science 2024-02-07 Stella Bounareli , Christos Tzelepis , Vasileios Argyriou , Ioannis Patras , Georgios Tzimiropoulos

TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model

Recently, 2D speaking avatars have increasingly participated in everyday scenarios due to the fast development of facial animation techniques. However, most existing works neglect the explicit control of human bodies. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2024-10-15 Jiazhi Guan , Quanwei Yang , Kaisiyuan Wang , Hang Zhou , Shengyi He , Zhiliang Xu , Haocheng Feng , Errui Ding , Jingdong Wang , Hongtao Xie , Youjian Zhao , Ziwei Liu

HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces

In this paper, we present our method for neural face reenactment, called HyperReenact, that aims to generate realistic talking head images of a source identity, driven by a target facial pose. Existing state-of-the-art face reenactment…

Computer Vision and Pattern Recognition · Computer Science 2023-07-21 Stella Bounareli , Christos Tzelepis , Vasileios Argyriou , Ioannis Patras , Georgios Tzimiropoulos

FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers

Producing expressive facial animations from static images is a challenging task. Prior methods relying on explicit geometric priors (e.g., facial landmarks or 3DMM) often suffer from artifacts in cross reenactment and struggle to capture…

Computer Vision and Pattern Recognition · Computer Science 2025-07-18 Qiang Wang , Mengchao Wang , Fan Jiang , Yaqi Fan , Yonggang Qi , Mu Xu

Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control

Current face reenactment and swapping methods mainly rely on GAN frameworks, but recent focus has shifted to pre-trained diffusion models for their superior generation capabilities. However, training these models is resource-intensive, and…

Computer Vision and Pattern Recognition · Computer Science 2024-07-10 Yue Han , Junwei Zhu , Keke He , Xu Chen , Yanhao Ge , Wei Li , Xiangtai Li , Jiangning Zhang , Chengjie Wang , Yong Liu

Reproducing DragDiffusion: Interactive Point-Based Editing with Diffusion Models

DragDiffusion is a diffusion-based method for interactive point-based image editing that enables users to manipulate images by directly dragging selected points. The method claims that accurate spatial control can be achieved by optimizing…

Computer Vision and Pattern Recognition · Computer Science 2026-02-16 Ali Subhan , Ashir Raza

Unconstrained Facial Expression Transfer using Style-based Generator

Facial expression transfer and reenactment has been an important research problem given its applications in face editing, image manipulation, and fabricated videos generation. We present a novel method for image-based facial expression…

Computer Vision and Pattern Recognition · Computer Science 2019-12-16 Chao Yang , Ser-Nam Lim

ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation

Editing real facial images is a crucial task in computer vision with significant demand in various real-world applications. While GAN-based methods have showed potential in manipulating images especially when combined with CLIP, these…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Dongxu Yue , Qin Guo , Munan Ning , Jiaxi Cui , Yuesheng Zhu , Li Yuan

Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data

Combining neuroimaging datasets from multiple sites and scanners can help increase statistical power and thus provide greater insight into subtle neuroanatomical effects. However, site-specific effects pose a challenge by potentially…

Computer Vision and Pattern Recognition · Computer Science 2024-08-29 Ayodeji Ijishakin , Ana Lawry Aguila , Elizabeth Levitis , Ahmed Abdulaal , Andre Altmann , James Cole

Towards Consistent and Controllable Image Synthesis for Face Editing

Face editing methods, essential for tasks like virtual avatars, digital human synthesis and identity preservation, have traditionally been built upon GAN-based techniques, while recent focus has shifted to diffusion-based models due to…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Mengting Wei , Tuomas Varanka , Yante Li , Xingxun Jiang , Huai-Qian Khor , Guoying Zhao

Face Animation with an Attribute-Guided Diffusion Model

Face animation has achieved much progress in computer vision. However, prevailing GAN-based methods suffer from unnatural distortions and artifacts due to sophisticated motion deformation. In this paper, we propose a Face Animation…

Computer Vision and Pattern Recognition · Computer Science 2023-04-07 Bohan Zeng , Xuhui Liu , Sicheng Gao , Boyu Liu , Hong Li , Jianzhuang Liu , Baochang Zhang

AttDiff-GAN: A Hybrid Diffusion-GAN Framework for Facial Attribute Editing

Facial attribute editing aims to modify target attributes while preserving attribute-irrelevant content and overall image fidelity. Existing GAN-based methods provide favorable controllability, but often suffer from weak alignment between…

Computer Vision and Pattern Recognition · Computer Science 2026-04-24 Wenmin Huang , Weiqi Luo , Xiaochun Cao , Jiwu Huang

ReenactGAN: Learning to Reenact Faces via Boundary Transfer

We present a novel learning-based framework for face reenactment. The proposed method, known as ReenactGAN, is capable of transferring facial movements and expressions from monocular video input of an arbitrary person to a target person.…

Computer Vision and Pattern Recognition · Computer Science 2018-07-31 Wayne Wu , Yunxuan Zhang , Cheng Li , Chen Qian , Chen Change Loy

FACEGAN: Facial Attribute Controllable rEenactment GAN

The face reenactment is a popular facial animation method where the person's identity is taken from the source image and the facial motion from the driving image. Recent works have demonstrated high quality results by combining the facial…

Computer Vision and Pattern Recognition · Computer Science 2020-11-10 Soumya Tripathy , Juho Kannala , Esa Rahtu

ActGAN: Flexible and Efficient One-shot Face Reenactment

This paper introduces ActGAN - a novel end-to-end generative adversarial network (GAN) for one-shot face reenactment. Given two images, the goal is to transfer the facial expression of the source actor onto a target person in a…

Computer Vision and Pattern Recognition · Computer Science 2020-04-01 Ivan Kosarevych , Marian Petruk , Markian Kostiv , Orest Kupyn , Mykola Maksymenko , Volodymyr Budzan

DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing

Accurate and controllable image editing is a challenging task that has attracted significant attention recently. Notably, DragGAN is an interactive point-based image editing framework that achieves impressive editing results with…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Yujun Shi , Chuhui Xue , Jun Hao Liew , Jiachun Pan , Hanshu Yan , Wenqing Zhang , Vincent Y. F. Tan , Song Bai

FactorPortrait: Controllable Portrait Animation via Disentangled Expression, Pose, and Viewpoint

We introduce FactorPortrait, a video diffusion method for controllable portrait animation that enables lifelike synthesis from disentangled control signals of facial expressions, head movement, and camera viewpoints. Given a single portrait…

Computer Vision and Pattern Recognition · Computer Science 2025-12-15 Jiapeng Tang , Kai Li , Chengxiang Yin , Liuhao Ge , Fei Jiang , Jiu Xu , Matthias Nießner , Christian Häne , Timur Bagautdinov , Egor Zakharov , Peihong Guo