Related papers: A Backbone Replaceable Fine-tuning Framework for S…

Coherent Loss: A Generic Framework for Stable Video Segmentation

Video segmentation approaches are of great importance for numerous vision tasks especially in video manipulation for entertainment. Due to the challenges associated with acquiring high-quality per-frame segmentation annotations and large…

Computer Vision and Pattern Recognition · Computer Science 2020-10-27 Mingyang Qian , Yi Fu , Xiao Tan , Yingying Li , Jinqing Qi , Huchuan Lu , Shilei Wen , Errui Ding

Towards Highly Accurate and Stable Face Alignment for High-Resolution Videos

In recent years, heatmap regression based models have shown their effectiveness in face alignment and pose estimation. However, Conventional Heatmap Regression (CHR) is not accurate nor stable when dealing with high-resolution facial…

Computer Vision and Pattern Recognition · Computer Science 2018-11-26 Ying Tai , Yicong Liang , Xiaoming Liu , Lei Duan , Jilin Li , Chengjie Wang , Feiyue Huang , Yu Chen

StableFace: Analyzing and Improving Motion Stability for Talking Face Generation

While previous speech-driven talking face generation methods have made significant progress in improving the visual quality and lip-sync quality of the synthesized videos, they pay less attention to lip motion jitters which greatly…

Computer Vision and Pattern Recognition · Computer Science 2022-08-31 Jun Ling , Xu Tan , Liyang Chen , Runnan Li , Yuchao Zhang , Sheng Zhao , Li Song

FAB: A Robust Facial Landmark Detection Framework for Motion-Blurred Videos

Recently, facial landmark detection algorithms have achieved remarkable performance on static images. However, these algorithms are neither accurate nor stable in motion-blurred videos. The missing of structure information makes it…

Computer Vision and Pattern Recognition · Computer Science 2019-10-29 Keqiang Sun , Wayne Wu , Tinghao Liu , Shuo Yang , Quan Wang , Qiang Zhou , Zuochang Ye , Chen Qian

On Improving Temporal Consistency for Online Face Liveness Detection

In this paper, we focus on improving the online face liveness detection system to enhance the security of the downstream face recognition system. Most of the existing frame-based methods are suffering from the prediction inconsistency…

Computer Vision and Pattern Recognition · Computer Science 2020-06-15 Xiang Xu , Yuanjun Xiong , Wei Xia

MOS: A Low Latency and Lightweight Framework for Face Detection, Landmark Localization, and Head Pose Estimation

With the emergence of service robots and surveillance cameras, dynamic face recognition (DFR) in wild has received much attention in recent years. Face detection and head pose estimation are two important steps for DFR. Very often, the pose…

Computer Vision and Pattern Recognition · Computer Science 2021-11-02 Yepeng Liu , Zaiwang Gu , Shenghua Gao , Dong Wang , Yusheng Zeng , Jun Cheng

PoseFace: Pose-Invariant Features and Pose-Adaptive Loss for Face Recognition

Despite the great success achieved by deep learning methods in face recognition, severe performance drops are observed for large pose variations in unconstrained environments (e.g., in cases of surveillance and photo-tagging). To address…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Qiang Meng , Xiaqing Xu , Xiaobo Wang , Yang Qian , Yunxiao Qin , Zezheng Wang , Chenxu Zhao , Feng Zhou , Zhen Lei

SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos

When analyzing human motion videos, the output jitters from existing pose estimators are highly-unbalanced with varied estimation errors across frames. Most frames in a video are relatively easy to estimate and only suffer from slight…

Computer Vision and Pattern Recognition · Computer Science 2022-07-22 Ailing Zeng , Lei Yang , Xuan Ju , Jiefeng Li , Jianyi Wang , Qiang Xu

Adaptive Wing Loss for Robust Face Alignment via Heatmap Regression

Heatmap regression with a deep network has become one of the mainstream approaches to localize facial landmarks. However, the loss function for heatmap regression is rarely studied. In this paper, we analyze the ideal loss function…

Computer Vision and Pattern Recognition · Computer Science 2020-05-20 Xinyao Wang , Liefeng Bo , Li Fuxin

Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos

Recent progress in blind face restoration has resulted in producing high-quality restored results for static images. However, efforts to extend these advancements to video scenarios have been minimal, partly because of the absence of…

Computer Vision and Pattern Recognition · Computer Science 2024-10-16 Zhouxia Wang , Jiawei Zhang , Xintao Wang , Tianshui Chen , Ying Shan , Wenping Wang , Ping Luo

StableAnimator++: Overcoming Pose Misalignment and Face Distortion for Human Image Animation

Current diffusion models for human image animation often struggle to maintain identity (ID) consistency, especially when the reference image and driving video differ significantly in body size or position. We introduce StableAnimator++, the…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Shuyuan Tu , Zhen Xing , Xintong Han , Zhi-Qi Cheng , Qi Dai , Chong Luo , Zuxuan Wu , Yu-Gang Jiang

Faster Than Real-time Facial Alignment: A 3D Spatial Transformer Network Approach in Unconstrained Poses

Facial alignment involves finding a set of landmark points on an image with a known semantic meaning. However, this semantic meaning of landmark points is often lost in 2D approaches where landmarks are either moved to visible boundaries or…

Computer Vision and Pattern Recognition · Computer Science 2017-09-11 Chandrasekhar Bhagavatula , Chenchen Zhu , Khoa Luu , Marios Savvides

2D Wasserstein Loss for Robust Facial Landmark Detection

The recent performance of facial landmark detection has been significantly improved by using deep Convolutional Neural Networks (CNNs), especially the Heatmap Regression Models (HRMs). Although their performance on common benchmark datasets…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Yongzhe Yan , Stefan Duffner , Priyanka Phutane , Anthony Berthelier , Christophe Blanc , Christophe Garcia , Thierry Chateau

Wing Loss for Robust Facial Landmark Localisation with Convolutional Neural Networks

We present a new loss function, namely Wing loss, for robust facial landmark localisation with Convolutional Neural Networks (CNNs). We first compare and analyse different loss functions including L2, L1 and smooth L1. The analysis of these…

Computer Vision and Pattern Recognition · Computer Science 2018-10-25 Zhen-Hua Feng , Josef Kittler , Muhammad Awais , Patrik Huber , Xiao-Jun Wu

The Blessing and the Curse of the Noise behind Facial Landmark Annotations

The evolving algorithms for 2D facial landmark detection empower people to recognize faces, analyze facial expressions, etc. However, existing methods still encounter problems of unstable facial landmarks when applied to videos. Because…

Computer Vision and Pattern Recognition · Computer Science 2020-07-31 Xiaoyu Xiang , Yang Cheng , Shaoyuan Xu , Qian Lin , Jan Allebach

Sparse to Dense Motion Transfer for Face Image Animation

Face image animation from a single image has achieved remarkable progress. However, it remains challenging when only sparse landmarks are available as the driving signal. Given a source face image and a sequence of sparse face landmarks,…

Computer Vision and Pattern Recognition · Computer Science 2021-09-06 Ruiqi Zhao , Tianyi Wu , Guodong Guo

Laplace Landmark Localization

Landmark localization in images and videos is a classic problem solved in various ways. Nowadays, with deep networks prevailing throughout machine learning, there are revamped interests in pushing facial landmark detection technologies to…

Computer Vision and Pattern Recognition · Computer Science 2019-08-16 Joseph P Robinson , Yuncheng Li , Ning Zhang , Yun Fu , and Sergey Tulyakov

Joint-Motion Mutual Learning for Pose Estimation in Videos

Human pose estimation in videos has long been a compelling yet challenging task within the realm of computer vision. Nevertheless, this task remains difficult because of the complex video scenes, such as video defocus and self-occlusion.…

Computer Vision and Pattern Recognition · Computer Science 2024-08-06 Sifan Wu , Haipeng Chen , Yifang Yin , Sihao Hu , Runyang Feng , Yingying Jiao , Ziqi Yang , Zhenguang Liu

Identity-Preserving Pose-Guided Character Animation via Facial Landmarks Transformation

Creating realistic pose-guided image-to-video character animations while preserving facial identity remains challenging, especially in complex and dynamic scenarios such as dancing, where precise identity consistency is crucial. Existing…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Lianrui Mu , Xingze Zhou , Wenjie Zheng , Jiangnan Ye , Haoji Hu

LOTR: Face Landmark Localization Using Localization Transformer

This paper presents a novel Transformer-based facial landmark localization network named Localization Transformer (LOTR). The proposed framework is a direct coordinate regression approach leveraging a Transformer network to better utilize…

Computer Vision and Pattern Recognition · Computer Science 2022-10-06 Ukrit Watchareeruetai , Benjaphan Sommana , Sanjana Jain , Pavit Noinongyao , Ankush Ganguly , Aubin Samacoits , Samuel W. F. Earp , Nakarin Sritrakool