Related papers: A Recurrent Encoder-Decoder Network for Sequential…

RED-Net: A Recurrent Encoder-Decoder Network for Video-based Face Alignment

We propose a novel method for real-time face alignment in videos based on a recurrent encoder-decoder network model. Our proposed model predicts 2D facial point heat maps regularized by both detection and regression loss, while uniquely…

Computer Vision and Pattern Recognition · Computer Science 2018-01-19 Xi Peng , Rogerio S. Feris , Xiaoyu Wang , Dimitris N. Metaxas

Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-expressions

Recently, the recognition task of spontaneous facial micro-expressions has attracted much attention with its various real-world applications. Plenty of handcrafted or learned features have been employed for a variety of classifiers and…

Computer Vision and Pattern Recognition · Computer Science 2019-01-16 Zhaoqiang Xia , Xiaopeng Hong , Xingyu Gao , Xiaoyi Feng , Guoying Zhao

Deep Recurrent Regression for Facial Landmark Detection

We propose a novel end-to-end deep architecture for face landmark detection, based on a deep convolutional and deconvolutional network followed by carefully designed recurrent network structures. The pipeline of this architecture consists…

Computer Vision and Pattern Recognition · Computer Science 2016-11-01 Hanjiang Lai , Shengtao Xiao , Yan Pan , Zhen Cui , Jiashi Feng , Chunyan Xu , Jian Yin , Shuicheng Yan

Recurrent Video Masked Autoencoders

We present Recurrent Video Masked-Autoencoders (RVM): a novel approach to video representation learning that leverages recurrent computation to model the temporal structure of video data. RVM couples an asymmetric masking objective with a…

Computer Vision and Pattern Recognition · Computer Science 2026-04-22 Daniel Zoran , Nikhil Parthasarathy , Yi Yang , Drew A Hudson , Joao Carreira , Andrew Zisserman

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

Models based on deep convolutional networks have dominated recent image interpretation tasks; we investigate whether models which are also recurrent, or "temporally deep", are effective for tasks involving sequences, visual and otherwise.…

Computer Vision and Pattern Recognition · Computer Science 2016-06-02 Jeff Donahue , Lisa Anne Hendricks , Marcus Rohrbach , Subhashini Venugopalan , Sergio Guadarrama , Kate Saenko , Trevor Darrell

Weakly-supervised Disentangling with Recurrent Transformations for 3D View Synthesis

An important problem for both graphics and vision is to synthesize novel views of a 3D object from a single image. This is particularly challenging due to the partial observability inherent in projecting a 3D object onto the image space,…

Machine Learning · Computer Science 2016-01-06 Jimei Yang , Scott Reed , Ming-Hsuan Yang , Honglak Lee

Disentangling Features in 3D Face Shapes for Joint Face Reconstruction and Recognition

This paper proposes an encoder-decoder network to disentangle shape features during 3D face reconstruction from single 2D images, such that the tasks of reconstructing accurate 3D face shapes and learning discriminative shape features for…

Computer Vision and Pattern Recognition · Computer Science 2018-04-02 Feng Liu , Ronghang Zhu , Dan Zeng , Qijun Zhao , Xiaoming Liu

Learning Spectral-Spatial-Temporal Features via a Recurrent Convolutional Neural Network for Change Detection in Multispectral Imagery

Change detection is one of the central problems in earth observation and was extensively investigated over recent decades. In this paper, we propose a novel recurrent convolutional neural network (ReCNN) architecture, which is trained to…

Computer Vision and Pattern Recognition · Computer Science 2019-03-27 Lichao Mou , Lorenzo Bruzzone , Xiao Xiang Zhu

Recurrent Regression for Face Recognition

To address the sequential changes of images including poses, in this paper we propose a recurrent regression neural network(RRNN) framework to unify two classic tasks of cross-pose face recognition on still images and video-based face…

Computer Vision and Pattern Recognition · Computer Science 2016-07-26 Yang Li , Wenming Zheng , Zhen Cui

Video Description using Bidirectional Recurrent Neural Networks

Although traditionally used in the machine translation field, the encoder-decoder framework has been recently applied for the generation of video and image descriptions. The combination of Convolutional and Recurrent Neural Networks in…

Computer Vision and Pattern Recognition · Computer Science 2016-12-13 Álvaro Peris , Marc Bolaños , Petia Radeva , Francisco Casacuberta

Clustering and Recognition of Spatiotemporal Features through Interpretable Embedding of Sequence to Sequence Recurrent Neural Networks

Encoder-decoder recurrent neural network models (RNN Seq2Seq) have achieved great success in ubiquitous areas of computation and applications. It was shown to be successful in modeling data with both temporal and spatial dependencies for…

Machine Learning · Computer Science 2020-02-03 Kun Su , Eli Shlizerman

Recurrent Network Models for Human Dynamics

We propose the Encoder-Recurrent-Decoder (ERD) model for recognition and prediction of human body pose in videos and motion capture. The ERD model is a recurrent neural network that incorporates nonlinear encoder and decoder networks before…

Computer Vision and Pattern Recognition · Computer Science 2015-09-30 Katerina Fragkiadaki , Sergey Levine , Panna Felsen , Jitendra Malik

Improved Face Detection and Alignment using Cascade Deep Convolutional Network

Real-world face detection and alignment demand an advanced discriminative model to address challenges by pose, lighting and expression. Illuminated by the deep learning algorithm, some convolutional neural networks based face detection and…

Computer Vision and Pattern Recognition · Computer Science 2017-08-01 Weilin Cong , Sanyuan Zhao , Hui Tian , Jianbing Shen

Head2Head++: Deep Facial Attributes Re-Targeting

Facial video re-targeting is a challenging problem aiming to modify the facial attributes of a target subject in a seamless manner by a driving monocular sequence. We leverage the 3D geometry of faces and Generative Adversarial Networks…

Computer Vision and Pattern Recognition · Computer Science 2021-09-29 Michail Christos Doukas , Mohammad Rami Koujan , Viktoriia Sharmanska , Anastasios Roussos

Learning Blind Video Temporal Consistency

Applying image processing algorithms independently to each frame of a video often leads to undesired inconsistent results over time. Developing temporally consistent video-based extensions, however, requires domain knowledge for individual…

Computer Vision and Pattern Recognition · Computer Science 2018-08-02 Wei-Sheng Lai , Jia-Bin Huang , Oliver Wang , Eli Shechtman , Ersin Yumer , Ming-Hsuan Yang

Head2Head: Video-based Neural Head Synthesis

In this paper, we propose a novel machine learning architecture for facial reenactment. In particular, contrary to the model-based approaches or recent frame-based methods that use Deep Convolutional Neural Networks (DCNNs) to generate…

Computer Vision and Pattern Recognition · Computer Science 2020-05-25 Mohammad Rami Koujan , Michail Christos Doukas , Anastasios Roussos , Stefanos Zafeiriou

Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach

In this paper, we present an end-to-end approach to simultaneously learn spatio-temporal features and corresponding similarity metric for video-based person re-identification. Given the video sequence of a person, features from each frame…

Computer Vision and Pattern Recognition · Computer Science 2016-06-14 Lin Wu , Chunhua Shen , Anton van den Hengel

MoFA: Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction

In this work we propose a novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image. To this end, we combine a convolutional…

Computer Vision and Pattern Recognition · Computer Science 2017-12-11 Ayush Tewari , Michael Zollhöfer , Hyeongwoo Kim , Pablo Garrido , Florian Bernard , Patrick Pérez , Christian Theobalt

Continuous Emotion Recognition with Spatiotemporal Convolutional Neural Networks

Facial expressions are one of the most powerful ways for depicting specific patterns in human behavior and describing human emotional state. Despite the impressive advances of affective computing over the last decade, automatic video-based…

Computer Vision and Pattern Recognition · Computer Science 2021-01-18 Thomas Teixeira , Eric Granger , Alessandro Lameiras Koerich

Coarse to Fine Multi-Resolution Temporal Convolutional Network

Temporal convolutional networks (TCNs) are a commonly used architecture for temporal video segmentation. TCNs however, tend to suffer from over-segmentation errors and require additional refinement modules to ensure smoothness and temporal…

Computer Vision and Pattern Recognition · Computer Science 2021-05-25 Dipika Singhania , Rahul Rahaman , Angela Yao