Related papers: Recurrent Network Models for Human Dynamics

RED-Net: A Recurrent Encoder-Decoder Network for Video-based Face Alignment

We propose a novel method for real-time face alignment in videos based on a recurrent encoder-decoder network model. Our proposed model predicts 2D facial point heat maps regularized by both detection and regression loss, while uniquely…

Computer Vision and Pattern Recognition · Computer Science · 2018-01-19 · Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas

PVRED: A Position-Velocity Recurrent Encoder-Decoder for Human Motion Prediction

Human motion prediction, which aims to predict future human poses given past poses, has recently seen increased interest. Many recent approaches are based on Recurrent Neural Networks (RNN) which model human poses with exponential maps.…

Computer Vision and Pattern Recognition · Computer Science · 2021-06-15 · Hongsong Wang, Jian Dong, Bin Cheng, Jiashi Feng

A Recurrent Encoder-Decoder Network for Sequential Face Alignment

We propose a novel recurrent encoder-decoder network model for real-time video-based face alignment. Our proposed model predicts 2D facial point maps regularized by a regression loss, while uniquely exploiting recurrent learning at both…

Computer Vision and Pattern Recognition · Computer Science · 2016-08-24 · Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas

Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers

We propose to leverage Transformer architectures for non-autoregressive human motion prediction. Our approach decodes elements in parallel from a query sequence, instead of conditioning on previous predictions such as in state-of-the-art…

Computer Vision and Pattern Recognition · Computer Science · 2021-09-17 · Angel Martínez-González, Michael Villamizar, Jean-Marc Odobez

Deep representation learning for human motion prediction and classification

Generative models of 3D human motion are often restricted to a small number of activities and can therefore not generalize well to novel movements or applications. In this work we propose a deep learning framework for human motion capture…

Computer Vision and Pattern Recognition · Computer Science · 2017-04-14 · Judith Bütepage, Michael Black, Danica Kragic, Hedvig Kjellström

A Temporal Densely Connected Recurrent Network for Event-based Human Pose Estimation

The event camera is an emerging bio-inspired vision sensor that reports per-pixel brightness changes asynchronously. It holds the notable advantages of high dynamic range, high-speed response, and a low power budget, which enable it to best capture…

Computer Vision and Pattern Recognition · Computer Science · 2023-04-07 · Zhanpeng Shao, Wen Zhou, Wuzhen Wang, Jianyu Yang, Youfu Li

Enhanced 3D Human Pose Estimation from Videos by using Attention-Based Neural Network with Dilated Convolutions

The attention mechanism provides a sequential prediction framework for learning spatial models with enhanced implicit temporal consistency. In this work, we show a systematic design (from 2D to 3D) for how conventional networks and other…

Computer Vision and Pattern Recognition · Computer Science · 2021-03-05 · Ruixu Liu, Ju Shen, He Wang, Chen Chen, Sen-ching Cheung, Vijayan K. Asari

Learning Trajectory Dependencies for Human Motion Prediction

Human motion prediction, i.e., forecasting future body poses given an observed pose sequence, has typically been tackled with recurrent neural networks (RNNs). However, as evidenced by prior work, the resulting RNN models suffer from prediction…

Computer Vision and Pattern Recognition · Computer Science · 2020-07-08 · Wei Mao, Miaomiao Liu, Mathieu Salzmann, Hongdong Li

Multitask Non-Autoregressive Model for Human Motion Prediction

Human motion prediction, which aims at predicting future human skeletons given the past ones, is a typical sequence-to-sequence problem. Therefore, extensive efforts have been devoted to exploring different RNN-based encoder-decoder…

Computer Vision and Pattern Recognition · Computer Science · 2021-02-24 · Bin Li, Jian Tian, Zhongfei Zhang, Hailin Feng, Xi Li

Neural Rendering of Humans in Novel View and Pose from Monocular Video

We introduce a new method that generates photo-realistic humans under novel views and poses given a monocular video as input. Despite the significant progress recently on this topic, with several methods exploring shared canonical neural…

Computer Vision and Pattern Recognition · Computer Science · 2023-04-21 · Tiantian Wang, Nikolaos Sarafianos, Ming-Hsuan Yang, Tony Tung

Recurrent Human Pose Estimation

We propose a novel ConvNet model for predicting 2D human body poses in an image. The model regresses a heatmap representation for each body keypoint, and is able to learn and represent both the part appearances and the context of the part…

Computer Vision and Pattern Recognition · Computer Science · 2017-08-08 · Vasileios Belagiannis, Andrew Zisserman

QuaterNet: A Quaternion-based Recurrent Model for Human Motion

Deep learning for predicting or generating 3D human pose sequences is an active research area. Previous work regresses either joint rotations or joint positions. The former strategy is prone to error accumulation along the kinematic chain,…

Computer Vision and Pattern Recognition · Computer Science · 2018-08-02 · Dario Pavllo, David Grangier, Michael Auli

MSR-GCN: Multi-Scale Residual Graph Convolution Networks for Human Motion Prediction

Human motion prediction is a challenging task due to the stochasticity and aperiodicity of future poses. Recently, graph convolutional networks have proven very effective at learning dynamic relations among pose joints, which is…

Computer Vision and Pattern Recognition · Computer Science · 2021-08-18 · Lingwei Dang, Yongwei Nie, Chengjiang Long, Qing Zhang, Guiqing Li

On human motion prediction using recurrent neural networks

Human motion modelling is a classical problem at the intersection of graphics and computer vision, with applications spanning human-computer interaction, motion synthesis, and motion prediction for virtual and augmented reality. Following…

Computer Vision and Pattern Recognition · Computer Science · 2017-05-09 · Julieta Martinez, Michael J. Black, Javier Romero

Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos

Deep ConvNets have been shown to be effective for the task of human pose estimation from single images. However, several challenging issues arise in the video-based case such as self-occlusion, motion blur, and uncommon poses with few or no…

Computer Vision and Pattern Recognition · Computer Science · 2017-04-03 · Jie Song, Limin Wang, Luc Van Gool, Otmar Hilliges

STPOTR: Simultaneous Human Trajectory and Pose Prediction Using a Non-Autoregressive Transformer for Robot Following Ahead

In this paper, we develop a neural network model to predict future human motion from an observed human motion history. We propose a non-autoregressive transformer architecture to leverage its parallel nature for easier training and fast,…

Robotics · Computer Science · 2025-01-20 · Mohammad Mahdavian, Payam Nikdel, Mahdi TaherAhmadi, Mo Chen

Pose-Aided Video-based Person Re-Identification via Recurrent Graph Convolutional Network

Existing methods for video-based person re-identification (ReID) mainly learn the appearance feature of a given pedestrian via a feature extractor and a feature aggregator. However, the appearance models fail when different…

Computer Vision and Pattern Recognition · Computer Science · 2022-09-26 · Honghu Pan, Qiao Liu, Yongyong Chen, Yunqi He, Yuan Zheng, Feng Zheng, Zhenyu He

Predicting Long-Term Skeletal Motions by a Spatio-Temporal Hierarchical Recurrent Network

The primary goal of skeletal motion prediction is to generate future motion by observing a sequence of 3D skeletons. A key challenge in motion prediction is the fact that a motion can often be performed in several different ways, with each…

Computer Vision and Pattern Recognition · Computer Science · 2020-02-18 · Junfeng Hu, Zhencheng Fan, Jun Liao, Li Liu

Recurrent Neural Network for Learning DenseDepth and Ego-Motion from Video

Learning-based, single-view depth estimation often generalizes poorly to unseen datasets. While learning-based, two-frame depth estimation solves this problem to some extent by learning to match features across frames, it performs poorly at…

Computer Vision and Pattern Recognition · Computer Science · 2018-05-18 · Rui Wang, Jan-Michael Frahm, Stephen M. Pizer

Video Description using Bidirectional Recurrent Neural Networks

Although traditionally used in the machine translation field, the encoder-decoder framework has recently been applied to the generation of video and image descriptions. The combination of Convolutional and Recurrent Neural Networks in…

Computer Vision and Pattern Recognition · Computer Science · 2016-12-13 · Álvaro Peris, Marc Bolaños, Petia Radeva, Francisco Casacuberta