Related papers: Motion Prediction Using Temporal Inception Module

Development of Human Motion Prediction Strategy using Inception Residual Block

Human Motion Prediction is a crucial task in computer vision and robotics. It has versatile application potentials such as in the area of human-robot interactions, human action tracking for airport security systems, autonomous car…

Artificial Intelligence · Computer Science 2021-08-10 Shekhar Gupta, Gaurav Kumar Yadav, G. C. Nandi

TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation

Human-human motion generation is essential for understanding humans as social beings. Current methods fall into two main categories: single-person-based methods and separate modeling-based methods. To delve into this field, we abstract the…

Computer Vision and Pattern Recognition · Computer Science 2026-03-11 Yabiao Wang, Shuo Wang, Jiangning Zhang, Ke Fan, Jiafu Wu, Zhucun Xue, Yong Liu

Improving Human Motion Prediction Through Continual Learning

Human motion prediction is an essential component for enabling closer human-robot collaboration. The task of accurately predicting human motion is non-trivial. It is compounded by the variability of human motion, both at a skeletal level…

Robotics · Computer Science 2021-07-02 Mohammad Samin Yasar, Tariq Iqbal

Mutual Information-Based Temporal Difference Learning for Human Pose Estimation in Video

Temporal modeling is crucial for multi-frame human pose estimation. Most existing methods directly employ optical flow or deformable convolution to predict full-spectrum motion fields, which might incur numerous irrelevant cues, such as a…

Computer Vision and Pattern Recognition · Computer Science 2023-05-09 Runyang Feng, Yixing Gao, Xueqing Ma, Tze Ho Elden Tse, Hyung Jin Chang

Learning Multiscale Correlations for Human Motion Prediction

In spite of the great progress in human motion prediction, it is still a challenging task to predict those aperiodic and complicated motions. We believe that to capture the correlations among human body components is the key to understand…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Honghong Zhou, Caili Guo, Hao Zhang, Yanjun Wang

Motion-driven Visual Tempo Learning for Video-based Action Recognition

Action visual tempo characterizes the dynamics and the temporal scale of an action, which is helpful to distinguish human actions that share high similarities in visual dynamics and appearance. Previous methods capture the visual tempo…

Computer Vision and Pattern Recognition · Computer Science 2022-07-13 Yuanzhong Liu, Junsong Yuan, Zhigang Tu

RHYTHM: Reasoning with Hierarchical Temporal Tokenization for Human Mobility

Predicting human mobility is inherently challenging due to complex long-range dependencies and multi-scale periodic behaviors. To address this, we introduce RHYTHM (Reasoning with Hierarchical Temporal Tokenization for Human Mobility), a…

Machine Learning · Computer Science 2026-02-25 Haoyu He, Haozheng Luo, Yan Chen, Qi R. Wang

Multi-Scale Incremental Modeling for Enhanced Human Motion Prediction in Human-Robot Collaboration

Accurate human motion prediction is crucial for safe human-robot collaboration but remains challenging due to the complexity of modeling intricate and variable human movements. This paper presents Parallel Multi-scale Incremental Prediction…

Robotics · Computer Science 2024-12-17 Juncheng Zou

TIM: A Time Interval Machine for Audio-Visual Action Recognition

Diverse actions give rise to rich audio-visual signals in long videos. Recent works showcase that the two modalities of audio and video exhibit different temporal extents of events and distinct labels. We address the interplay between the…

Computer Vision and Pattern Recognition · Computer Science 2024-04-10 Jacob Chalk, Jaesung Huh, Evangelos Kazakos, Andrew Zisserman, Dima Damen

Efficient Temporal Tokenization for Mobility Prediction with Large Language Models

We introduce RHYTHM (Reasoning with Hierarchical Temporal Tokenization for Human Mobility), a framework that leverages large language models (LLMs) as spatio-temporal predictors and trajectory reasoners. RHYTHM partitions trajectories into…

Computation and Language · Computer Science 2025-10-01 Haoyu He, Haozheng Luo, Yan Chen, Qi R. Wang

CTM: Collaborative Temporal Modeling for Action Recognition

With the rapid development of digital multimedia, video understanding has become an important field. For action recognition, temporal dimension plays an important role, and this is quite different from image recognition. In order to learn…

Computer Vision and Pattern Recognition · Computer Science 2020-02-11 Qian Liu, Tao Wang, Jie Liu, Yang Guan, Qi Bu, Longfei Yang

AToM: Adaptive Theory-of-Mind-Based Human Motion Prediction in Long-Term Human-Robot Interactions

Humans learn from observations and experiences to adjust their behaviours towards better performance. Interacting with such dynamic humans is challenging, as the robot needs to predict the humans accurately for safe and efficient…

Robotics · Computer Science 2025-02-13 Yuwen Liao, Muqing Cao, Xinhang Xu, Lihua Xie

Representing motion as a sequence of latent primitives, a flexible approach for human motion modelling

We propose a new representation of human body motion which encodes a full motion in a sequence of latent motion primitives. Recently, task generic motion priors have been introduced and propose a coherent representation of human motion…

Computer Vision and Pattern Recognition · Computer Science 2022-09-02 Mathieu Marsot, Stefanie Wuhrer, Jean-Sebastien Franco, Anne Hélène Olivier

Uniformly Accelerated Motion Model for Inter Prediction

Inter prediction is a key technology to reduce the temporal redundancy in video coding. In natural videos, there are usually multiple moving objects with variable velocity, resulting in complex motion fields that are difficult to represent…

Image and Video Processing · Electrical Eng. & Systems 2024-07-23 Zhuoyuan Li, Yao Li, Chuanbo Tang, Li Li, Dong Liu, Feng Wu

Temporal-Spatial Mapping for Action Recognition

Deep learning models have enjoyed great success for image related computer vision tasks like image classification and object detection. For video related tasks like human action recognition, however, the advancements are not as significant…

Computer Vision and Pattern Recognition · Computer Science 2018-09-12 Xiaolin Song, Cuiling Lan, Wenjun Zeng, Junliang Xing, Jingyu Yang, Xiaoyan Sun

A Data-Efficient Approach for Long-Term Human Motion Prediction Using Maps of Dynamics

Human motion prediction is essential for the safe and smooth operation of mobile service robots and intelligent vehicles around people. Commonly used neural network-based approaches often require large amounts of complete trajectories to…

Robotics · Computer Science 2023-06-07 Yufei Zhu, Andrey Rudenko, Tomasz P. Kucner, Achim J. Lilienthal, Martin Magnusson

Motion Prediction via Joint Dependency Modeling in Phase Space

Motion prediction is a classic problem in computer vision, which aims at forecasting future motion given the observed pose sequence. Various deep learning models have been proposed, achieving state-of-the-art performance on motion…

Computer Vision and Pattern Recognition · Computer Science 2022-01-10 Pengxiang Su, Zhenguang Liu, Shuang Wu, Lei Zhu, Yifang Yin, Xuanjing Shen

Human Motion Prediction via Pattern Completion in Latent Representation Space

Inspired by ideas in cognitive science, we propose a novel and general approach to solve human motion understanding via pattern completion on a learned latent representation space. Our model outperforms current state-of-the-art methods in…

Computer Vision and Pattern Recognition · Computer Science 2019-04-22 Yi Tian Xu, Yaqiao Li, David Meger

Temporal Pyramid Network for Pedestrian Trajectory Prediction with Multi-Supervision

Predicting human motion behavior in a crowd is important for many applications, ranging from the natural navigation of autonomous vehicles to intelligent security systems of video surveillance. All the previous works model and predict the…

Computer Vision and Pattern Recognition · Computer Science 2020-12-07 Rongqin Liang, Yuanman Li, Xia Li, Yi Tang, Jiantao Zhou, Wenbin Zou

Token Turing Machines

We propose Token Turing Machines (TTM), a sequential, autoregressive Transformer model with memory for real-world sequential visual understanding. Our model is inspired by the seminal Neural Turing Machine, and has an external memory…

Machine Learning · Computer Science 2023-04-14 Michael S. Ryoo, Keerthana Gopalakrishnan, Kumara Kahatapitiya, Ted Xiao, Kanishka Rao, Austin Stone, Yao Lu, Julian Ibarz, Anurag Arnab