Related papers: Spatial-Temporal Alignment Network for Action Reco…

Spatial-Temporal Alignment Network for Action Recognition

This paper studies introducing viewpoint invariant feature representations in existing action recognition architecture. Despite significant progress in action recognition, efficiently handling geometric variations in large-scale datasets…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Jinhui Ye , Junwei Liang

A Survey on Deep Learning-based Spatio-temporal Action Detection

Spatio-temporal action detection (STAD) aims to classify the actions present in a video and localize them in space and time. It has become a particularly active area of research in computer vision because of its explosively emerging…

Computer Vision and Pattern Recognition · Computer Science 2023-08-04 Peng Wang , Fanwei Zeng , Yuntao Qian

Spatial-Temporal Attention Network for Open-Set Fine-Grained Image Recognition

Triggered by the success of transformers in various visual tasks, the spatial self-attention mechanism has recently attracted more and more attention in the computer vision community. However, we empirically found that a typical vision…

Computer Vision and Pattern Recognition · Computer Science 2022-11-28 Jiayin Sun , Hong Wang , Qiulei Dong

STH: Spatio-Temporal Hybrid Convolution for Efficient Action Recognition

Effective and Efficient spatio-temporal modeling is essential for action recognition. Existing methods suffer from the trade-off between model performance and model complexity. In this paper, we present a novel Spatio-Temporal Hybrid…

Computer Vision and Pattern Recognition · Computer Science 2020-03-19 Xu Li , Jingwen Wang , Lin Ma , Kaihao Zhang , Fengzong Lian , Zhanhui Kang , Jinjun Wang

STAN: Spatio-Temporal Attention Network for Next Location Recommendation

The next location recommendation is at the core of various location-based applications. Current state-of-the-art models have attempted to solve spatial sparsity with hierarchical gridding and model temporal relation with explicit time…

Information Retrieval · Computer Science 2021-02-09 Yingtao Luo , Qiang Liu , Zhaocheng Liu

Hierarchical Attention Network for Action Recognition in Videos

Understanding human actions in wild videos is an important task with a broad range of applications. In this paper we propose a novel approach named Hierarchical Attention Network (HAN), which enables to incorporate static spatial…

Computer Vision and Pattern Recognition · Computer Science 2016-07-22 Yilin Wang , Suhang Wang , Jiliang Tang , Neil O'Hare , Yi Chang , Baoxin Li

A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action Detection

This paper presents a novel spatiotemporal transformer network that introduces several original components to detect actions in untrimmed videos. First, the multi-feature selective semantic attention model calculates the correlations…

Computer Vision and Pattern Recognition · Computer Science 2024-05-15 Matthew Korban , Peter Youngs , Scott T. Acton

STAN: Spatio-Temporal Adversarial Networks for Abnormal Event Detection

In this paper, we propose a novel abnormal event detection method with spatio-temporal adversarial networks (STAN). We devise a spatio-temporal generator which synthesizes an inter-frame by considering spatio-temporal characteristics with…

Computer Vision and Pattern Recognition · Computer Science 2018-04-24 Sangmin Lee , Hak Gu Kim , Yong Man Ro

STAR-Net: Action Recognition using Spatio-Temporal Activation Reprojection

While depth cameras and inertial sensors have been frequently leveraged for human action recognition, these sensing modalities are impractical in many scenarios where cost or environmental constraints prohibit their use. As such, there has…

Computer Vision and Pattern Recognition · Computer Science 2019-02-27 William McNally , Alexander Wong , John McPhee

A Structured Model For Action Detection

A dominant paradigm for learning-based approaches in computer vision is training generic models, such as ResNet for image recognition, or I3D for video understanding, on large datasets and allowing them to discover the optimal…

Computer Vision and Pattern Recognition · Computer Science 2019-06-06 Yubo Zhang , Pavel Tokmakov , Martial Hebert , Cordelia Schmid

Spatio-Temporal Pyramid Graph Convolutions for Human Action Recognition and Postural Assessment

Recognition of human actions and associated interactions with objects and the environment is an important problem in computer vision due to its potential applications in a variety of domains. The most versatile methods can generalize to…

Computer Vision and Pattern Recognition · Computer Science 2019-12-10 Behnoosh Parsa , Athma Narayanan , Behzad Dariush

Video Test-Time Adaptation for Action Recognition

Although action recognition systems can achieve top performance when evaluated on in-distribution test points, they are vulnerable to unanticipated distribution shifts in test data. However, test-time adaptation of video action recognition…

Computer Vision and Pattern Recognition · Computer Science 2023-03-22 Wei Lin , Muhammad Jehanzeb Mirza , Mateusz Kozinski , Horst Possegger , Hilde Kuehne , Horst Bischof

Spatio-temporal Aware Non-negative Component Representation for Action Recognition

This paper presents a novel mid-level representation for action recognition, named spatio-temporal aware non-negative component representation (STANNCR). The proposed STANNCR is based on action component and incorporates the…

Computer Vision and Pattern Recognition · Computer Science 2016-08-30 Jianhong Wang , Tian Lan , Xu Zhang , Limin Luo

Learning Coupled Spatial-temporal Attention for Skeleton-based Action Recognition

In this paper, we propose a coupled spatial-temporal attention (CSTA) model for skeleton-based action recognition, which aims to figure out the most discriminative joints and frames in spatial and temporal domains simultaneously.…

Computer Vision and Pattern Recognition · Computer Science 2019-09-24 Jiayun Wang

Spatial-Temporal Adaptive Graph Convolution with Attention Network for Traffic Forecasting

Traffic forecasting is one canonical example of spatial-temporal learning task in Intelligent Traffic System. Existing approaches capture spatial dependency with a pre-determined matrix in graph convolution neural operators. However, the…

Machine Learning · Computer Science 2022-06-08 Chen Weikang , Li Yawen , Xue Zhe , Li Ang , Wu Guobin

View-invariant Deep Architecture for Human Action Recognition using late fusion

Human action Recognition for unknown views is a challenging task. We propose a view-invariant deep human action recognition framework, which is a novel integration of two important action cues: motion and shape temporal dynamics (STD). The…

Computer Vision and Pattern Recognition · Computer Science 2020-01-22 Chhavi Dhiman , Dinesh Kumar Vishwakarma

Self-Attention Network for Skeleton-based Human Action Recognition

Skeleton-based action recognition has recently attracted a lot of attention. Researchers are coming up with new approaches for extracting spatio-temporal relations and making considerable progress on large-scale skeleton-based datasets.…

Computer Vision and Pattern Recognition · Computer Science 2019-12-19 Sangwoo Cho , Muhammad Hasan Maqbool , Fei Liu , Hassan Foroosh

View-invariant action recognition

Human action recognition is an important problem in computer vision. It has a wide range of applications in surveillance, human-computer interaction, augmented reality, video indexing, and retrieval. The varying pattern of spatio-temporal…

Computer Vision and Pattern Recognition · Computer Science 2020-09-03 Yogesh S Rawat , Shruti Vyas

Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification

Video-based person re-identification (Re-ID) aims at matching video sequences of pedestrians across non-overlapping cameras. It is a practical yet challenging task of how to embed spatial and temporal information of a video into its feature…

Computer Vision and Pattern Recognition · Computer Science 2019-08-06 Chih-Ting Liu , Chih-Wei Wu , Yu-Chiang Frank Wang , Shao-Yi Chien

TARN: Temporal Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition

In this paper we propose a novel Temporal Attentive Relation Network (TARN) for the problems of few-shot and zero-shot action recognition. At the heart of our network is a meta-learning approach that learns to compare representations of…

Computer Vision and Pattern Recognition · Computer Science 2019-07-23 Mina Bishay , Georgios Zoumpourlis , Ioannis Patras