Related papers: Multi-Stream Single Shot Spatial-Temporal Action D…

Two-Stream 3D Convolutional Neural Network for Skeleton-Based Action Recognition

It remains a challenge to efficiently extract spatialtemporal information from skeleton sequences for 3D human action recognition. Although most recent action recognition methods are based on Recurrent Neural Networks which present…

Computer Vision and Pattern Recognition · Computer Science 2017-06-08 Hong Liu , Juanhui Tu , Mengyuan Liu

D3D: Distilled 3D Networks for Video Action Recognition

State-of-the-art methods for video action recognition commonly use an ensemble of two networks: the spatial stream, which takes RGB frames as input, and the temporal stream, which takes optical flow as input. In recent work, both of these…

Computer Vision and Pattern Recognition · Computer Science 2019-02-07 Jonathan C. Stroud , David A. Ross , Chen Sun , Jia Deng , Rahul Sukthankar

Sequential Deep Trajectory Descriptor for Action Recognition with Three-stream CNN

Learning the spatial-temporal representation of motion information is crucial to human action recognition. Nevertheless, most of the existing features or descriptors cannot capture motion information effectively, especially for long-term…

Computer Vision and Pattern Recognition · Computer Science 2017-02-13 Yemin Shi , Yonghong Tian , Yaowei Wang , Tiejun Huang

Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition

In recent years, a number of approaches based on 2D or 3D convolutional neural networks (CNN) have emerged for video action recognition, achieving state-of-the-art results on several large-scale benchmark datasets. In this paper, we carry…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Chun-Fu Chen , Rameswar Panda , Kandan Ramakrishnan , Rogerio Feris , John Cohn , Aude Oliva , Quanfu Fan

Spatial-temporal Fusion Convolutional Neural Network for Simulated Driving Behavior Recognition

Abnormal driving behaviour is one of the leading cause of terrible traffic accidents endangering human life. Therefore, study on driving behaviour surveillance has become essential to traffic security and public management. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2018-12-04 Yaocong Hu , MingQi Lu , Xiaobo Lu

ACDnet: An action detection network for real-time edge computing based on flow-guided feature approximation and memory aggregation

Interpreting human actions requires understanding the spatial and temporal context of the scenes. State-of-the-art action detectors based on Convolutional Neural Network (CNN) have demonstrated remarkable results by adopting two-stream or…

Computer Vision and Pattern Recognition · Computer Science 2021-03-01 Yu Liu , Fan Yang , Dominique Ginhac

Three-stream network for enriched Action Recognition

Understanding accurate information on human behaviours is one of the most important tasks in machine intelligence. Human Activity Recognition that aims to understand human activities from a video is a challenging task due to various…

Computer Vision and Pattern Recognition · Computer Science 2021-06-25 Ivaxi Sheth

Two-Stream RNN/CNN for Action Recognition in 3D Videos

The recognition of actions from video sequences has many applications in health monitoring, assisted living, surveillance, and smart homes. Despite advances in sensing, in particular related to 3D video, the methodologies to process the…

Computer Vision and Pattern Recognition · Computer Science 2018-10-03 Rui Zhao , Haider Ali , Patrick van der Smagt

3D Convolutional with Attention for Action Recognition

Human action recognition is one of the challenging tasks in computer vision. The current action recognition methods use computationally expensive models for learning spatio-temporal dependencies of the action. Models utilizing RGB channels…

Computer Vision and Pattern Recognition · Computer Science 2022-06-07 Labina Shrestha , Shikha Dubey , Farrukh Olimov , Muhammad Aasim Rafique , Moongu Jeon

S3D: Single Shot multi-Span Detector via Fully 3D Convolutional Networks

In this paper, we present a novel Single Shot multi-Span Detector for temporal activity detection in long, untrimmed videos using a simple end-to-end fully three-dimensional convolutional (Conv3D) network. Our architecture, named S3D,…

Computer Vision and Pattern Recognition · Computer Science 2018-08-09 Da Zhang , Xiyang Dai , Xin Wang , Yuan-Fang Wang

Joint Network based Attention for Action Recognition

By extracting spatial and temporal characteristics in one network, the two-stream ConvNets can achieve the state-of-the-art performance in action recognition. However, such a framework typically suffers from the separately processing of…

Computer Vision and Pattern Recognition · Computer Science 2016-11-17 Yemin Shi , Yonghong Tian , Yaowei Wang , Tiejun Huang

Segmental Spatiotemporal CNNs for Fine-grained Action Segmentation

Joint segmentation and classification of fine-grained actions is important for applications of human-robot interaction, video surveillance, and human skill evaluation. However, despite substantial recent progress in large-scale action…

Computer Vision and Pattern Recognition · Computer Science 2016-10-03 Colin Lea , Austin Reiter , Rene Vidal , Gregory D. Hager

Learning Spatiotemporal Features for Infrared Action Recognition with 3D Convolutional Neural Networks

Infrared (IR) imaging has the potential to enable more robust action recognition systems compared to visible spectrum cameras due to lower sensitivity to lighting conditions and appearance variability. While the action recognition task on…

Computer Vision and Pattern Recognition · Computer Science 2017-05-19 Zhuolin Jiang , Viktor Rozgic , Sancar Adali

Temporal Convolutional Networks: A Unified Approach to Action Segmentation

The dominant paradigm for video-based action segmentation is composed of two steps: first, for each frame, compute low-level features using Dense Trajectories or a Convolutional Neural Network that encode spatiotemporal information locally,…

Computer Vision and Pattern Recognition · Computer Science 2016-08-31 Colin Lea , Rene Vidal , Austin Reiter , Gregory D. Hager

An Information-rich Sampling Technique over Spatio-Temporal CNN for Classification of Human Actions in Videos

We propose a novel scheme for human action recognition in videos, using a 3-dimensional Convolutional Neural Network (3D CNN) based classifier. Traditionally in deep learning based human activity recognition approaches, either a few random…

Computer Vision and Pattern Recognition · Computer Science 2020-02-10 S. H. Shabbeer Basha , Viswanath Pulabaigari , Snehasis Mukherjee

Discovering Spatio-Temporal Action Tubes

In this paper, we address the challenging problem of spatial and temporal action detection in videos. We first develop an effective approach to localize frame-level action regions through integrating static and kinematic information by the…

Computer Vision and Pattern Recognition · Computer Science 2018-11-30 Yuancheng Ye , Xiaodong Yang , Yingli Tian

Multi-View Region Adaptive Multi-temporal DMM and RGB Action Recognition

Human action recognition remains an important yet challenging task. This work proposes a novel action recognition system. It uses a novel Multiple View Region Adaptive Multi-resolution in time Depth Motion Map (MV-RAMDMM) formulation…

Computer Vision and Pattern Recognition · Computer Science 2019-04-15 Mahmoud Al-Faris , John P. Chiverton , Yanyan Yang , David L. Ndzi

Deep Learning Approaches for Human Action Recognition in Video Data

Human action recognition in videos is a critical task with significant implications for numerous applications, including surveillance, sports analytics, and healthcare. The challenge lies in creating models that are both precise in their…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Yufei Xie

Spatio-Temporal Action Detection with Multi-Object Interaction

Spatio-temporal action detection in videos requires localizing the action both spatially and temporally in the form of an "action tube". Nowadays, most spatio-temporal action detection datasets (e.g. UCF101-24, AVA, DALY) are annotated with…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Huijuan Xu , Lizhi Yang , Stan Sclaroff , Kate Saenko , Trevor Darrell

Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems

This chapter aims to aid the development of Cyber-Physical Systems (CPS) in automated understanding of events and activities in various applications of video-surveillance. These events are mostly captured by drones, CCTVs or novice and…

Computer Vision and Pattern Recognition · Computer Science 2021-11-04 Swarnabja Bhaumik , Prithwish Jana , Partha Pratim Mohanta