Related papers: A Structured Model For Action Detection

Recurrent Residual Learning for Action Recognition

Action recognition is a fundamental problem in computer vision with a lot of potential applications such as video surveillance, human computer interaction, and robot learning. Given pre-segmented videos, the task is to recognize actions…

Computer Vision and Pattern Recognition · Computer Science 2017-06-28 Ahsan Iqbal , Alexander Richard , Hilde Kuehne , Juergen Gall

VideoCapsuleNet: A Simplified Network for Action Detection

The recent advances in Deep Convolutional Neural Networks (DCNNs) have shown extremely good results for video human action classification, however, action detection is still a challenging problem. The current action detection approaches…

Computer Vision and Pattern Recognition · Computer Science 2018-05-22 Kevin Duarte , Yogesh S Rawat , Mubarak Shah

AR-Net: Adaptive Frame Resolution for Efficient Action Recognition

Action recognition is an open and challenging problem in computer vision. While current state-of-the-art models offer excellent recognition results, their computational expense limits their impact for many real-world applications. In this…

Computer Vision and Pattern Recognition · Computer Science 2020-08-03 Yue Meng , Chung-Ching Lin , Rameswar Panda , Prasanna Sattigeri , Leonid Karlinsky , Aude Oliva , Kate Saenko , Rogerio Feris

AMTnet: Action-Micro-Tube Regression by End-to-end Trainable Deep Architecture

Dominant approaches to action detection can only provide sub-optimal solutions to the problem, as they rely on seeking frame-level detections, to later compose them into "action tubes" in a post-processing step. With this paper we radically…

Computer Vision and Pattern Recognition · Computer Science 2017-08-08 Suman Saha , Gurkirt Singh , Fabio Cuzzolin

Action Recognition: From Static Datasets to Moving Robots

Deep learning models have achieved state-of-the- art performance in recognizing human activities, but often rely on utilizing background cues present in typical computer vision datasets that predominantly have a stationary camera. If these…

Robotics · Computer Science 2017-09-20 Fahimeh Rezazadegan , Sareh Shirazi , Ben Upcroft , Michael Milford

A3D: Adaptive 3D Networks for Video Action Recognition

This paper presents A3D, an adaptive 3D network that can infer at a wide range of computational constraints with one-time training. Instead of training multiple models in a grid-search manner, it generates good configurations by trading off…

Computer Vision and Pattern Recognition · Computer Science 2020-11-26 Sijie Zhu , Taojiannan Yang , Matias Mendieta , Chen Chen

Spatial-Temporal Alignment Network for Action Recognition and Detection

This paper studies how to introduce viewpoint-invariant feature representations that can help action recognition and detection. Although we have witnessed great progress of action recognition in the past decade, it remains challenging yet…

Computer Vision and Pattern Recognition · Computer Science 2020-12-07 Junwei Liang , Liangliang Cao , Xuehan Xiong , Ting Yu , Alexander Hauptmann

An Efficient 3D Convolutional Neural Network with Channel-wise, Spatial-grouped, and Temporal Convolutions

There has been huge progress on video action recognition in recent years. However, many works focus on tweaking existing 2D backbones due to the reliance of ImageNet pretraining, which restrains the models from achieving higher efficiency…

Computer Vision and Pattern Recognition · Computer Science 2025-03-05 Zhe Wang , Xulei Yang

Learning and Recognizing Human Action from Skeleton Movement with Deep Residual Neural Networks

Automatic human action recognition is indispensable for almost artificial intelligent systems such as video surveillance, human-computer interfaces, video retrieval, etc. Despite a lot of progress, recognizing actions in an unknown video is…

Computer Vision and Pattern Recognition · Computer Science 2018-03-22 Huy-Hieu Pham , Louahdi Khoudour , Alain Crouzil , Pablo Zegers , Sergio A. Velastin

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

Deep convolutional networks have achieved great success for visual recognition in still images. However, for action recognition in videos, the advantage over traditional methods is not so evident. This paper aims to discover the principles…

Computer Vision and Pattern Recognition · Computer Science 2016-08-03 Limin Wang , Yuanjun Xiong , Zhe Wang , Yu Qiao , Dahua Lin , Xiaoou Tang , Luc Van Gool

Two-Stream Convolutional Networks for Action Recognition in Videos

We investigate architectures of discriminatively trained deep Convolutional Networks (ConvNets) for action recognition in video. The challenge is to capture the complementary information on appearance from still frames and motion between…

Computer Vision and Pattern Recognition · Computer Science 2014-11-13 Karen Simonyan , Andrew Zisserman

Exploiting deep residual networks for human action recognition from skeletal data

The computer vision community is currently focusing on solving action recognition problems in real videos, which contain thousands of samples with many challenges. In this process, Deep Convolutional Neural Networks (D-CNNs) have played a…

Computer Vision and Pattern Recognition · Computer Science 2018-03-22 Huy-Hieu Pham , Louahdi Khoudour , Alain Crouzil , Pablo Zegers , Sergio A. Velastin

3D Convolutional with Attention for Action Recognition

Human action recognition is one of the challenging tasks in computer vision. The current action recognition methods use computationally expensive models for learning spatio-temporal dependencies of the action. Models utilizing RGB channels…

Computer Vision and Pattern Recognition · Computer Science 2022-06-07 Labina Shrestha , Shikha Dubey , Farrukh Olimov , Muhammad Aasim Rafique , Moongu Jeon

Action Machine: Rethinking Action Recognition in Trimmed Videos

Existing methods in video action recognition mostly do not distinguish human body from the environment and easily overfit the scenes and objects. In this work, we present a conceptually simple, general and high-performance framework for…

Computer Vision and Pattern Recognition · Computer Science 2018-12-18 Jiagang Zhu , Wei Zou , Liang Xu , Yiming Hu , Zheng Zhu , Manyu Chang , Junjie Huang , Guan Huang , Dalong Du

Video action detection by learning graph-based spatio-temporal interactions

Action Detection is a complex task that aims to detect and classify human actions in video clips. Typically, it has been addressed by processing fine-grained features extracted from a video classification backbone. Recently, thanks to the…

Computer Vision and Pattern Recognition · Computer Science 2021-03-02 Matteo Tomei , Lorenzo Baraldi , Simone Calderara , Simone Bronzin , Rita Cucchiara

A Comprehensive Study of Deep Video Action Recognition

Video action recognition is one of the representative tasks for video understanding. Over the last decade, we have witnessed great advancements in video action recognition thanks to the emergence of deep learning. But we also encountered…

Computer Vision and Pattern Recognition · Computer Science 2020-12-14 Yi Zhu , Xinyu Li , Chunhui Liu , Mohammadreza Zolfaghari , Yuanjun Xiong , Chongruo Wu , Zhi Zhang , Joseph Tighe , R. Manmatha , Mu Li

Computer Vision based group activity detection and action spotting

Group activity detection in multi-person scenes is challenging due to complex human interactions, occlusions, and variations in appearance over time. This work presents a computer vision based framework for group activity recognition and…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Narthana Sivalingam , Santhirarajah Sivasthigan , Thamayanthi Mahendranathan , G. M. R. I. Godaliyadda , M. P. B. Ekanayake , H. M. V. R. Herath

Video Modeling with Correlation Networks

Motion is a salient cue to recognize actions in video. Modern action recognition models leverage motion information either explicitly by using optical flow as input or implicitly by means of 3D convolutional filters that simultaneously…

Computer Vision and Pattern Recognition · Computer Science 2020-05-28 Heng Wang , Du Tran , Lorenzo Torresani , Matt Feiszli

RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems

Interactive autonomous applications require robustness of the perception engine to artifacts in unconstrained videos. In this paper, we examine the effect of camera motion on the task of action detection. We develop a novel ranking method…

Computer Vision and Pattern Recognition · Computer Science 2022-05-03 Burhan A. Mudassar , Sho Ko , Maojingjing Li , Priyabrata Saha , Saibal Mukhopadhyay

More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation

Current state-of-the-art models for video action recognition are mostly based on expensive 3D ConvNets. This results in a need for large GPU clusters to train and evaluate such architectures. To address this problem, we present a…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Quanfu Fan , Chun-Fu Chen , Hilde Kuehne , Marco Pistoia , David Cox