Related papers: Timeception for Complex Action Recognition

Long-term Temporal Convolutions for Action Recognition

Typical human actions last several seconds and exhibit characteristic spatio-temporal structure. Recent methods attempt to capture this structure and learn action representations with convolutional neural networks. Such representations,…

Computer Vision and Pattern Recognition · Computer Science 2017-06-05 Gül Varol , Ivan Laptev , Cordelia Schmid

Efficient Modelling Across Time of Human Actions and Interactions

This thesis focuses on video understanding for human action and interaction recognition. We start by identifying the main challenges related to action recognition from videos and review how they have been addressed by current methods. Based…

Computer Vision and Pattern Recognition · Computer Science 2021-10-06 Alexandros Stergiou

Interpretable Action Recognition on Hard to Classify Actions

We investigate a human-like interpretable model of video understanding. Humans recognise complex activities in video by recognising critical spatio-temporal relations among explicitly recognised objects and parts, for example, an object…

Computer Vision and Pattern Recognition · Computer Science 2024-09-23 Anastasia Anichenko , Frank Guerin , Andrew Gilbert

Action2Activity: Recognizing Complex Activities from Sensor Data

As compared to simple actions, activities are much more complex, but semantically consistent with a human's real life. Techniques for action recognition from sensor generated data are mature. However, there has been relatively little work…

Computer Vision and Pattern Recognition · Computer Science 2016-11-08 Ye Liu , Liqiang Nie , Lei Han , Luming Zhang , David S Rosenblum

Human-like Relational Models for Activity Recognition in Video

Video activity recognition by deep neural networks is impressive for many classes. However, it falls short of human performance, especially for challenging to discriminate activities. Humans differentiate these complex activities by…

Computer Vision and Pattern Recognition · Computer Science 2022-01-12 Joseph Chrol-Cannon , Andrew Gilbert , Ranko Lazic , Adithya Madhusoodanan , Frank Guerin

Activity Recognition on a Large Scale in Short Videos - Moments in Time Dataset

Moments capture a huge part of our lives. Accurate recognition of these moments is challenging due to the diverse and complex interpretation of the moments. Action recognition refers to the act of classifying the desired action/activity…

Computer Vision and Pattern Recognition · Computer Science 2018-09-14 Ankit Shah , Harini Kesavamoorthy , Poorva Rane , Pramati Kalwad , Alexander Hauptmann , Florian Metze

Attentive Action and Context Factorization

We propose a method for human action recognition, one that can localize the spatiotemporal regions that `define' the actions. This is a challenging task due to the subtlety of human actions in video and the co-occurrence of contextual…

Computer Vision and Pattern Recognition · Computer Science 2019-04-12 Yang Wang , Vinh Tran , Gedas Bertasius , Lorenzo Torresani , Minh Hoai

3D Human Activity Recognition with Reconfigurable Convolutional Neural Networks

Human activity understanding with 3D/depth sensors has received increasing attention in multimedia processing and interactions. This work targets on developing a novel deep model for automatic activity recognition from RGB-D videos. We…

Computer Vision and Pattern Recognition · Computer Science 2017-08-01 Keze Wang , Xiaolong Wang , Liang Lin , Meng Wang , Wangmeng Zuo

Collaborative Spatio-temporal Feature Learning for Video Action Recognition

Spatio-temporal feature learning is of central importance for action recognition in videos. Existing deep neural network models either learn spatial and temporal features independently (C2D) or jointly with unconstrained parameters (C3D).…

Computer Vision and Pattern Recognition · Computer Science 2019-03-05 Chao Li , Qiaoyong Zhong , Di Xie , Shiliang Pu

PIC: Permutation Invariant Convolution for Recognizing Long-range Activities

Neural operations as convolutions, self-attention, and vector aggregation are the go-to choices for recognizing short-range actions. However, they have three limitations in modeling long-range activities. This paper presents PIC,…

Computer Vision and Pattern Recognition · Computer Science 2020-03-19 Noureldien Hussein , Efstratios Gavves , Arnold W. M. Smeulders

Learning Latent Sub-events in Activity Videos Using Temporal Attention Filters

In this paper, we newly introduce the concept of temporal attention filters, and describe how they can be used for human activity recognition from videos. Many high-level activities are often composed of multiple temporal parts (e.g.,…

Computer Vision and Pattern Recognition · Computer Science 2016-12-28 AJ Piergiovanni , Chenyou Fan , Michael S. Ryoo

Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks

This thesis explore different approaches using Convolutional and Recurrent Neural Networks to classify and temporally localize activities on videos, furthermore an implementation to achieve it has been proposed. As the first step, features…

Computer Vision and Pattern Recognition · Computer Science 2017-03-06 Alberto Montes , Amaia Salvador , Santiago Pascual , Xavier Giro-i-Nieto

Spatio-temporal Action Recognition: A Survey

The task of action recognition or action detection involves analyzing videos and determining what action or motion is being performed. The primary subject of these videos are predominantly humans performing some action. However, this…

Computer Vision and Pattern Recognition · Computer Science 2019-01-29 Amlaan Bhoi

Human Activity Recognition based on Dynamic Spatio-Temporal Relations

Human activity, which usually consists of several actions, generally covers interactions among persons and or objects. In particular, human actions involve certain spatial and temporal relationships, are the components of more complicated…

Computer Vision and Pattern Recognition · Computer Science 2020-06-30 Zhenyu Liu , Yaqiang Yao , Yan Liu , Yuening Zhu , Zhenchao Tao , Lei Wang , Yuhong Feng

Spatio-Temporal FAST 3D Convolutions for Human Action Recognition

Effective processing of video input is essential for the recognition of temporally varying events such as human actions. Motivated by the often distinctive temporal characteristics of actions in either horizontal or vertical direction, we…

Computer Vision and Pattern Recognition · Computer Science 2020-06-24 Alexandros Stergiou , Ronald Poppe

Video action detection by learning graph-based spatio-temporal interactions

Action Detection is a complex task that aims to detect and classify human actions in video clips. Typically, it has been addressed by processing fine-grained features extracted from a video classification backbone. Recently, thanks to the…

Computer Vision and Pattern Recognition · Computer Science 2021-03-02 Matteo Tomei , Lorenzo Baraldi , Simone Calderara , Simone Bronzin , Rita Cucchiara

MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection

Action detection is an essential and challenging task, especially for densely labelled datasets of untrimmed videos. The temporal relation is complex in those datasets, including challenges like composite action, and co-occurring action.…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Rui Dai , Srijan Das , Kumara Kahatapitiya , Michael S. Ryoo , Francois Bremond

Action recognition in real-world videos

The goal of human action recognition is to temporally or spatially localize the human action of interest in video sequences. Temporal localization (i.e. indicating the start and end frames of the action in a video) is referred to as…

Computer Vision and Pattern Recognition · Computer Science 2020-04-24 Waqas Sultani , Qazi Ammar Arshad , Chen Chen

Flatten: Video Action Recognition is an Image Classification task

In recent years, video action recognition, as a fundamental task in the field of video understanding, has been deeply explored by numerous researchers.Most traditional video action recognition methods typically involve converting videos…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Junlin Chen , Chengcheng Xu , Yangfan Xu , Jian Yang , Jun Li , Zhiping Shi

Multi-Temporal Convolutions for Human Action Recognition in Videos

Effective extraction of temporal patterns is crucial for the recognition of temporally varying actions in video. We argue that the fixed-sized spatio-temporal convolution kernels used in convolutional neural networks (CNNs) can be improved…

Computer Vision and Pattern Recognition · Computer Science 2021-04-01 Alexandros Stergiou , Ronald Poppe