Related papers: PIC: Permutation Invariant Convolution for Recogni…

Timeception for Complex Action Recognition

This paper focuses on the temporal aspect for recognizing human activities in videos; an important visual cue that has long been undervalued. We revisit the conventional definition of activity and restrict it to Complex Action: a set of…

Computer Vision and Pattern Recognition · Computer Science 2019-04-30 Noureldien Hussein , Efstratios Gavves , Arnold W. M. Smeulders

Invariant recognition drives neural representations of action sequences

Recognizing the actions of others from visual stimuli is a crucial aspect of human visual perception that allows individuals to respond to social cues. Humans are able to identify similar behaviors and discriminate between distinct actions…

Neurons and Cognition · Quantitative Biology 2018-02-07 Andrea Tacchetti , Leyla Isik , Tomaso Poggio

Directional Temporal Modeling for Action Recognition

Many current activity recognition models use 3D convolutional neural networks (e.g. I3D, I3D-NL) to generate local spatial-temporal features. However, such features do not encode clip-level ordered temporal information. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2020-07-23 Xinyu Li , Bing Shuai , Joseph Tighe

Collaborative Spatio-temporal Feature Learning for Video Action Recognition

Spatio-temporal feature learning is of central importance for action recognition in videos. Existing deep neural network models either learn spatial and temporal features independently (C2D) or jointly with unconstrained parameters (C3D).…

Computer Vision and Pattern Recognition · Computer Science 2019-03-05 Chao Li , Qiaoyong Zhong , Di Xie , Shiliang Pu

Efficient Modelling Across Time of Human Actions and Interactions

This thesis focuses on video understanding for human action and interaction recognition. We start by identifying the main challenges related to action recognition from videos and review how they have been addressed by current methods. Based…

Computer Vision and Pattern Recognition · Computer Science 2021-10-06 Alexandros Stergiou

Chirality in Action: Time-Aware Video Representation Learning by Latent Straightening

Our objective is to develop compact video representations that are sensitive to visual change over time. To measure such time-sensitivity, we introduce a new task: chiral action recognition, where one needs to distinguish between a pair of…

Computer Vision and Pattern Recognition · Computer Science 2025-09-25 Piyush Bagad , Andrew Zisserman

Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection

Fine-grained action detection is an important task with numerous applications in robotics and human-computer interaction. Existing methods typically utilize a two-stage approach including extraction of local spatio-temporal features…

Computer Vision and Pattern Recognition · Computer Science 2019-11-11 Khoi-Nguyen C. Mac , Dhiraj Joshi , Raymond A. Yeh , Jinjun Xiong , Rogerio S. Feris , Minh N. Do

Recurrent Models of Visual Attention

Applying convolutional neural networks to large images is computationally expensive because the amount of computation scales linearly with the number of image pixels. We present a novel recurrent neural network model that is capable of…

Machine Learning · Computer Science 2014-06-25 Volodymyr Mnih , Nicolas Heess , Alex Graves , Koray Kavukcuoglu

Long-term Temporal Convolutions for Action Recognition

Typical human actions last several seconds and exhibit characteristic spatio-temporal structure. Recent methods attempt to capture this structure and learn action representations with convolutional neural networks. Such representations,…

Computer Vision and Pattern Recognition · Computer Science 2017-06-05 Gül Varol , Ivan Laptev , Cordelia Schmid

View-invariant action recognition

Human action recognition is an important problem in computer vision. It has a wide range of applications in surveillance, human-computer interaction, augmented reality, video indexing, and retrieval. The varying pattern of spatio-temporal…

Computer Vision and Pattern Recognition · Computer Science 2020-09-03 Yogesh S Rawat , Shruti Vyas

Deep ConvLSTM with self-attention for human activity decoding using wearables

Decoding human activity accurately from wearable sensors can aid in applications related to healthcare and context awareness. The present approaches in this domain use recurrent and/or convolutional models to capture the spatio-temporal…

Human-Computer Interaction · Computer Science 2020-12-21 Satya P. Singh , Aimé Lay-Ekuakille , Deepak Gangwar , Madan Kumar Sharma , Sukrit Gupta

Behavior Recognition Based on the Integration of Multigranular Motion Features

The recognition of behaviors in videos usually requires a combinatorial analysis of the spatial information about objects and their dynamic action information in the temporal dimension. Specifically, behavior recognition may even rely more…

Computer Vision and Pattern Recognition · Computer Science 2022-03-08 Lizong Zhang , Yiming Wang , Bei Hui , Xiujian Zhang , Sijuan Liu , Shuxin Feng

Coarse-Fine Networks for Temporal Activity Detection in Videos

In this paper, we introduce Coarse-Fine Networks, a two-stream architecture which benefits from different abstractions of temporal resolution to learn better video representations for long-term motion. Traditional Video models process…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Kumara Kahatapitiya , Michael S. Ryoo

PiNet: A Permutation Invariant Graph Neural Network for Graph Classification

We propose an end-to-end deep learning learning model for graph classification and representation learning that is invariant to permutation of the nodes of the input graphs. We address the challenge of learning a fixed size graph…

Machine Learning · Computer Science 2019-05-09 Peter Meltzer , Marcelo Daniel Gutierrez Mallea , Peter J. Bentley

A Framework for Learning Invariant Physical Relations in Multimodal Sensory Processing

Perceptual learning enables humans to recognize and represent stimuli invariant to various transformations and build a consistent representation of the self and physical world. Such representations preserve the invariant physical relations…

Neural and Evolutionary Computing · Computer Science 2020-07-02 Du Xiaorui , Yavuzhan Erdem , Immanuel Schweizer , Cristian Axenie

Perception Over Time: Temporal Dynamics for Robust Image Understanding

While deep learning surpasses human-level performance in narrow and specific vision tasks, it is fragile and over-confident in classification. For example, minor transformations in perspective, illumination, or object deformation in the…

Computer Vision and Pattern Recognition · Computer Science 2022-03-15 Maryam Daniali , Edward Kim

Interpreting video features: a comparison of 3D convolutional networks and convolutional LSTM networks

A number of techniques for interpretability have been presented for deep learning in computer vision, typically with the goal of understanding what the networks have based their classification on. However, interpretability for deep video…

Computer Vision and Pattern Recognition · Computer Science 2020-07-13 Joonatan Mänttäri , Sofia Broomé , John Folkesson , Hedvig Kjellström

Video Action Recognition Collaborative Learning with Dynamics via PSO-ConvNet Transformer

Recognizing human actions in video sequences, known as Human Action Recognition (HAR), is a challenging task in pattern recognition. While Convolutional Neural Networks (ConvNets) have shown remarkable success in image recognition, they are…

Computer Vision and Pattern Recognition · Computer Science 2023-09-22 Nguyen Huu Phong , Bernardete Ribeiro

Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks

This thesis explore different approaches using Convolutional and Recurrent Neural Networks to classify and temporally localize activities on videos, furthermore an implementation to achieve it has been proposed. As the first step, features…

Computer Vision and Pattern Recognition · Computer Science 2017-03-06 Alberto Montes , Amaia Salvador , Santiago Pascual , Xavier Giro-i-Nieto

3D Convolutional with Attention for Action Recognition

Human action recognition is one of the challenging tasks in computer vision. The current action recognition methods use computationally expensive models for learning spatio-temporal dependencies of the action. Models utilizing RGB channels…

Computer Vision and Pattern Recognition · Computer Science 2022-06-07 Labina Shrestha , Shikha Dubey , Farrukh Olimov , Muhammad Aasim Rafique , Moongu Jeon