Related papers: Sequence Summarization Using Order-constrained Ker…

Non-Linear Temporal Subspace Representations for Activity Recognition

Representations that can compactly and effectively capture the temporal evolution of semantic content are important to computer vision and machine learning algorithms that operate on multi-variate time-series data. We investigate such…

Computer Vision and Pattern Recognition · Computer Science 2018-03-30 Anoop Cherian , Suvrit Sra , Stephen Gould , Richard Hartley

Generalized Rank Pooling for Activity Recognition

Most popular deep models for action recognition split video sequences into short sub-sequences consisting of a few frames; frame-based features are then pooled for recognizing the activity. Usually, this pooling step discards the temporal…

Computer Vision and Pattern Recognition · Computer Science 2017-07-25 Anoop Cherian , Basura Fernando , Mehrtash Harandi , Stephen Gould

Rank Pooling for Action Recognition

We propose a function-based temporal pooling method that captures the latent structure of the video sequence data - e.g. how frame-level features evolve over time in a video. We show how the parameters of a function that has been fit to the…

Computer Vision and Pattern Recognition · Computer Science 2016-05-17 Basura Fernando , Efstratios Gavves , Jose Oramas , Amir Ghodrati , Tinne Tuytelaars

Higher-order Pooling of CNN Features via Kernel Linearization for Action Recognition

Most successful deep learning algorithms for action recognition extend models designed for image-based tasks such as object recognition to video. Such extensions are typically trained for actions on single video frames or very short clips,…

Computer Vision and Pattern Recognition · Computer Science 2017-01-20 Anoop Cherian , Piotr Koniusz , Stephen Gould

Action Recognition with Dynamic Image Networks

We introduce the concept of "dynamic image", a novel compact representation of videos useful for video analysis, particularly in combination with convolutional neural networks (CNNs). A dynamic image encodes temporal data such as RGB or…

Computer Vision and Pattern Recognition · Computer Science 2017-08-22 Hakan Bilen , Basura Fernando , Efstratios Gavves , Andrea Vedaldi

Human Action Recognition with Deep Temporal Pyramids

Deep convolutional neural networks (CNNs) are nowadays achieving significant leaps in different pattern recognition tasks including action recognition. Current CNNs are increasingly deeper, data-hungrier and this makes their success…

Computer Vision and Pattern Recognition · Computer Science 2019-05-03 Ahmed Mazari , Hichem Sahbi

Order-aware Convolutional Pooling for Video Based Action Recognition

Most video based action recognition approaches create the video-level representation by temporally pooling the features extracted at each frame. The pooling methods that they adopt, however, usually completely or partially neglect the…

Computer Vision and Pattern Recognition · Computer Science 2016-02-02 Peng Wang , Lingqiao Liu , Chunhua Shen , Heng Tao Shen

Subspace Clustering for Action Recognition with Covariance Representations and Temporal Pruning

This paper tackles the problem of human action recognition, defined as classifying which action is displayed in a trimmed sequence, from skeletal data. Albeit state-of-the-art approaches designed for this application are all supervised, in…

Computer Vision and Pattern Recognition · Computer Science 2022-07-22 Giancarlo Paoletti , Jacopo Cavazza , Cigdem Beyan , Alessio Del Bue

Discriminatively Learned Hierarchical Rank Pooling Networks

In this work, we present novel temporal encoding methods for action and activity classification by extending the unsupervised rank pooling temporal encoding method in two ways. First, we present "discriminative rank pooling" in which the…

Computer Vision and Pattern Recognition · Computer Science 2017-05-31 Basura Fernando , Stephen Gould

Action Representation Using Classifier Decision Boundaries

Most popular deep learning based models for action recognition are designed to generate separate predictions within their short temporal windows, which are often aggregated by heuristic means to assign an action label to the full video…

Computer Vision and Pattern Recognition · Computer Science 2017-04-07 Jue Wang , Anoop Cherian , Fatih Porikli , Stephen Gould

Ordered Pooling of Optical Flow Sequences for Action Recognition

Training of Convolutional Neural Networks (CNNs) on long video sequences is computationally expensive due to the substantial memory requirements and the massive number of parameters that deep architectures demand. Early fusion of video…

Computer Vision and Pattern Recognition · Computer Science 2017-04-07 Jue Wang , Anoop Cherian , Fatih Porikli

Second-order Temporal Pooling for Action Recognition

Deep learning models for video-based action recognition usually generate features for short clips (consisting of a few frames); such clip-level features are aggregated to video-level representations by computing statistics on these…

Computer Vision and Pattern Recognition · Computer Science 2018-08-08 Anoop Cherian , Stephen Gould

Kernelized Low Rank Representation on Grassmann Manifolds

Low rank representation (LRR) has recently attracted great interest due to its pleasing efficacy in exploring low-dimensional subspace structures embedded in data. One of its successful applications is subspace clustering which means data…

Computer Vision and Pattern Recognition · Computer Science 2015-04-09 Boyue Wang , Yongli Hu , Junbin Gao , Yanfeng Sun , Baocai Yin

Deep hierarchical pooling design for cross-granularity action recognition

In this paper, we introduce a novel hierarchical aggregation design that captures different levels of temporal granularity in action recognition. Our design principle is coarse-to-fine and achieved using a tree-structured network; as we…

Computer Vision and Pattern Recognition · Computer Science 2020-06-09 Ahmed Mazari , Hichem Sahbi

Linear-Time Sequence Classification using Restricted Boltzmann Machines

Classification of sequence data is the topic of interest for dynamic Bayesian models and Recurrent Neural Networks (RNNs). While the former can explicitly model the temporal dependencies between class variables, the latter have a capability…

Machine Learning · Computer Science 2018-03-12 Son N. Tran , Srikanth Cherla , Artur Garcez , Tillman Weyde

Object Level Deep Feature Pooling for Compact Image Representation

Convolutional Neural Network (CNN) features have been successfully employed in recent works as an image descriptor for various vision tasks. But the inability of the deep CNN features to exhibit invariance to geometric transformations and…

Computer Vision and Pattern Recognition · Computer Science 2015-04-27 Konda Reddy Mopuri , R. Venkatesh Babu

Approximation and Learning with Deep Convolutional Models: a Kernel Perspective

The empirical success of deep convolutional networks on tasks involving high-dimensional data such as images or audio suggests that they can efficiently approximate certain functions that are well-suited for such tasks. In this paper, we…

Machine Learning · Statistics 2022-03-22 Alberto Bietti

Action Recognition with Deep Multiple Aggregation Networks

Most of the current action recognition algorithms are based on deep networks which stack multiple convolutional, pooling and fully connected layers. While convolutional and fully connected operations have been widely studied in the…

Computer Vision and Pattern Recognition · Computer Science 2020-06-09 Ahmed Mazari , Hichem Sahbi

Self-Supervision based Task-Specific Image Collection Summarization

Successful applications of deep learning (DL) requires large amount of annotated data. This often restricts the benefits of employing DL to businesses and individuals with large budgets for data-collection and computation. Summarization…

Multimedia · Computer Science 2021-01-05 Anurag Singh , Deepak Kumar Sharma , Sudhir Kumar Sharma

Tensor Representations for Action Recognition

Human actions in video sequences are characterized by the complex interplay between spatial features and their temporal dynamics. In this paper, we propose novel tensor representations for compactly capturing such higher-order relationships…

Computer Vision and Pattern Recognition · Computer Science 2021-08-31 Piotr Koniusz , Lei Wang , Anoop Cherian