Related papers: Action Classification with Locality-constrained Li…

CSIFT Based Locality-constrained Linear Coding for Image Classification

In the past decade, SIFT descriptor has been witnessed as one of the most robust local invariant feature descriptors and widely used in various vision tasks. Most traditional image classification systems depend on the luminance-based SIFT…

Computer Vision and Pattern Recognition · Computer Science 2013-10-01 Chen Junzhou , Li Qing , Peng Qiang , Kin Hong Wong

Visual Tracking via Nonnegative Regularization Multiple Locality Coding

This paper presents a novel object tracking method based on approximated Locality-constrained Linear Coding (LLC). Rather than using a non-negativity constraint on encoding coefficients to guarantee these elements nonnegative, in this…

Computer Vision and Pattern Recognition · Computer Science 2015-10-30 Fanghui Liu , Tao Zhou , Irene Y. H. Gu , Jie Yang

Linear centralization classifier

A classification algorithm, called the Linear Centralization Classifier (LCC), is introduced. The algorithm seeks to find a transformation that best maps instances from the feature space to a space where they concentrate towards the center…

Machine Learning · Computer Science 2017-12-25 Mohammad Reza Bonyadi , Viktor Vegh , David C. Reutens

Discriminatively Learned Hierarchical Rank Pooling Networks

In this work, we present novel temporal encoding methods for action and activity classification by extending the unsupervised rank pooling temporal encoding method in two ways. First, we present "discriminative rank pooling" in which the…

Computer Vision and Pattern Recognition · Computer Science 2017-05-31 Basura Fernando , Stephen Gould

Action parsing using context features

We propose an action parsing algorithm to parse a video sequence containing an unknown number of actions into its action segments. We argue that context information, particularly the temporal information about other actions in the video…

Computer Vision and Pattern Recognition · Computer Science 2022-05-23 Nagita Mehrseresht

Self-Supervised Contrastive Learning for Videos using Differentiable Local Alignment

Robust frame-wise embeddings are essential to perform video analysis and understanding tasks. We present a self-supervised method for representation learning based on aligning temporal video sequences. Our framework uses a transformer-based…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Keyne Oei , Amr Gomaa , Anna Maria Feit , João Belo

SCT: Set Constrained Temporal Transformer for Set Supervised Action Segmentation

Temporal action segmentation is a topic of increasing interest, however, annotating each frame in a video is cumbersome and costly. Weakly supervised approaches therefore aim at learning temporal action segmentation from videos that are…

Computer Vision and Pattern Recognition · Computer Science 2020-04-01 Mohsen Fayyaz , Juergen Gall

Spatio-temporal Human Action Localisation and Instance Segmentation in Temporally Untrimmed Videos

Current state-of-the-art human action recognition is focused on the classification of temporally trimmed videos in which only one action occurs per frame. In this work we address the problem of action localisation and instance segmentation…

Computer Vision and Pattern Recognition · Computer Science 2017-08-08 Suman Saha , Gurkirt Singh , Michael Sapienza , Philip H. S. Torr , Fabio Cuzzolin

Spatio-temporal prediction in video coding by best approximation

Within the scope of this contribution we propose a novel efficient spatio-temporal prediction algorithm for video coding. The algorithm operates in two stages. First, motion compensation is performed on the block to be predicted in order to…

Image and Video Processing · Electrical Eng. & Systems 2022-07-21 Jürgen Seiler , Haricharan Lakshman , André Kaup

Online Human Action Detection using Joint Classification-Regression Recurrent Neural Networks

Human action recognition from well-segmented 3D skeleton data has been intensively studied and has been attracting an increasing attention. Online action detection goes one step further and is more challenging, which identifies the action…

Computer Vision and Pattern Recognition · Computer Science 2016-07-27 Yanghao Li , Cuiling Lan , Junliang Xing , Wenjun Zeng , Chunfeng Yuan , Jiaying Liu

Attentive Action and Context Factorization

We propose a method for human action recognition, one that can localize the spatiotemporal regions that `define' the actions. This is a challenging task due to the subtlety of human actions in video and the co-occurrence of contextual…

Computer Vision and Pattern Recognition · Computer Science 2019-04-12 Yang Wang , Vinh Tran , Gedas Bertasius , Lorenzo Torresani , Minh Hoai

Long-short Term Motion Feature for Action Classification and Retrieval

We propose a method for representing motion information for video classification and retrieval. We improve upon local descriptor based methods that have been among the most popular and successful models for representing videos. The desired…

Computer Vision and Pattern Recognition · Computer Science 2015-02-17 Zhenzhong Lan , Xuanchong Li , Ming Lin , Alexander G. Hauptmann

Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning

Prior works on action representation learning mainly focus on designing various architectures to extract the global representations for short video clips. In contrast, many practical applications such as video alignment have strong demand…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Minghao Chen , Fangyun Wei , Chong Li , Deng Cai

Modeling long-term interactions to enhance action recognition

In this paper, we propose a new approach to under-stand actions in egocentric videos that exploits the semantics of object interactions at both frame and temporal levels. At the frame level, we use a region-based approach that takes as…

Computer Vision and Pattern Recognition · Computer Science 2021-04-26 Alejandro Cartas , Petia Radeva , Mariella Dimiccoli

Learning Linear Dynamical Systems with High-Order Tensor Data for Skeleton based Action Recognition

In recent years, there has been renewed interest in developing methods for skeleton-based human action recognition. A skeleton sequence can be naturally represented as a high-order tensor time series. In this paper, we model and analyze…

Computer Vision and Pattern Recognition · Computer Science 2017-01-17 Wenwen Ding , Kai Liu

An Information-rich Sampling Technique over Spatio-Temporal CNN for Classification of Human Actions in Videos

We propose a novel scheme for human action recognition in videos, using a 3-dimensional Convolutional Neural Network (3D CNN) based classifier. Traditionally in deep learning based human activity recognition approaches, either a few random…

Computer Vision and Pattern Recognition · Computer Science 2020-02-10 S. H. Shabbeer Basha , Viswanath Pulabaigari , Snehasis Mukherjee

Sympathy for the Details: Dense Trajectories and Hybrid Classification Architectures for Action Recognition

Action recognition in videos is a challenging task due to the complexity of the spatio-temporal patterns to model and the difficulty to acquire and learn on large quantities of video data. Deep learning, although a breakthrough for image…

Computer Vision and Pattern Recognition · Computer Science 2016-08-26 César Roberto de Souza , Adrien Gaidon , Eleonora Vig , Antonio Manuel López

Action Classification and Highlighting in Videos

Inspired by recent advances in neural machine translation, that jointly align and translate using encoder-decoder networks equipped with attention, we propose an attentionbased LSTM model for human activity recognition. Our model jointly…

Computer Vision and Pattern Recognition · Computer Science 2017-09-01 Atousa Torabi , Leonid Sigal

MAC: Mining Activity Concepts for Language-based Temporal Localization

We address the problem of language-based temporal localization in untrimmed videos. Compared to temporal localization with fixed categories, this problem is more challenging as the language-based queries not only have no pre-defined…

Computer Vision and Pattern Recognition · Computer Science 2018-11-26 Runzhou Ge , Jiyang Gao , Kan Chen , Ram Nevatia

OpenCL-accelerated object classification in video streams using Spatial Pooler of Hierarchical Temporal Memory

We present a method to classify objects in video streams using a brain-inspired Hierarchical Temporal Memory (HTM) algorithm. Object classification is a challenging task where humans still significantly outperform machine learning…

Computer Vision and Pattern Recognition · Computer Science 2017-03-03 Maciej Wielgosz , Marcin Pietroń