Related papers: SMART Frame Selection for Action Recognition

Search-Map-Search: A Frame Selection Paradigm for Action Recognition

Despite the success of deep learning in video understanding tasks, processing every frame in a video is computationally expensive and often unnecessary in real-time applications. Frame selection aims to extract the most informative and…

Computer Vision and Pattern Recognition · Computer Science 2023-04-21 Mingjun Zhao, Yakun Yu, Xiaoli Wang, Lei Yang, Di Niu

AR-Net: Adaptive Frame Resolution for Efficient Action Recognition

Action recognition is an open and challenging problem in computer vision. While current state-of-the-art models offer excellent recognition results, their computational expense limits their impact for many real-world applications. In this…

Computer Vision and Pattern Recognition · Computer Science 2020-08-03 Yue Meng, Chung-Ching Lin, Rameswar Panda, Prasanna Sattigeri, Leonid Karlinsky, Aude Oliva, Kate Saenko, Rogerio Feris

Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration

Training an effective video action recognition model poses significant computational challenges, particularly under limited resource budgets. Current methods primarily aim to either reduce model size or utilize pre-trained models, limiting…

Computer Vision and Pattern Recognition · Computer Science 2023-07-28 Harry Cheng, Yangyang Guo, Liqiang Nie, Zhiyong Cheng, Mohan Kankanhalli

Learning Transferable Self-attentive Representations for Action Recognition in Untrimmed Videos with Weak Supervision

Action recognition in videos has attracted a lot of attention in the past decade. In order to learn robust models, previous methods usually assume videos are trimmed as short sequences and require ground-truth annotations of each video…

Computer Vision and Pattern Recognition · Computer Science 2019-02-21 Xiao-Yu Zhang, Haichao Shi, Changsheng Li, Kai Zheng, Xiaobin Zhu, Lixin Duan

Selective Feature Compression for Efficient Activity Recognition Inference

Most action recognition solutions rely on dense sampling to precisely cover the informative temporal clip. Extensively searching the temporal region is expensive for a real-world application. In this work, we focus on improving the inference…

Computer Vision and Pattern Recognition · Computer Science 2021-07-30 Chunhui Liu, Xinyu Li, Hao Chen, Davide Modolo, Joseph Tighe

AdaFrame: Adaptive Frame Selection for Fast Video Recognition

We present AdaFrame, a framework that adaptively selects relevant frames on a per-input basis for fast video recognition. AdaFrame contains a Long Short-Term Memory network augmented with a global memory that provides context information…

Computer Vision and Pattern Recognition · Computer Science 2019-04-11 Zuxuan Wu, Caiming Xiong, Chih-Yao Ma, Richard Socher, Larry S. Davis

Challenge report: VIPriors Action Recognition Challenge

This paper is a brief report on our submission to the VIPriors Action Recognition Challenge. Action recognition has attracted many researchers' attention for its full application, but it is still challenging. In this paper, we study previous…

Computer Vision and Pattern Recognition · Computer Science 2020-07-17 Zhipeng Luo, Dawei Xu, Zhiguang Zhang

Action Machine: Rethinking Action Recognition in Trimmed Videos

Existing methods in video action recognition mostly do not distinguish human body from the environment and easily overfit the scenes and objects. In this work, we present a conceptually simple, general and high-performance framework for…

Computer Vision and Pattern Recognition · Computer Science 2018-12-18 Jiagang Zhu, Wei Zou, Liang Xu, Yiming Hu, Zheng Zhu, Manyu Chang, Junjie Huang, Guan Huang, Dalong Du

No frame left behind: Full Video Action Recognition

Not all video frames are equally informative for recognizing an action. It is computationally infeasible to train deep networks on all video frames when actions develop over hundreds of frames. A common heuristic is uniformly sampling a…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Xin Liu, Silvia L. Pintea, Fatemeh Karimi Nejadasl, Olaf Booij, Jan C. van Gemert

Dynamic Inference: A New Approach Toward Efficient Video Action Recognition

Though action recognition in videos has achieved great success recently, it remains a challenging task due to the massive computational cost. Designing lightweight networks is a possible solution, but it may degrade the recognition…

Computer Vision and Pattern Recognition · Computer Science 2020-02-11 Wenhao Wu, Dongliang He, Xiao Tan, Shifeng Chen, Yi Yang, Shilei Wen

SCSampler: Sampling Salient Clips from Video for Efficient Action Recognition

While many action recognition datasets consist of collections of brief, trimmed videos each containing a relevant action, videos in the real world (e.g., on YouTube) exhibit very different properties: they are often several minutes long,…

Computer Vision and Pattern Recognition · Computer Science 2019-09-02 Bruno Korbar, Du Tran, Lorenzo Torresani

Action Recognition in Untrimmed Videos with Composite Self-Attention Two-Stream Framework

With the rapid development of deep learning algorithms, action recognition in video has achieved many important research results. One issue in action recognition, Zero-Shot Action Recognition (ZSAR), has recently attracted considerable…

Computer Vision and Pattern Recognition · Computer Science 2020-04-24 Dong Cao, Lisha Xu, HaiBo Chen

Skimming and Scanning for Untrimmed Video Action Recognition

Video action recognition (VAR) is a primary task of video understanding, and untrimmed videos are more common in real-life scenes. Untrimmed videos have redundant and diverse clips containing contextual information, so sampling dense clips…

Computer Vision and Pattern Recognition · Computer Science 2021-04-22 Yunyan Hong, Ailing Zeng, Min Li, Cewu Lu, Li Jiang, Qiang Xu

Reinforcement Learning Based Sparse Black-box Adversarial Attack on Video Recognition Models

We explore the black-box adversarial attack on video recognition models. Attacks are only performed on selected key regions and key frames to reduce the high computation cost of searching adversarial perturbations on a video due to its high…

Cryptography and Security · Computer Science 2021-09-01 Zeyuan Wang, Chaofeng Sha, Su Yang

Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space

Given a video with $T$ frames, frame sampling is a task to select $N \ll T$ frames, so as to maximize the performance of a fixed video classifier. Not just brute-force search, but most existing methods suffer from its vast search space of…

Computer Vision and Pattern Recognition · Computer Science 2025-10-21 Junho Lee, Jeongwoo Shin, Seung Woo Ko, Seongsu Ha, Joonseok Lee

Feature Sampling Strategies for Action Recognition

Although dense local spatial-temporal features with bag-of-features representation achieve state-of-the-art performance for action recognition, the huge feature number and feature size prevent current methods from scaling up to real size…

Computer Vision and Pattern Recognition · Computer Science 2015-01-29 Youjie Zhou, Hongkai Yu, Song Wang

Online Learnable Keyframe Extraction in Videos and its Application with Semantic Word Vector in Action Recognition

Video processing has become a popular research direction in computer vision due to its various applications such as video summarization, action recognition, etc. Recently, deep learning-based methods have achieved impressive results in…

Computer Vision and Pattern Recognition · Computer Science 2020-09-29 G M Mashrur E Elahi, Yee-Hong Yang

Budget-Aware Activity Detection with A Recurrent Policy Network

In this paper, we address the challenging problem of efficient temporal activity detection in untrimmed long videos. While most recent work has focused on and advanced the detection accuracy, the inference time can take seconds to minutes in…

Computer Vision and Pattern Recognition · Computer Science 2018-05-09 Behrooz Mahasseni, Xiaodong Yang, Pavlo Molchanov, Jan Kautz

Memory Group Sampling Based Online Action Recognition Using Kinetic Skeleton Features

Online action recognition is an important task for human-centered intelligent services, which is still difficult to achieve due to the varieties and uncertainties of spatial and temporal scales of human actions. In this paper, we propose…

Computer Vision and Pattern Recognition · Computer Science 2020-11-04 Guoliang Liu, Qinghui Zhang, Yichao Cao, Junwei Li, Hao Wu, Guohui Tian

Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning

Recent incremental learning for action recognition usually stores representative videos to mitigate catastrophic forgetting. However, only a few bulky videos can be stored due to the limited memory. To address this problem, we propose…

Computer Vision and Pattern Recognition · Computer Science 2022-11-03 Yixuan Pei, Zhiwu Qing, Jun Cen, Xiang Wang, Shiwei Zhang, Yaxiong Wang, Mingqian Tang, Nong Sang, Xueming Qian