Related papers: Precise Temporal Action Localization by Evolving T…

Temporal Action Localization with Multi-temporal Scales

Temporal action localization plays an important role in video analysis, which aims to localize and classify actions in untrimmed videos. The previous methods often predict actions on a feature space of a single-temporal scale. However, the…

Computer Vision and Pattern Recognition · Computer Science 2022-08-17 Zan Gao , Xinglei Cui , Tao Zhuo , Zhiyong Cheng , An-An Liu , Meng Wang , Shenyong Chen

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and…

Computer Vision and Pattern Recognition · Computer Science 2020-01-23 Ashraful Islam , Richard J. Radke

Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs

We address temporal action localization in untrimmed long videos. This is important because videos in real applications are usually unconstrained and contain multiple action instances plus video content of background scenes or other…

Computer Vision and Pattern Recognition · Computer Science 2016-04-25 Zheng Shou , Dongang Wang , Shih-Fu Chang

Action Search: Spotting Actions in Videos and Its Application to Temporal Action Localization

State-of-the-art temporal action detectors inefficiently search the entire video for specific actions. Despite the encouraging progress these methods achieve, it is crucial to design automated approaches that only explore parts of the video…

Computer Vision and Pattern Recognition · Computer Science 2018-07-30 Humam Alwassel , Fabian Caba Heilbron , Bernard Ghanem

Exploring Feature Representation and Training strategies in Temporal Action Localization

Temporal action localization has recently attracted significant interest in the Computer Vision community. However, despite the great progress, it is hard to identify which aspects of the proposed methods contribute most to the increase in…

Computer Vision and Pattern Recognition · Computer Science 2019-05-30 Tingting Xie , Xiaoshan Yang , Tianzhu Zhang , Changsheng Xu , Ioannis Patras

A Novel Online Action Detection Framework from Untrimmed Video Streams

Online temporal action localization from an untrimmed video stream is a challenging problem in computer vision. It is challenging because of i) in an untrimmed video stream, more than one action instance may appear, including background…

Computer Vision and Pattern Recognition · Computer Science 2020-03-18 Da-Hye Yoon , Nam-Gyu Cho , Seong-Whan Lee

Temporal Action Localization by Structured Maximal Sums

We address the problem of temporal action localization in videos. We pose action localization as a structured prediction over arbitrary-length temporal windows, where each window is scored as the sum of frame-wise classification scores.…

Computer Vision and Pattern Recognition · Computer Science 2017-04-18 Zehuan Yuan , Jonathan C. Stroud , Tong Lu , Jia Deng

Temporal Action Localization using Long Short-Term Dependency

Temporal action localization in untrimmed videos is an important but difficult task. Difficulties are encountered in the application of existing methods when modeling temporal structures of videos. In the present study, we developed a novel…

Computer Vision and Pattern Recognition · Computer Science 2019-11-05 Yuan Zhou , Hongru Li , Sun-Yuan Kung

Temporal Fusion Network for Temporal Action Localization:Submission to ActivityNet Challenge 2020 (Task E)

This technical report analyzes a temporal action localization method we used in the HACS competition which is hosted in Activitynet Challenge 2020.The goal of our task is to locate the start time and end time of the action in the untrimmed…

Computer Vision and Pattern Recognition · Computer Science 2020-06-16 Zhiwu Qing , Xiang Wang , Yongpeng Sang , Changxin Gao , Shiwei Zhang , Nong Sang

Efficient Action Detection in Untrimmed Videos via Multi-Task Learning

This paper studies the joint learning of action recognition and temporal localization in long, untrimmed videos. We employ a multi-task learning framework that performs the three highly related steps of action proposal, action recognition,…

Computer Vision and Pattern Recognition · Computer Science 2017-04-05 Yi Zhu , Shawn Newsam

PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization

Temporal action localization is an important and challenging task that aims to locate temporal regions in real-world untrimmed videos where actions occur and recognize their classes. It is widely acknowledged that video context is a…

Computer Vision and Pattern Recognition · Computer Science 2021-03-10 Xin Qin , Hanbin Zhao , Guangchen Lin , Hao Zeng , Songcen Xu , Xi Li

A Pursuit of Temporal Accuracy in General Activity Detection

Detecting activities in untrimmed videos is an important but challenging task. The performance of existing methods remains unsatisfactory, e.g., they often meet difficulties in locating the beginning and end of a long complex action. In…

Computer Vision and Pattern Recognition · Computer Science 2017-03-09 Yuanjun Xiong , Yue Zhao , Limin Wang , Dahua Lin , Xiaoou Tang

Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020

This technical report presents an overview of our solution used in the submission to ActivityNet Challenge 2020 Task 1 (\textbf{temporal action localization/detection}). Temporal action localization requires to not only precisely locate the…

Computer Vision and Pattern Recognition · Computer Science 2020-08-27 Haisheng Su , Jinyuan Feng , Hao Shao , Zhenyu Jiang , Manyuan Zhang , Wei Wu , Yu Liu , Hongsheng Li , Junjie Yan

Temporal Context Network for Activity Localization in Videos

We present a Temporal Context Network (TCN) for precise temporal localization of human activities. Similar to the Faster-RCNN architecture, proposals are placed at equal intervals in a video which span multiple temporal scales. We propose a…

Computer Vision and Pattern Recognition · Computer Science 2017-08-09 Xiyang Dai , Bharat Singh , Guyue Zhang , Larry S. Davis , Yan Qiu Chen

Scale Matters: Temporal Scale Aggregation Network for Precise Action Localization in Untrimmed Videos

Temporal action localization is a recently-emerging task, aiming to localize video segments from untrimmed videos that contain specific actions. Despite the remarkable recent progress, most two-stage action localization methods still suffer…

Computer Vision and Pattern Recognition · Computer Science 2019-08-05 Guoqiang Gong , Liangfeng Zheng , Kun Bai , Yadong Mu

Weakly Supervised Action Localization by Sparse Temporal Pooling Network

We propose a weakly supervised temporal action localization algorithm on untrimmed videos using convolutional neural networks. Our algorithm learns from video-level class labels and predicts temporal intervals of human actions with no…

Computer Vision and Pattern Recognition · Computer Science 2018-04-04 Phuc Nguyen , Ting Liu , Gautam Prasad , Bohyung Han

Progression-Guided Temporal Action Detection in Videos

We present a novel framework, Action Progression Network (APN), for temporal action detection (TAD) in videos. The framework locates actions in videos by detecting the action evolution process. To encode the action evolution, we quantify a…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Chongkai Lu , Man-Wai Mak , Ruimin Li , Zheru Chi , Hong Fu

Rethinking the Faster R-CNN Architecture for Temporal Action Localization

We propose TAL-Net, an improved approach to temporal action localization in video that is inspired by the Faster R-CNN object detection framework. TAL-Net addresses three key shortcomings of existing approaches: (1) we improve receptive…

Computer Vision and Pattern Recognition · Computer Science 2018-04-23 Yu-Wei Chao , Sudheendra Vijayanarasimhan , Bryan Seybold , David A. Ross , Jia Deng , Rahul Sukthankar

RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization

Video action detectors are usually trained using datasets with fully-supervised temporal annotations. Building such datasets is an expensive task. To alleviate this problem, recent methods have tried to leverage weak labeling, where videos…

Computer Vision and Pattern Recognition · Computer Science 2020-11-10 Alejandro Pardo , Humam Alwassel , Fabian Caba Heilbron , Ali Thabet , Bernard Ghanem

Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

Accurate temporal action proposals play an important role in detecting actions from untrimmed videos. The existing approaches have difficulties in capturing global contextual information and simultaneously localizing actions with different…

Computer Vision and Pattern Recognition · Computer Science 2020-03-10 Jialin Gao , Zhixiang Shi , Jiani Li , Guanshuo Wang , Yufeng Yuan , Shiming Ge , Xi Zhou