Related papers: CTAP: Complementary Temporal Action Proposal Gener…

Deep Point-wise Prediction for Action Temporal Proposal

Detecting actions in videos is an important yet challenging task. Previous works usually utilize (a) sliding window paradigms, or (b) per-frame action scoring and grouping to enumerate the possible temporal locations. Their performances are…

Computer Vision and Pattern Recognition · Computer Science 2019-09-18 Luxuan Li, Tao Kong, Fuchun Sun, Huaping Liu

Faster-TAD: Towards Temporal Action Detection with Proposal Generation and Classification in a Unified Network

Temporal action detection (TAD) aims to detect the semantic labels and boundaries of action instances in untrimmed videos. Current mainstream approaches are multi-step solutions, which fall short in efficiency and flexibility. In this…

Computer Vision and Pattern Recognition · Computer Science 2022-04-07 Shimin Chen, Chen Chen, Wei Li, Xunqiang Tao, Yandong Guo

Multi-granularity Generator for Temporal Action Proposal

Temporal action proposal generation is an important task, aiming to localize the video segments containing human actions in an untrimmed video. In this paper, we propose a multi-granularity generator (MGG) to perform the temporal action…

Computer Vision and Pattern Recognition · Computer Science 2019-04-15 Yuan Liu, Lin Ma, Yifeng Zhang, Wei Liu, Shih-Fu Chang

Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020

This technical report presents an overview of our solution used in the submission to ActivityNet Challenge 2020 Task 1 (temporal action localization/detection). Temporal action localization requires to not only precisely locate the…

Computer Vision and Pattern Recognition · Computer Science 2020-08-27 Haisheng Su, Jinyuan Feng, Hao Shao, Zhenyu Jiang, Manyuan Zhang, Wei Wu, Yu Liu, Hongsheng Li, Junjie Yan

Temporal Action Proposal Generation with Background Constraint

Temporal action proposal generation (TAPG) is a challenging task that aims to locate action instances in untrimmed videos with temporal boundaries. To evaluate the confidence of proposals, the existing works typically predict action score…

Computer Vision and Pattern Recognition · Computer Science 2021-12-16 Haosen Yang, Wenhao Wu, Lining Wang, Sheng Jin, Boyang Xia, Hongxun Yao, Hujie Huang

Context-aware Proposal Network for Temporal Action Detection

This technical report presents our first place winning solution for temporal action detection task in CVPR-2022 ActivityNet Challenge. The task aims to localize temporal boundaries of action instances with specific classes in long…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Xiang Wang, Huaxin Zhang, Shiwei Zhang, Changxin Gao, Yuanjie Shao, Nong Sang

Towards High-Quality Temporal Action Detection with Sparse Proposals

Temporal Action Detection (TAD) is an essential and challenging topic in video understanding, aiming to localize the temporal segments containing human action instances and predict the action categories. The previous works greatly rely upon…

Computer Vision and Pattern Recognition · Computer Science 2021-09-21 Jiannan Wu, Peize Sun, Shoufa Chen, Jiewen Yang, Zihao Qi, Lan Ma, Ping Luo

DCAN: Improving Temporal Action Detection via Dual Context Aggregation

Temporal action detection aims to locate the boundaries of action in the video. The current method based on boundary matching enumerates and calculates all possible boundary matchings to generate proposals. However, these methods neglect…

Computer Vision and Pattern Recognition · Computer Science 2021-12-08 Guo Chen, Yin-Dong Zheng, Limin Wang, Tong Lu

Proposal-based Temporal Action Localization with Point-level Supervision

Point-level supervised temporal action localization (PTAL) aims at recognizing and localizing actions in untrimmed videos where only a single point (frame) within every action instance is annotated in training data. Without temporal…

Computer Vision and Pattern Recognition · Computer Science 2023-10-10 Yuan Yin, Yifei Huang, Ryosuke Furuta, Yoichi Sato

Introducing Gating and Context into Temporal Action Detection

Temporal Action Detection (TAD), the task of localizing and classifying actions in untrimmed video, remains challenging due to action overlaps and variable action durations. Recent findings suggest that TAD performance is dependent on the…

Computer Vision and Pattern Recognition · Computer Science 2024-09-09 Aglind Reka, Diana Laura Borza, Dominick Reilly, Michal Balazia, Francois Bremond

Boundary-Aware Proposal Generation Method for Temporal Action Localization

The goal of Temporal Action Localization (TAL) is to find the categories and temporal boundaries of actions in an untrimmed video. Most TAL methods rely heavily on action recognition models that are sensitive to action labels rather than…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Hao Zhang, Chunyan Feng, Jiahui Yang, Zheng Li, Caili Guo

TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals

Temporal Action Proposal (TAP) generation is an important problem, as fast and accurate extraction of semantically important (e.g. human actions) segments from untrimmed videos is an important step for large-scale video analysis. We propose…

Computer Vision and Pattern Recognition · Computer Science 2017-08-08 Jiyang Gao, Zhenheng Yang, Chen Sun, Kan Chen, Ram Nevatia

SegTAD: Precise Temporal Action Detection via Semantic Segmentation

Temporal action detection (TAD) is an important yet challenging task in video analysis. Most existing works draw inspiration from image object detection and tend to reformulate it as a proposal generation - classification problem. However,…

Computer Vision and Pattern Recognition · Computer Science 2022-03-04 Chen Zhao, Merey Ramazanova, Mengmeng Xu, Bernard Ghanem

Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses

Point-Level temporal action localization (PTAL) aims to localize actions in untrimmed videos with only one timestamp annotation for each action instance. Existing methods adopt the frame-level prediction paradigm to learn from the sparse…

Computer Vision and Pattern Recognition · Computer Science 2020-12-16 Chen Ju, Peisen Zhao, Ya Zhang, Yanfeng Wang, Qi Tian

Temporal Context Aggregation Network for Temporal Action Proposal Refinement

Temporal action proposal generation aims to estimate temporal intervals of actions in untrimmed videos, which is a challenging yet important task in the video understanding field. The proposals generated by current methods still suffer from…

Computer Vision and Pattern Recognition · Computer Science 2021-03-25 Zhiwu Qing, Haisheng Su, Weihao Gan, Dongliang Wang, Wei Wu, Xiang Wang, Yu Qiao, Junjie Yan, Changxin Gao, Nong Sang

Post-Processing Temporal Action Detection

Existing Temporal Action Detection (TAD) methods typically take a pre-processing step in converting an input varying-length video into a fixed-length snippet representation sequence, before temporal boundary estimation and action…

Computer Vision and Pattern Recognition · Computer Science 2023-03-06 Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang

Temporal Action Segmentation with High-level Complex Activity Labels

The temporal action segmentation task segments videos temporally and predicts action labels for all frames. Fully supervising such a segmentation model requires dense frame-wise action annotations, which are expensive and tedious to…

Computer Vision and Pattern Recognition · Computer Science 2022-12-20 Guodong Ding, Angela Yao

Searching Action Proposals via Spatial Actionness Estimation and Temporal Path Inference and Tracking

In this paper, we address the problem of searching action proposals in unconstrained video clips. Our approach starts from actionness estimation on frame-level bounding boxes, and then aggregates the bounding boxes belonging to the same…

Computer Vision and Pattern Recognition · Computer Science 2016-08-24 Nannan Li, Dan Xu, Zhenqiang Ying, Zhihao Li, Ge Li

A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning

Existing action detection algorithms usually generate action proposals through an extensive search over the video at multiple temporal scales, which brings about huge computational overhead and deviates from the human perception procedure.…

Computer Vision and Pattern Recognition · Computer Science 2017-06-23 Jingjia Huang, Nannan Li, Tao Zhang, Ge Li

Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation

Temporal action proposal generation (TAPG) is a fundamental and challenging task in video understanding, especially in temporal action detection. Most previous works focus on capturing the local temporal context and can well locate simple…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Shuning Chang, Pichao Wang, Fan Wang, Hao Li, Jiashi Feng