Related papers: Post-Processing Temporal Action Detection

Towards High-Quality Temporal Action Detection with Sparse Proposals

Temporal Action Detection (TAD) is an essential and challenging topic in video understanding, aiming to localize the temporal segments containing human action instances and predict the action categories. The previous works greatly rely upon…

Computer Vision and Pattern Recognition · Computer Science 2021-09-21 Jiannan Wu , Peize Sun , Shoufa Chen , Jiewen Yang , Zihao Qi , Lan Ma , Ping Luo

Faster-TAD: Towards Temporal Action Detection with Proposal Generation and Classification in a Unified Network

Temporal action detection (TAD) aims to detect the semantic labels and boundaries of action instances in untrimmed videos. Current mainstream approaches are multi-step solutions, which fall short in efficiency and flexibility. In this…

Computer Vision and Pattern Recognition · Computer Science 2022-04-07 Shimin Chen , Chen Chen , Wei Li , Xunqiang Tao , Yandong Guo

Temporal Action Detection Model Compression by Progressive Block Drop

Temporal action detection (TAD) aims to identify and localize action instances in untrimmed videos, which is essential for various video understanding tasks. However, recent improvements in model performance, driven by larger feature…

Computer Vision and Pattern Recognition · Computer Science 2025-03-24 Xiaoyong Chen , Yong Guo , Jiaming Liang , Sitong Zhuang , Runhao Zeng , Xiping Hu

Progression-Guided Temporal Action Detection in Videos

We present a novel framework, Action Progression Network (APN), for temporal action detection (TAD) in videos. The framework locates actions in videos by detecting the action evolution process. To encode the action evolution, we quantify a…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Chongkai Lu , Man-Wai Mak , Ruimin Li , Zheru Chi , Hong Fu

Harnessing Temporal Causality for Advanced Temporal Action Detection

As a fundamental task in long-form video understanding, temporal action detection (TAD) aims to capture inherent temporal relations in untrimmed videos and identify candidate actions with precise boundaries. Over the years, various…

Computer Vision and Pattern Recognition · Computer Science 2024-07-29 Shuming Liu , Lin Sui , Chen-Lin Zhang , Fangzhou Mu , Chen Zhao , Bernard Ghanem

Introducing Gating and Context into Temporal Action Detection

Temporal Action Detection (TAD), the task of localizing and classifying actions in untrimmed video, remains challenging due to action overlaps and variable action durations. Recent findings suggest that TAD performance is dependent on the…

Computer Vision and Pattern Recognition · Computer Science 2024-09-09 Aglind Reka , Diana Laura Borza , Dominick Reilly , Michal Balazia , Francois Bremond

End-to-end Temporal Action Detection with Transformer

Temporal action detection (TAD) aims to determine the semantic label and the temporal interval of every action instance in an untrimmed video. It is a fundamental and challenging task in video understanding. Previous methods tackle this…

Computer Vision and Pattern Recognition · Computer Science 2022-08-12 Xiaolong Liu , Qimeng Wang , Yao Hu , Xu Tang , Shiwei Zhang , Song Bai , Xiang Bai

An Empirical Study of End-to-End Temporal Action Detection

Temporal action detection (TAD) is an important yet challenging task in video understanding. It aims to simultaneously predict the semantic label and the temporal interval of every action instance in an untrimmed video. Rather than…

Computer Vision and Pattern Recognition · Computer Science 2022-04-07 Xiaolong Liu , Song Bai , Xiang Bai

Semi-Supervised Temporal Action Detection with Proposal-Free Masking

Existing temporal action detection (TAD) methods rely on a large number of training data with segment-level annotations. Collecting and annotating such a training set is thus highly expensive and unscalable. Semi-supervised TAD (SS-TAD)…

Computer Vision and Pattern Recognition · Computer Science 2022-07-15 Sauradip Nag , Xiatian Zhu , Yi-Zhe Song , Tao Xiang

Temporal Action Localization with Enhanced Instant Discriminability

Temporal action detection (TAD) aims to detect all action boundaries and their corresponding categories in an untrimmed video. The unclear boundaries of actions in videos often result in imprecise predictions of action boundaries by…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Dingfeng Shi , Qiong Cao , Yujie Zhong , Shan An , Jian Cheng , Haogang Zhu , Dacheng Tao

Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning

Existing temporal action detection (TAD) methods rely on generating an overwhelmingly large number of proposals per video. This leads to complex model designs due to proposal generation and/or per-proposal action instance evaluation and the…

Computer Vision and Pattern Recognition · Computer Science 2022-08-22 Sauradip Nag , Xiatian Zhu , Yi-Zhe Song , Tao Xiang

LiquidTAD: Efficient Temporal Action Detection via Parallel Liquid-Inspired Temporal Relaxation

Temporal Action Detection (TAD) requires precise localization of action boundaries within long, untrimmed video sequences. While current high-performing methods achieve strong accuracy, they are often characterized by excessive parameter…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Zepeng Sun , Naichuan Zheng , Hailun Xia , Junjie Wu , Liwei Bao , Xiaotai Zhang

ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection

Temporal action detection (TAD) involves the localization and classification of action instances within untrimmed videos. While standard TAD follows fully supervised learning with closed-set setting on large training data, recent zero-shot…

Computer Vision and Pattern Recognition · Computer Science 2023-11-08 Thinh Phan , Khoa Vo , Duy Le , Gianfranco Doretto , Donald Adjeroh , Ngan Le

OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection

Temporal action detection (TAD) is a fundamental video understanding task that aims to identify human actions and localize their temporal boundaries in videos. Although this field has achieved remarkable progress in recent years, further…

Computer Vision and Pattern Recognition · Computer Science 2025-02-28 Shuming Liu , Chen Zhao , Fatimah Zohra , Mattia Soldan , Alejandro Pardo , Mengmeng Xu , Lama Alssum , Merey Ramazanova , Juan León Alcázar , Anthony Cioppa , Silvio Giancola , Carlos Hinojosa , Bernard Ghanem

TadML: A fast temporal action detection with Mechanics-MLP

Temporal Action Detection(TAD) is a crucial but challenging task in video understanding.It is aimed at detecting both the type and start-end frame for each action instance in a long, untrimmed video.Most current models adopt both RGB and…

Computer Vision and Pattern Recognition · Computer Science 2024-02-05 Bowen Deng , Dongchang Liu

One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features

Open-vocabulary Temporal Action Detection (Open-vocab TAD) is an advanced video analysis approach that expands Closed-vocabulary Temporal Action Detection (Closed-vocab TAD) capabilities. Closed-vocab TAD is typically confined to localizing…

Computer Vision and Pattern Recognition · Computer Science 2024-05-01 Trung Thanh Nguyen , Yasutomo Kawanishi , Takahiro Komamizu , Ichiro Ide

Single Shot Temporal Action Detection

Temporal action detection is a very important yet challenging problem, since videos in real applications are usually long, untrimmed and contain multiple action instances. This problem requires not only recognizing action categories but…

Computer Vision and Pattern Recognition · Computer Science 2017-10-18 Tianwei Lin , Xu Zhao , Zheng Shou

Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions

Temporal action detection (TAD) aims to locate action positions and recognize action categories in long-term untrimmed videos. Although many methods have achieved promising results, their robustness has not been thoroughly studied. In…

Computer Vision and Pattern Recognition · Computer Science 2024-04-01 Runhao Zeng , Xiaoyong Chen , Jiaming Liang , Huisi Wu , Guangzhong Cao , Yong Guo

DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion

We propose a new formulation of temporal action detection (TAD) with denoising diffusion, DiffTAD in short. Taking as input random temporal proposals, it can yield action proposals accurately given an untrimmed long video. This presents a…

Computer Vision and Pattern Recognition · Computer Science 2023-07-17 Sauradip Nag , Xiatian Zhu , Jiankang Deng , Yi-Zhe Song , Tao Xiang

SegTAD: Precise Temporal Action Detection via Semantic Segmentation

Temporal action detection (TAD) is an important yet challenging task in video analysis. Most existing works draw inspiration from image object detection and tend to reformulate it as a proposal generation - classification problem. However,…

Computer Vision and Pattern Recognition · Computer Science 2022-03-04 Chen Zhao , Merey Ramazanova , Mengmeng Xu , Bernard Ghanem