Related papers: Sub-action Prototype Learning for Point-level Weak…

Proposal-based Temporal Action Localization with Point-level Supervision

Point-level supervised temporal action localization (PTAL) aims at recognizing and localizing actions in untrimmed videos where only a single point (frame) within every action instance is annotated in training data. Without temporal…

Computer Vision and Pattern Recognition · Computer Science 2023-10-10 Yuan Yin , Yifei Huang , Ryosuke Furuta , Yoichi Sato

Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses

Point-Level temporal action localization (PTAL) aims to localize actions in untrimmed videos with only one timestamp annotation for each action instance. Existing methods adopt the frame-level prediction paradigm to learn from the sparse…

Computer Vision and Pattern Recognition · Computer Science 2020-12-16 Chen Ju , Peisen Zhao , Ya Zhang , Yanfeng Wang , Qi Tian

Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization

Alleviating noisy pseudo labels remains a key challenge in Semi-Supervised Temporal Action Localization (SS-TAL). Existing methods often filter pseudo labels based on strict conditions, but they typically assess classification and…

Computer Vision and Pattern Recognition · Computer Science 2024-07-26 Feixiang Zhou , Bryan Williams , Hossein Rahmani

POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization

This paper tackles the challenge of point-supervised temporal action detection, wherein only a single frame is annotated for each action instance in the training set. Most of the current methods, hindered by the sparse nature of annotated…

Computer Vision and Pattern Recognition · Computer Science 2024-06-07 Elahe Vahdani , Yingli Tian

Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning

Weakly Supervised Temporal Action Localization (WSTAL) aims to localize and classify action instances in long untrimmed videos with only video-level category labels. Due to the lack of snippet-level supervision for indicating action…

Computer Vision and Pattern Recognition · Computer Science 2024-09-09 Jia-Run Du , Jia-Chang Feng , Kun-Yu Lin , Fa-Ting Hong , Xiao-Ming Wu , Zhongang Qi , Ying Shan , Wei-Shi Zheng

Exploring the Temporal Consistency for Point-Level Weakly-Supervised Temporal Action Localization

Point-supervised Temporal Action Localization (PTAL) adopts a lightly frame-annotated paradigm (\textit{i.e.}, labeling only a single frame per action instance) to train a model to effectively locate action instances within untrimmed…

Computer Vision and Pattern Recognition · Computer Science 2026-02-06 Yunchuan Ma , Laiyun Qing , Guorong Li , Yuqing Liu , Yuankai Qi , Qingming Huang

Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization

Weakly supervised temporal action localization (WTAL) aims to detect action instances in untrimmed videos using only video-level annotations. Since many existing works optimize WTAL models based on action classification labels, they…

Computer Vision and Pattern Recognition · Computer Science 2024-08-13 Geuntaek Lim , Hyunwoo Kim , Joonsoo Kim , Yukyung Choi

Full-Stage Pseudo Label Quality Enhancement for Weakly-supervised Temporal Action Localization

Weakly-supervised Temporal Action Localization (WSTAL) aims to localize actions in untrimmed videos using only video-level supervision. Latest WSTAL methods introduce pseudo label learning framework to bridge the gap between…

Computer Vision and Pattern Recognition · Computer Science 2024-07-15 Qianhan Feng , Wenshuo Li , Tong Lin , Xinghao Chen

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

Weakly-supervised Temporal Action Localization (WS-TAL) methods learn to localize temporal starts and ends of action instances in a video under only video-level supervision. Existing WS-TAL methods rely on deep features learned for action…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Ziyi Liu , Le Wang , Wei Tang , Junsong Yuan , Nanning Zheng , Gang Hua

Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels

The task of weakly supervised temporal action localization targets at generating temporal boundaries for actions of interest, meanwhile the action category should also be classified. Pseudo-label-based methods, which serve as an effective…

Computer Vision and Pattern Recognition · Computer Science 2023-04-18 Jingqiu Zhou , Linjiang Huang , Liang Wang , Si Liu , Hongsheng Li

Weakly-Supervised Temporal Action Localization with Bidirectional Semantic Consistency Constraint

Weakly Supervised Temporal Action Localization (WTAL) aims to classify and localize temporal boundaries of actions for the video, given only video-level category labels in the training datasets. Due to the lack of boundary information…

Computer Vision and Pattern Recognition · Computer Science 2023-04-26 Guozhang Li , De Cheng , Xinpeng Ding , Nannan Wang , Jie Li , Xinbo Gao

Towards Train-Test Consistency for Semi-supervised Temporal Action Localization

Recently, Weakly-supervised Temporal Action Localization (WTAL) has been densely studied but there is still a large gap between weakly-supervised models and fully-supervised models. It is practical and intuitive to annotate temporal…

Computer Vision and Pattern Recognition · Computer Science 2020-03-24 Xudong Lin , Zheng Shou , Shih-Fu Chang

ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to recognize and localize action segments in untrimmed videos given only video-level action labels for training. Without the boundary information of action segments, existing methods…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Bo He , Xitong Yang , Le Kang , Zhiyu Cheng , Xin Zhou , Abhinav Shrivastava

Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to localize and recognize actions in untrimmed videos with only video-level category labels during training. Without instance-level annotations, most existing methods follow the…

Computer Vision and Pattern Recognition · Computer Science 2023-05-30 Huan Ren , Wenfei Yang , Tianzhu Zhang , Yongdong Zhang

Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation

Weakly supervised temporal action localization aims to localize temporal boundaries of actions and simultaneously identify their categories with only video-level category labels. Many existing methods seek to generate pseudo labels for…

Computer Vision and Pattern Recognition · Computer Science 2022-03-15 Linjiang Huang , Liang Wang , Hongsheng Li

Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization (WTAL) learns to detect and classify action instances with only category labels. Most methods widely adopt the off-the-shelf Classification-Based Pre-training (CBP) to generate video features…

Computer Vision and Pattern Recognition · Computer Science 2022-12-20 Chen Ju , Kunhao Zheng , Jinxiang Liu , Peisen Zhao , Ya Zhang , Jianlong Chang , Yanfeng Wang , Qi Tian

W-TALC: Weakly-supervised Temporal Activity Localization and Classification

Most activity localization methods in the literature suffer from the burden of frame-wise annotation requirement. Learning from weak labels may be a potential solution towards reducing such manual labeling effort. Recent years have…

Computer Vision and Pattern Recognition · Computer Science 2018-12-18 Sujoy Paul , Sourya Roy , Amit K Roy-Chowdhury

Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation

Recent weakly supervised semantic segmentation (WSSS) methods strive to incorporate contextual knowledge to improve the completeness of class activation maps (CAM). In this work, we argue that the knowledge bias between instances and…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Feilong Tang , Zhongxing Xu , Zhaojun Qu , Wei Feng , Xingjian Jiang , Zongyuan Ge

Rethinking the Route Towards Weakly Supervised Object Localization

Weakly supervised object localization (WSOL) aims to localize objects with only image-level labels. Previous methods often try to utilize feature maps and classification weights to localize objects using image level annotations indirectly.…

Computer Vision and Pattern Recognition · Computer Science 2020-03-04 Chen-Lin Zhang , Yun-Hao Cao , Jianxin Wu

Improving Weakly Supervised Temporal Action Localization by Exploiting Multi-resolution Information in Temporal Domain

Weakly supervised temporal action localization is a challenging task as only the video-level annotation is available during the training process. To address this problem, we propose a two-stage approach to fully exploit multi-resolution…

Computer Vision and Pattern Recognition · Computer Science 2025-06-24 Rui Su , Dong Xu , Luping Zhou , Wanli Ouyang