Related papers: Learning Action Completeness from Points for Weakl…

Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction

Pseudo-label learning methods have been widely applied in weakly-supervised temporal action localization. Existing works directly utilize a weakly-supervised base model to generate instance-level pseudo-labels for training the…

Computer Vision and Pattern Recognition · Computer Science 2025-05-01 Quan Zhang, Yuxin Qi, Xi Tang, Rui Yuan, Xi Lin, Ke Zhang, Chun Yuan

Weakly Supervised Temporal Action Localization with Segment-Level Labels

Temporal action localization presents a trade-off between test performance and annotation-time cost. Fully supervised methods achieve good performance with time-consuming boundary annotations. Weakly supervised methods with cheaper…

Computer Vision and Pattern Recognition · Computer Science 2020-07-06 Xinpeng Ding, Nannan Wang, Xinbo Gao, Jie Li, Xiaoyu Wang, Tongliang Liu

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and…

Computer Vision and Pattern Recognition · Computer Science 2020-01-23 Ashraful Islam, Richard J. Radke

Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels

The task of weakly supervised temporal action localization aims to generate temporal boundaries for actions of interest while also classifying the action category. Pseudo-label-based methods, which serve as an effective…

Computer Vision and Pattern Recognition · Computer Science 2023-04-18 Jingqiu Zhou, Linjiang Huang, Liang Wang, Si Liu, Hongsheng Li

Weakly-supervised Action Localization with Background Modeling

We describe a latent approach that learns to detect actions in long sequences given training videos with only whole-video class labels. Our approach makes use of two innovations to attention-modeling in weakly-supervised learning. First,…

Computer Vision and Pattern Recognition · Computer Science 2019-08-20 Phuc Xuan Nguyen, Deva Ramanan, Charless C. Fowlkes

Learning Temporal Action Proposals With Fewer Labels

Temporal action proposals are a common module in action detection pipelines today. Most current methods for training action proposal modules rely on fully supervised approaches that require large amounts of annotated temporal action…

Computer Vision and Pattern Recognition · Computer Science 2019-10-04 Jingwei Ji, Kaidi Cao, Juan Carlos Niebles

Localizing Actions from Video Labels and Pseudo-Annotations

The goal of this paper is to determine the spatio-temporal location of actions in video. Where training from hard to obtain box annotations is the norm, we propose an intuitive and effective algorithm that localizes actions from their class…

Computer Vision and Pattern Recognition · Computer Science 2017-12-14 Pascal Mettes, Cees G. M. Snoek, Shih-Fu Chang

Spatio-Temporal Action Localization in a Weakly Supervised Setting

Enabling computational systems with the ability to localize actions in video-based content has manifold applications. Traditionally, such a problem is approached in a fully-supervised setting where video-clips with complete frame-by-frame…

Computer Vision and Pattern Recognition · Computer Science 2019-05-07 Kurt Degiorgio, Fabio Cuzzolin

Weakly-Supervised Temporal Action Localization by Inferring Salient Snippet-Feature

Weakly-supervised temporal action localization aims to locate action regions and identify action categories in untrimmed videos simultaneously by taking only video-level labels as the supervision. Pseudo label generation is a promising…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Wulian Yun, Mengshi Qi, Chuanming Wang, Huadong Ma

Spatio-Temporal Instance Learning: Action Tubes from Class Supervision

The goal of this work is spatio-temporal action localization in videos, using only the supervision from video-level class labels. The state-of-the-art casts this weakly-supervised action localization regime as a Multiple Instance Learning…

Computer Vision and Pattern Recognition · Computer Science 2018-11-26 Pascal Mettes, Cees G. M. Snoek

Weakly-supervised Temporal Action Localization by Uncertainty Modeling

Weakly-supervised temporal action localization aims to learn to detect temporal intervals of action classes with only video-level labels. To this end, it is crucial to separate frames of action classes from the background frames (i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2020-12-18 Pilhyeon Lee, Jinglu Wang, Yan Lu, Hyeran Byun

Weakly-Supervised Dense Action Anticipation

Dense anticipation aims to forecast future actions and their durations for long horizons. Existing approaches rely on fully-labelled data, i.e. sequences labelled with all future actions and their durations. We present a (semi-) weakly…

Computer Vision and Pattern Recognition · Computer Science 2021-11-16 Haotong Zhang, Fuhai Chen, Angela Yao

Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation

Weakly supervised temporal action localization aims to localize temporal boundaries of actions and simultaneously identify their categories with only video-level category labels. Many existing methods seek to generate pseudo labels for…

Computer Vision and Pattern Recognition · Computer Science 2022-03-15 Linjiang Huang, Liang Wang, Hongsheng Li

Temporal Action Segmentation from Timestamp Supervision

Temporal action segmentation approaches have been very successful recently. However, annotating videos with frame-wise labels to train such models is very expensive and time consuming. While weakly supervised methods trained using only…

Computer Vision and Pattern Recognition · Computer Science 2021-03-29 Zhe Li, Yazan Abu Farha, Juergen Gall

Proposal-based Temporal Action Localization with Point-level Supervision

Point-level supervised temporal action localization (PTAL) aims at recognizing and localizing actions in untrimmed videos where only a single point (frame) within every action instance is annotated in training data. Without temporal…

Computer Vision and Pattern Recognition · Computer Science 2023-10-10 Yuan Yin, Yifei Huang, Ryosuke Furuta, Yoichi Sato

Weakly-Supervised Completion Moment Detection using Temporal Attention

Monitoring the progression of an action towards completion offers fine grained insight into the actor's behaviour. In this work, we target detecting the completion moment of actions, that is the moment when the action's goal has been…

Computer Vision and Pattern Recognition · Computer Science 2019-10-23 Farnoosh Heidarivincheh, Majid Mirmehdi, Dima Damen

Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling

Weakly-supervised action localization aims to recognize and localize action instances in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning (MIL), where the predictions of unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Guiqin Wang, Peng Zhao, Cong Zhao, Shusen Yang, Jie Cheng, Luziwei Leng, Jianxing Liao, Qinghai Guo

Few-Shot Action Localization without Knowing Boundaries

Learning to localize actions in long, cluttered, and untrimmed videos is a hard task that, in the literature, has typically been addressed assuming the availability of large amounts of annotated training samples for each class -- either in a…

Computer Vision and Pattern Recognition · Computer Science 2021-09-24 Ting-Ting Xie, Christos Tzelepis, Fan Fu, Ioannis Patras

3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization

Temporal action localization is a challenging computer vision problem with numerous real-world applications. Most existing methods require laborious frame-level supervision to train action localization models. In this work, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2019-11-19 Sanath Narayan, Hisham Cholakkal, Fahad Shahbaz Khan, Ling Shao

Weakly-guided Self-supervised Pretraining for Temporal Activity Detection

Temporal Activity Detection aims to predict activity classes per frame, in contrast to video-level predictions in Activity Classification (i.e., Activity Recognition). Due to the expensive frame-level annotations required for detection, the…

Computer Vision and Pattern Recognition · Computer Science 2023-02-07 Kumara Kahatapitiya, Zhou Ren, Haoxiang Li, Zhenyu Wu, Michael S. Ryoo, Gang Hua