Related papers: RefineLoc: Iterative Refinement for Weakly-Supervi…

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and…

Computer Vision and Pattern Recognition · Computer Science 2020-01-23 Ashraful Islam, Richard J. Radke

Weakly-Supervised Temporal Action Localization by Inferring Salient Snippet-Feature

Weakly-supervised temporal action localization aims to locate action regions and identify action categories in untrimmed videos simultaneously by taking only video-level labels as the supervision. Pseudo label generation is a promising…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Wulian Yun, Mengshi Qi, Chuanming Wang, Huadong Ma

AutoLoc: Weakly-supervised Temporal Action Localization

Temporal Action Localization (TAL) in untrimmed video is important for many applications. But it is very expensive to annotate the segment-level ground truth (action class and temporal boundary). This raises the interest of addressing TAL…

Computer Vision and Pattern Recognition · Computer Science 2018-12-18 Zheng Shou, Hang Gao, Lei Zhang, Kazuyuki Miyazawa, Shih-Fu Chang

W-TALC: Weakly-supervised Temporal Activity Localization and Classification

Most activity localization methods in the literature suffer from the burden of frame-wise annotation requirement. Learning from weak labels may be a potential solution towards reducing such manual labeling effort. Recent years have…

Computer Vision and Pattern Recognition · Computer Science 2018-12-18 Sujoy Paul, Sourya Roy, Amit K Roy-Chowdhury

Improving Weakly Supervised Temporal Action Localization by Exploiting Multi-resolution Information in Temporal Domain

Weakly supervised temporal action localization is a challenging task as only the video-level annotation is available during the training process. To address this problem, we propose a two-stage approach to fully exploit multi-resolution…

Computer Vision and Pattern Recognition · Computer Science 2025-06-24 Rui Su, Dong Xu, Luping Zhou, Wanli Ouyang

Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation

Weakly supervised temporal action localization aims to localize temporal boundaries of actions and simultaneously identify their categories with only video-level category labels. Many existing methods seek to generate pseudo labels for…

Computer Vision and Pattern Recognition · Computer Science 2022-03-15 Linjiang Huang, Liang Wang, Hongsheng Li

Weakly-supervised Action Localization with Background Modeling

We describe a latent approach that learns to detect actions in long sequences given training videos with only whole-video class labels. Our approach makes use of two innovations to attention-modeling in weakly-supervised learning. First,…

Computer Vision and Pattern Recognition · Computer Science 2019-08-20 Phuc Xuan Nguyen, Deva Ramanan, Charless C. Fowlkes

Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector

Weakly supervised temporal action detection is a Herculean task in understanding untrimmed videos, since no supervisory signal except the video-level category label is available on training data. Under the supervision of category labels,…

Computer Vision and Pattern Recognition · Computer Science 2018-07-19 Jia-Xing Zhong, Nannan Li, Weijie Kong, Tao Zhang, Thomas H. Li, Ge Li

Mining Forgery Traces from Reconstruction Error: A Weakly Supervised Framework for Multimodal Deepfake Temporal Localization

Modern deepfakes have evolved into localized and intermittent manipulations that require fine-grained temporal localization to mitigate severe digital security risks. The prohibitive cost of frame-level annotation makes weakly supervised…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Midou Guo, Qilin Yin, Wei Lu, Rui Yang

POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization

This paper tackles the challenge of point-supervised temporal action detection, wherein only a single frame is annotated for each action instance in the training set. Most of the current methods, hindered by the sparse nature of annotated…

Computer Vision and Pattern Recognition · Computer Science 2024-06-07 Elahe Vahdani, Yingli Tian

Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction

Pseudo-label learning methods have been widely applied in weakly-supervised temporal action localization. Existing works directly utilize a weakly-supervised base model to generate instance-level pseudo-labels for training the…

Computer Vision and Pattern Recognition · Computer Science 2025-05-01 Quan Zhang, Yuxin Qi, Xi Tang, Rui Yuan, Xi Lin, Ke Zhang, Chun Yuan

Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling

Weakly-supervised action localization aims to recognize and localize action instances in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning (MIL), where the predictions of unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Guiqin Wang, Peng Zhao, Cong Zhao, Shusen Yang, Jie Cheng, Luziwei Leng, Jianxing Liao, Qinghai Guo

Weakly Supervised Action Localization by Sparse Temporal Pooling Network

We propose a weakly supervised temporal action localization algorithm on untrimmed videos using convolutional neural networks. Our algorithm learns from video-level class labels and predicts temporal intervals of human actions with no…

Computer Vision and Pattern Recognition · Computer Science 2018-04-04 Phuc Nguyen, Ting Liu, Gautam Prasad, Bohyung Han

TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos

Temporal localization in untrimmed videos, which aims to identify specific timestamps, is crucial for video understanding but remains challenging. This task encompasses several subtasks, including temporal action localization, temporal…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Chen-Lin Zhang, Lin Sui, Shuming Liu, Fangzhou Mu, Zhangcheng Wang, Bernard Ghanem

Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to identify and localize the action instances in the untrimmed videos with only video-level action labels. When humans watch videos, we can adapt our abstract-level knowledge about actions…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Xijun Wang, Aggelos K. Katsaggelos

Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks

We present a method for weakly-supervised action localization based on graph convolutions. In order to find and classify video time segments that correspond to relevant action classes, a system must be able to both identify discriminative…

Computer Vision and Pattern Recognition · Computer Science 2020-02-05 Maheen Rashid, Hedvig Kjellström, Yong Jae Lee

3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization

Temporal action localization is a challenging computer vision problem with numerous real-world applications. Most existing methods require laborious frame-level supervision to train action localization models. In this work, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2019-11-19 Sanath Narayan, Hisham Cholakkal, Fahad Shahbaz Khan, Ling Shao

Precise Temporal Action Localization by Evolving Temporal Proposals

Locating actions in long untrimmed videos has been a challenging problem in video content analysis. The performances of existing action localization approaches remain unsatisfactory in precisely determining the beginning and the end of an…

Computer Vision and Pattern Recognition · Computer Science 2018-06-27 Haonan Qiu, Yingbin Zheng, Hao Ye, Yao Lu, Feng Wang, Liang He

Towards Train-Test Consistency for Semi-supervised Temporal Action Localization

Recently, Weakly-supervised Temporal Action Localization (WTAL) has been densely studied but there is still a large gap between weakly-supervised models and fully-supervised models. It is practical and intuitive to annotate temporal…

Computer Vision and Pattern Recognition · Computer Science 2020-03-24 Xudong Lin, Zheng Shou, Shih-Fu Chang

Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions

Action understanding has evolved into the era of fine granularity, as most human behaviors in real life have only minor differences. To detect these fine-grained actions accurately in a label-efficient way, we tackle the problem of…

Computer Vision and Pattern Recognition · Computer Science 2022-07-26 Zhi Li, Lu He, Huijuan Xu