Related papers: Weakly-supervised Action Localization with Backgro…

Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling

Weakly-supervised action localization aims to recognize and localize action instances in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning (MIL), where the predictions of unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Guiqin Wang, Peng Zhao, Cong Zhao, Shusen Yang, Jie Cheng, Luziwei Leng, Jianxing Liao, Qinghai Guo

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and…

Computer Vision and Pattern Recognition · Computer Science 2020-01-23 Ashraful Islam, Richard J. Radke

Weakly-supervised Temporal Action Localization by Uncertainty Modeling

Weakly-supervised temporal action localization aims to learn detecting temporal intervals of action classes with only video-level labels. To this end, it is crucial to separate frames of action classes from the background frames (i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2020-12-18 Pilhyeon Lee, Jinglu Wang, Yan Lu, Hyeran Byun

Learning Transferable Self-attentive Representations for Action Recognition in Untrimmed Videos with Weak Supervision

Action recognition in videos has attracted a lot of attention in the past decade. In order to learn robust models, previous methods usually assume videos are trimmed as short sequences and require ground-truth annotations of each video…

Computer Vision and Pattern Recognition · Computer Science 2019-02-21 Xiao-Yu Zhang, Haichao Shi, Changsheng Li, Kai Zheng, Xiaobin Zhu, Lixin Duan

Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization

We tackle the problem of localizing temporal intervals of actions with only a single frame label for each action instance for training. Owing to label sparsity, existing work fails to learn action completeness, resulting in fragmentary…

Computer Vision and Pattern Recognition · Computer Science 2021-08-12 Pilhyeon Lee, Hyeran Byun

Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks

We present a method for weakly-supervised action localization based on graph convolutions. In order to find and classify video time segments that correspond to relevant action classes, a system must be able to both identify discriminative…

Computer Vision and Pattern Recognition · Computer Science 2020-02-05 Maheen Rashid, Hedvig Kjellström, Yong Jae Lee

Forcing the Whole Video as Background: An Adversarial Learning Strategy for Weakly Temporal Action Localization

With video-level labels, weakly supervised temporal action localization (WTAL) applies a localization-by-classification paradigm to detect and classify the action in untrimmed videos. Due to the characteristic of classification,…

Computer Vision and Pattern Recognition · Computer Science 2022-07-15 Ziqiang Li, Yongxin Ge, Jiaruo Yu, Zhongming Chen

Weakly-Supervised Action Localization by Generative Attention Modeling

Weakly-supervised temporal action localization is a problem of learning an action localization model with only video-level action labeling available. The general framework largely relies on the classification activation, which employs an…

Computer Vision and Pattern Recognition · Computer Science 2020-03-31 Baifeng Shi, Qi Dai, Yadong Mu, Jingdong Wang

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

Weakly-supervised Temporal Action Localization (WS-TAL) methods learn to localize temporal starts and ends of action instances in a video under only video-level supervision. Existing WS-TAL methods rely on deep features learned for action…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Ziyi Liu, Le Wang, Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua

Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to identify and localize the action instances in the untrimmed videos with only video-level action labels. When humans watch videos, we can adapt our abstract-level knowledge about actions…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Xijun Wang, Aggelos K. Katsaggelos

Action Unit Memory Network for Weakly Supervised Temporal Action Localization

Weakly supervised temporal action localization aims to detect and localize actions in untrimmed videos with only video-level labels during training. However, without frame-level annotations, it is challenging to achieve localization…

Computer Vision and Pattern Recognition · Computer Science 2021-04-30 Wang Luo, Tianzhu Zhang, Wenfei Yang, Jingen Liu, Tao Mei, Feng Wu, Yongdong Zhang

Few-Shot Action Localization without Knowing Boundaries

Learning to localize actions in long, cluttered, and untrimmed videos is a hard task that in the literature has typically been addressed assuming the availability of large amounts of annotated training samples for each class -- either in a…

Computer Vision and Pattern Recognition · Computer Science 2021-09-24 Ting-Ting Xie, Christos Tzelepis, Fan Fu, Ioannis Patras

Weakly Supervised Action Localization by Sparse Temporal Pooling Network

We propose a weakly supervised temporal action localization algorithm on untrimmed videos using convolutional neural networks. Our algorithm learns from video-level class labels and predicts temporal intervals of human actions with no…

Computer Vision and Pattern Recognition · Computer Science 2018-04-04 Phuc Nguyen, Ting Liu, Gautam Prasad, Bohyung Han

Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization

Temporally localizing activities within untrimmed videos has been extensively studied in recent years. Despite recent advances, existing methods for weakly-supervised temporal activity localization struggle to recognize when an activity is…

Computer Vision and Pattern Recognition · Computer Science 2020-07-15 Kyle Min, Jason J. Corso

A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization

Weakly supervised temporal action localization is a challenging vision task due to the absence of ground-truth temporal locations of actions in the training videos. With only video-level supervision during training, most existing methods…

Computer Vision and Pattern Recognition · Computer Science 2021-03-26 Ashraful Islam, Chengjiang Long, Richard Radke

Background Suppression Network for Weakly-supervised Temporal Action Localization

Weakly-supervised temporal action localization is a very challenging problem because frame-wise labels are not given in the training stage while the only hint is video-level labels: whether each video contains action frames of interest.…

Computer Vision and Pattern Recognition · Computer Science 2019-11-25 Pilhyeon Lee, Youngjung Uh, Hyeran Byun

Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling

Weakly-Supervised Temporal Action Localization (WS-TAL) task aims to recognize and localize temporal starts and ends of action instances in an untrimmed video with only video-level label supervision. Due to lack of negative samples of…

Computer Vision and Pattern Recognition · Computer Science 2021-06-23 Xiang Wang, Zhiwu Qing, Ziyuan Huang, Yutong Feng, Shiwei Zhang, Jianwen Jiang, Mingqian Tang, Yuanjie Shao, Nong Sang

Pose-Aware Weakly-Supervised Action Segmentation

Understanding human behavior is an important problem in the pursuit of visual intelligence. A challenge in this endeavor is the extensive and costly effort required to accurately label action segments. To address this issue, we consider…

Computer Vision and Pattern Recognition · Computer Science 2025-04-09 Seth Z. Zhao, Reza Ghoddoosian, Isht Dwivedi, Nakul Agarwal, Behzad Dariush

Background-Click Supervision for Temporal Action Localization

Weakly supervised temporal action localization aims at learning the instance-level action pattern from the video-level labels, where a significant challenge is action-context confusion. To overcome this challenge, one recent work builds an…

Computer Vision and Pattern Recognition · Computer Science 2021-11-25 Le Yang, Junwei Han, Tao Zhao, Tianwei Lin, Dingwen Zhang, Jianxin Chen

Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning

Weakly-supervised action localization requires training a model to localize the action segments in the video given only video-level action labels. It can be solved under the Multiple Instance Learning (MIL) framework, where a bag (video)…

Computer Vision and Pattern Recognition · Computer Science 2020-12-23 Zhekun Luo, Devin Guillory, Baifeng Shi, Wei Ke, Fang Wan, Trevor Darrell, Huijuan Xu