Related papers: Adversarial Seeded Sequence Growing for Weakly-Sup…

Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to localize actions in untrimmed videos with only video-level action category labels. Most of previous methods ignore the incompleteness issue of Class Activation Sequences (CAS),…

Computer Vision and Pattern Recognition · Computer Science 2021-04-07 Chen Ju , Peisen Zhao , Siheng Chen , Ya Zhang , Xiaoyun Zhang , Qi Tian

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and…

Computer Vision and Pattern Recognition · Computer Science 2020-01-23 Ashraful Islam , Richard J. Radke

Weakly-Supervised Temporal Action Localization by Inferring Salient Snippet-Feature

Weakly-supervised temporal action localization aims to locate action regions and identify action categories in untrimmed videos simultaneously by taking only video-level labels as the supervision. Pseudo label generation is a promising…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Wulian Yun , Mengshi Qi , Chuanming Wang , Huadong Ma

Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector

Weakly supervised temporal action detection is a Herculean task in understanding untrimmed videos, since no supervisory signal except the video-level category label is available on training data. Under the supervision of category labels,…

Computer Vision and Pattern Recognition · Computer Science 2018-07-19 Jia-Xing Zhong , Nannan Li , Weijie Kong , Tao Zhang , Thomas H. Li , Ge Li

Cascaded Pyramid Mining Network for Weakly Supervised Temporal Action Localization

Weakly supervised temporal action localization, which aims at temporally locating action instances in untrimmed videos using only video-level class labels during training, is an important yet challenging problem in video analysis. Many…

Computer Vision and Pattern Recognition · Computer Science 2018-10-30 Haisheng Su , Xu Zhao , Tianwei Lin

ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to recognize and localize action segments in untrimmed videos given only video-level action labels for training. Without the boundary information of action segments, existing methods…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Bo He , Xitong Yang , Le Kang , Zhiyu Cheng , Xin Zhou , Abhinav Shrivastava

Weakly Supervised Action Localization by Sparse Temporal Pooling Network

We propose a weakly supervised temporal action localization algorithm on untrimmed videos using convolutional neural networks. Our algorithm learns from video-level class labels and predicts temporal intervals of human actions with no…

Computer Vision and Pattern Recognition · Computer Science 2018-04-04 Phuc Nguyen , Ting Liu , Gautam Prasad , Bohyung Han

Weakly Supervised Action Selection Learning in Video

Localizing actions in video is a core task in computer vision. The weakly supervised temporal localization problem investigates whether this task can be adequately solved with only video-level labels, significantly reducing the amount of…

Computer Vision and Pattern Recognition · Computer Science 2021-05-07 Junwei Ma , Satya Krishna Gorti , Maksims Volkovs , Guangwei Yu

Pose-Aware Weakly-Supervised Action Segmentation

Understanding human behavior is an important problem in the pursuit of visual intelligence. A challenge in this endeavor is the extensive and costly effort required to accurately label action segments. To address this issue, we consider…

Computer Vision and Pattern Recognition · Computer Science 2025-04-09 Seth Z. Zhao , Reza Ghoddoosian , Isht Dwivedi , Nakul Agarwal , Behzad Dariush

Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling

Weakly-supervised action localization aims to recognize and localize action instancese in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning(MIL), where the predictions of unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Guiqin Wang , Peng Zhao , Cong Zhao , Shusen Yang , Jie Cheng , Luziwei Leng , Jianxing Liao , Qinghai Guo

Transferable Knowledge-Based Multi-Granularity Aggregation Network for Temporal Action Localization: Submission to ActivityNet Challenge 2021

This technical report presents an overview of our solution used in the submission to 2021 HACS Temporal Action Localization Challenge on both Supervised Learning Track and Weakly-Supervised Learning Track. Temporal Action Localization (TAL)…

Computer Vision and Pattern Recognition · Computer Science 2021-07-28 Haisheng Su , Peiqin Zhuang , Yukun Li , Dongliang Wang , Weihao Gan , Wei Wu , Yu Qiao

Learning Transferable Self-attentive Representations for Action Recognition in Untrimmed Videos with Weak Supervision

Action recognition in videos has attracted a lot of attention in the past decade. In order to learn robust models, previous methods usually assume videos are trimmed as short sequences and require ground-truth annotations of each video…

Computer Vision and Pattern Recognition · Computer Science 2019-02-21 Xiao-Yu Zhang , Haichao Shi , Changsheng Li , Kai Zheng , Xiaobin Zhu , Lixin Duan

Weakly Supervised Temporal Action Localization with Segment-Level Labels

Temporal action localization presents a trade-off between test performance and annotation-time cost. Fully supervised methods achieve good performance with time-consuming boundary annotations. Weakly supervised methods with cheaper…

Computer Vision and Pattern Recognition · Computer Science 2020-07-06 Xinpeng Ding , Nannan Wang , Xinbo Gao , Jie Li , Xiaoyu Wang , Tongliang Liu

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

Weakly-supervised Temporal Action Localization (WS-TAL) methods learn to localize temporal starts and ends of action instances in a video under only video-level supervision. Existing WS-TAL methods rely on deep features learned for action…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Ziyi Liu , Le Wang , Wei Tang , Junsong Yuan , Nanning Zheng , Gang Hua

Weakly-supervised Action Localization with Background Modeling

We describe a latent approach that learns to detect actions in long sequences given training videos with only whole-video class labels. Our approach makes use of two innovations to attention-modeling in weakly-supervised learning. First,…

Computer Vision and Pattern Recognition · Computer Science 2019-08-20 Phuc Xuan Nguyen , Deva Ramanan , Charless C. Fowlkes

Action Sensitivity Learning for Temporal Action Localization

Temporal action localization (TAL), which involves recognizing and locating action instances, is a challenging task in video understanding. Most existing approaches directly predict action classes and regress offsets to boundaries, while…

Computer Vision and Pattern Recognition · Computer Science 2023-09-14 Jiayi Shao , Xiaohan Wang , Ruijie Quan , Junjun Zheng , Jiang Yang , Yi Yang

2by2: Weakly-Supervised Learning for Global Action Segmentation

This paper presents a simple yet effective approach for the poorly investigated task of global action segmentation, aiming at grouping frames capturing the same action across videos of different activities. Unlike the case of videos…

Computer Vision and Pattern Recognition · Computer Science 2024-12-18 Elena Bueno-Benito , Mariella Dimiccoli

UntrimmedNets for Weakly Supervised Action Recognition and Detection

Current action recognition methods heavily rely on trimmed videos for model training. However, it is expensive and time-consuming to acquire a large-scale trimmed video dataset. This paper presents a new weakly supervised architecture,…

Computer Vision and Pattern Recognition · Computer Science 2017-05-23 Limin Wang , Yuanjun Xiong , Dahua Lin , Luc Van Gool

Temporal Action Detection with Structured Segment Networks

Detecting actions in untrimmed videos is an important yet challenging task. In this paper, we present the structured segment network (SSN), a novel framework which models the temporal structure of each action instance via a structured…

Computer Vision and Pattern Recognition · Computer Science 2017-09-19 Yue Zhao , Yuanjun Xiong , Limin Wang , Zhirong Wu , Xiaoou Tang , Dahua Lin

Spatio-Temporal Action Localization in a Weakly Supervised Setting

Enabling computational systems with the ability to localize actions in video-based content has manifold applications. Traditionally, such a problem is approached in a fully-supervised setting where video-clips with complete frame-by-frame…

Computer Vision and Pattern Recognition · Computer Science 2019-05-07 Kurt Degiorgio , Fabio Cuzzolin