Related papers: Weakly Supervised Action Localization by Sparse Te…

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and…

Computer Vision and Pattern Recognition · Computer Science 2020-01-23 Ashraful Islam , Richard J. Radke

Weakly-Supervised Temporal Action Localization by Inferring Salient Snippet-Feature

Weakly-supervised temporal action localization aims to locate action regions and identify action categories in untrimmed videos simultaneously by taking only video-level labels as the supervision. Pseudo label generation is a promising…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Wulian Yun , Mengshi Qi , Chuanming Wang , Huadong Ma

Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks

We present a method for weakly-supervised action localization based on graph convolutions. In order to find and classify video time segments that correspond to relevant action classes, a system must be able to both identify discriminative…

Computer Vision and Pattern Recognition · Computer Science 2020-02-05 Maheen Rashid , Hedvig Kjellström , Yong Jae Lee

Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling

Weakly-supervised action localization aims to recognize and localize action instancese in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning(MIL), where the predictions of unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Guiqin Wang , Peng Zhao , Cong Zhao , Shusen Yang , Jie Cheng , Luziwei Leng , Jianxing Liao , Qinghai Guo

Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation

Weakly supervised temporal action localization aims to localize temporal boundaries of actions and simultaneously identify their categories with only video-level category labels. Many existing methods seek to generate pseudo labels for…

Computer Vision and Pattern Recognition · Computer Science 2022-03-15 Linjiang Huang , Liang Wang , Hongsheng Li

Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to identify and localize the action instances in the untrimmed videos with only video-level action labels. When humans watch videos, we can adapt our abstract-level knowledge about actions…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Xijun Wang , Aggelos K. Katsaggelos

W-TALC: Weakly-supervised Temporal Activity Localization and Classification

Most activity localization methods in the literature suffer from the burden of frame-wise annotation requirement. Learning from weak labels may be a potential solution towards reducing such manual labeling effort. Recent years have…

Computer Vision and Pattern Recognition · Computer Science 2018-12-18 Sujoy Paul , Sourya Roy , Amit K Roy-Chowdhury

ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to recognize and localize action segments in untrimmed videos given only video-level action labels for training. Without the boundary information of action segments, existing methods…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Bo He , Xitong Yang , Le Kang , Zhiyu Cheng , Xin Zhou , Abhinav Shrivastava

Few-Shot Action Localization without Knowing Boundaries

Learning to localize actions in long, cluttered, and untrimmed videos is a hard task, that in the literature has typically been addressed assuming the availability of large amounts of annotated training samples for each class -- either in a…

Computer Vision and Pattern Recognition · Computer Science 2021-09-24 Ting-Ting Xie , Christos Tzelepis , Fan Fu , Ioannis Patras

Temporal Action Localization with Multi-temporal Scales

Temporal action localization plays an important role in video analysis, which aims to localize and classify actions in untrimmed videos. The previous methods often predict actions on a feature space of a single-temporal scale. However, the…

Computer Vision and Pattern Recognition · Computer Science 2022-08-17 Zan Gao , Xinglei Cui , Tao Zhuo , Zhiyong Cheng , An-An Liu , Meng Wang , Shenyong Chen

Weakly Supervised Semantic Segmentation using Web-Crawled Videos

We propose a novel algorithm for weakly supervised semantic segmentation based on image-level class labels only. In weakly supervised setting, it is commonly observed that trained model overly focuses on discriminative parts rather than the…

Computer Vision and Pattern Recognition · Computer Science 2018-01-09 Seunghoon Hong , Donghun Yeo , Suha Kwak , Honglak Lee , Bohyung Han

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

Weakly-supervised Temporal Action Localization (WS-TAL) methods learn to localize temporal starts and ends of action instances in a video under only video-level supervision. Existing WS-TAL methods rely on deep features learned for action…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Ziyi Liu , Le Wang , Wei Tang , Junsong Yuan , Nanning Zheng , Gang Hua

Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs

We address temporal action localization in untrimmed long videos. This is important because videos in real applications are usually unconstrained and contain multiple action instances plus video content of background scenes or other…

Computer Vision and Pattern Recognition · Computer Science 2016-04-25 Zheng Shou , Dongang Wang , Shih-Fu Chang

Weakly-supervised Action Localization with Background Modeling

We describe a latent approach that learns to detect actions in long sequences given training videos with only whole-video class labels. Our approach makes use of two innovations to attention-modeling in weakly-supervised learning. First,…

Computer Vision and Pattern Recognition · Computer Science 2019-08-20 Phuc Xuan Nguyen , Deva Ramanan , Charless C. Fowlkes

Weakly Supervised Gaussian Networks for Action Detection

Detecting temporal extents of human actions in videos is a challenging computer vision problem that requires detailed manual supervision including frame-level labels. This expensive annotation process limits deploying action detectors to a…

Computer Vision and Pattern Recognition · Computer Science 2020-01-07 Basura Fernando , Cheston Tan Yin Chet , Hakan Bilen

Spatio-Temporal Action Localization in a Weakly Supervised Setting

Enabling computational systems with the ability to localize actions in video-based content has manifold applications. Traditionally, such a problem is approached in a fully-supervised setting where video-clips with complete frame-by-frame…

Computer Vision and Pattern Recognition · Computer Science 2019-05-07 Kurt Degiorgio , Fabio Cuzzolin

Learning Temporal Action Proposals With Fewer Labels

Temporal action proposals are a common module in action detection pipelines today. Most current methods for training action proposal modules rely on fully supervised approaches that require large amounts of annotated temporal action…

Computer Vision and Pattern Recognition · Computer Science 2019-10-04 Jingwei Ji , Kaidi Cao , Juan Carlos Niebles

3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization

Temporal action localization is a challenging computer vision problem with numerous real-world applications. Most existing methods require laborious frame-level supervision to train action localization models. In this work, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2019-11-19 Sanath Narayan , Hisham Cholakkal , Fahad Shahbaz Khan , Ling Shao

UntrimmedNets for Weakly Supervised Action Recognition and Detection

Current action recognition methods heavily rely on trimmed videos for model training. However, it is expensive and time-consuming to acquire a large-scale trimmed video dataset. This paper presents a new weakly supervised architecture,…

Computer Vision and Pattern Recognition · Computer Science 2017-05-23 Limin Wang , Yuanjun Xiong , Dahua Lin , Luc Van Gool

AutoLoc: Weakly-supervised Temporal Action Localization

Temporal Action Localization (TAL) in untrimmed video is important for many applications. But it is very expensive to annotate the segment-level ground truth (action class and temporal boundary). This raises the interest of addressing TAL…

Computer Vision and Pattern Recognition · Computer Science 2018-12-18 Zheng Shou , Hang Gao , Lei Zhang , Kazuyuki Miyazawa , Shih-Fu Chang