Related papers: Equivalent Classification Mapping for Weakly Super…

Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector

Weakly supervised temporal action detection is a Herculean task in understanding untrimmed videos, since no supervisory signal except the video-level category label is available on training data. Under the supervision of category labels,…

Computer Vision and Pattern Recognition · Computer Science 2018-07-19 Jia-Xing Zhong , Nannan Li , Weijie Kong , Tao Zhang , Thomas H. Li , Ge Li

Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning

Weakly-supervised action localization requires training a model to localize the action segments in the video given only video level action label. It can be solved under the Multiple Instance Learning (MIL) framework, where a bag (video)…

Computer Vision and Pattern Recognition · Computer Science 2020-12-23 Zhekun Luo , Devin Guillory , Baifeng Shi , Wei Ke , Fang Wan , Trevor Darrell , Huijuan Xu

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and…

Computer Vision and Pattern Recognition · Computer Science 2020-01-23 Ashraful Islam , Richard J. Radke

Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning

Supervised object detection and semantic segmentation require object or even pixel level annotations. When there exist image level labels only, it is challenging for weakly supervised algorithms to achieve accurate predictions. The accuracy…

Computer Vision and Pattern Recognition · Computer Science 2018-03-06 Weifeng Ge , Sibei Yang , Yizhou Yu

Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling

Weakly-supervised action localization aims to recognize and localize action instancese in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning(MIL), where the predictions of unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Guiqin Wang , Peng Zhao , Cong Zhao , Shusen Yang , Jie Cheng , Luziwei Leng , Jianxing Liao , Qinghai Guo

Improving action segmentation via explicit similarity measurement

Existing supervised action segmentation methods depend on the quality of frame-wise classification using attention mechanisms or temporal convolutions to capture temporal dependencies. Even boundary detection-based methods primarily depend…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Kamel Aouaidjia , Wenhao Zhang , Aofan Li , Chongsheng Zhang

Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks

We present a method for weakly-supervised action localization based on graph convolutions. In order to find and classify video time segments that correspond to relevant action classes, a system must be able to both identify discriminative…

Computer Vision and Pattern Recognition · Computer Science 2020-02-05 Maheen Rashid , Hedvig Kjellström , Yong Jae Lee

Weakly-Supervised Action Localization by Generative Attention Modeling

Weakly-supervised temporal action localization is a problem of learning an action localization model with only video-level action labeling available. The general framework largely relies on the classification activation, which employs an…

Computer Vision and Pattern Recognition · Computer Science 2020-03-31 Baifeng Shi , Qi Dai , Yadong Mu , Jingdong Wang

Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation

Weakly supervised temporal action localization aims to localize temporal boundaries of actions and simultaneously identify their categories with only video-level category labels. Many existing methods seek to generate pseudo labels for…

Computer Vision and Pattern Recognition · Computer Science 2022-03-15 Linjiang Huang , Liang Wang , Hongsheng Li

Connectionist Temporal Modeling for Weakly Supervised Action Labeling

We propose a weakly-supervised framework for action labeling in video, where only the order of occurring actions is required during training time. The key challenge is that the per-frame alignments between the input (video) and label…

Computer Vision and Pattern Recognition · Computer Science 2016-07-29 De-An Huang , Li Fei-Fei , Juan Carlos Niebles

Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to identify and localize the action instances in the untrimmed videos with only video-level action labels. When humans watch videos, we can adapt our abstract-level knowledge about actions…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Xijun Wang , Aggelos K. Katsaggelos

Weakly-Supervised Temporal Action Localization by Inferring Salient Snippet-Feature

Weakly-supervised temporal action localization aims to locate action regions and identify action categories in untrimmed videos simultaneously by taking only video-level labels as the supervision. Pseudo label generation is a promising…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Wulian Yun , Mengshi Qi , Chuanming Wang , Huadong Ma

Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to localize actions in untrimmed videos with only video-level action category labels. Most of previous methods ignore the incompleteness issue of Class Activation Sequences (CAS),…

Computer Vision and Pattern Recognition · Computer Science 2021-04-07 Chen Ju , Peisen Zhao , Siheng Chen , Ya Zhang , Xiaoyun Zhang , Qi Tian

Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization

Weakly supervised object localization aims to find a target object region in a given image with only weak supervision, such as image-level labels. Most existing methods use a class activation map (CAM) to generate a localization map;…

Computer Vision and Pattern Recognition · Computer Science 2022-04-04 Eunji Kim , Siwon Kim , Jungbeom Lee , Hyunwoo Kim , Sungroh Yoon

Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach

Weakly-supervised temporal action localization aims to localize action instances in videos with only video-level action labels. Existing methods mainly embrace a localization-by-classification pipeline that optimizes the snippet-level…

Computer Vision and Pattern Recognition · Computer Science 2023-12-22 Qinying Liu , Zilei Wang , Shenghai Rong , Junjie Li , Yixin Zhang

W-TALC: Weakly-supervised Temporal Activity Localization and Classification

Most activity localization methods in the literature suffer from the burden of frame-wise annotation requirement. Learning from weak labels may be a potential solution towards reducing such manual labeling effort. Recent years have…

Computer Vision and Pattern Recognition · Computer Science 2018-12-18 Sujoy Paul , Sourya Roy , Amit K Roy-Chowdhury

Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to localize and recognize actions in untrimmed videos with only video-level category labels during training. Without instance-level annotations, most existing methods follow the…

Computer Vision and Pattern Recognition · Computer Science 2023-05-30 Huan Ren , Wenfei Yang , Tianzhu Zhang , Yongdong Zhang

Collaborative Weakly Supervised Video Correlation Learning for Procedure-Aware Instructional Video Analysis

Video Correlation Learning (VCL), which aims to analyze the relationships between videos, has been widely studied and applied in various general video tasks. However, applying VCL to instructional videos is still quite challenging due to…

Computer Vision and Pattern Recognition · Computer Science 2023-12-19 Tianyao He , Huabin Liu , Yuxi Li , Xiao Ma , Cheng Zhong , Yang Zhang , Weiyao Lin

ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to recognize and localize action segments in untrimmed videos given only video-level action labels for training. Without the boundary information of action segments, existing methods…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Bo He , Xitong Yang , Le Kang , Zhiyu Cheng , Xin Zhou , Abhinav Shrivastava

ACM-Net: Action Context Modeling Network for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to localize action instances temporal boundary and identify the corresponding action category with only video-level labels. Traditional methods mainly focus on foreground and background…

Computer Vision and Pattern Recognition · Computer Science 2021-04-08 Sanqing Qu , Guang Chen , Zhijun Li , Lijun Zhang , Fan Lu , Alois Knoll