Related papers: Weakly-Supervised Action Localization with Expecta…

Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling

Weakly-supervised action localization aims to recognize and localize action instances in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning (MIL), where the predictions of unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Guiqin Wang, Peng Zhao, Cong Zhao, Shusen Yang, Jie Cheng, Luziwei Leng, Jianxing Liao, Qinghai Guo

Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to localize and recognize actions in untrimmed videos with only video-level category labels during training. Without instance-level annotations, most existing methods follow the…

Computer Vision and Pattern Recognition · Computer Science 2023-05-30 Huan Ren, Wenfei Yang, Tianzhu Zhang, Yongdong Zhang

Rethinking Multiple Instance Learning: Developing an Instance-Level Classifier via Weakly-Supervised Self-Training

The multiple instance learning (MIL) problem is currently solved from either a bag-classification or an instance-classification perspective, both of which ignore important information contained in some instances and result in limited performance.…

Computer Vision and Pattern Recognition · Computer Science 2024-08-12 Yingfan Ma, Xiaoyuan Luo, Mingzhi Yuan, Xinrong Chen, Manning Wang

Spatio-Temporal Action Localization in a Weakly Supervised Setting

Enabling computational systems with the ability to localize actions in video-based content has manifold applications. Traditionally, such a problem is approached in a fully-supervised setting where video-clips with complete frame-by-frame…

Computer Vision and Pattern Recognition · Computer Science 2019-05-07 Kurt Degiorgio, Fabio Cuzzolin

ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to recognize and localize action segments in untrimmed videos given only video-level action labels for training. Without the boundary information of action segments, existing methods…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Bo He, Xitong Yang, Le Kang, Zhiyu Cheng, Xin Zhou, Abhinav Shrivastava

Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos

Despite the recent advances in video classification, progress in spatio-temporal action recognition has lagged behind. A major contributing factor has been the prohibitive cost of annotating videos frame-by-frame. In this paper, we present…

Computer Vision and Pattern Recognition · Computer Science 2020-07-22 Anurag Arnab, Chen Sun, Arsha Nagrani, Cordelia Schmid

Weakly-Supervised Action Localization by Generative Attention Modeling

Weakly-supervised temporal action localization is a problem of learning an action localization model with only video-level action labeling available. The general framework largely relies on the classification activation, which employs an…

Computer Vision and Pattern Recognition · Computer Science 2020-03-31 Baifeng Shi, Qi Dai, Yadong Mu, Jingdong Wang

Deep Multiple Instance Feature Learning via Variational Autoencoder

We describe a novel weakly supervised deep learning framework that combines both the discriminative and generative models to learn meaningful representation in the multiple instance learning (MIL) setting. MIL is a weakly supervised…

Machine Learning · Computer Science 2018-07-09 Shabnam Ghaffarzadegan

Weakly Supervised Action Selection Learning in Video

Localizing actions in video is a core task in computer vision. The weakly supervised temporal localization problem investigates whether this task can be adequately solved with only video-level labels, significantly reducing the amount of…

Computer Vision and Pattern Recognition · Computer Science 2021-05-07 Junwei Ma, Satya Krishna Gorti, Maksims Volkovs, Guangwei Yu

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

Weakly-supervised Temporal Action Localization (WS-TAL) methods learn to localize temporal starts and ends of action instances in a video under only video-level supervision. Existing WS-TAL methods rely on deep features learned for action…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Ziyi Liu, Le Wang, Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua

Rethinking Multiple Instance Learning for Whole Slide Image Classification: A Good Instance Classifier is All You Need

Weakly supervised whole slide image classification is usually formulated as a multiple instance learning (MIL) problem, where each slide is treated as a bag, and the patches cut out of it are treated as instances. Existing methods either…

Computer Vision and Pattern Recognition · Computer Science 2024-05-14 Linhao Qu, Yingfan Ma, Xiaoyuan Luo, Manning Wang, Zhijian Song

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and…

Computer Vision and Pattern Recognition · Computer Science 2020-01-23 Ashraful Islam, Richard J. Radke

Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning

Object category localization is a challenging problem in computer vision. Standard supervised training requires bounding box annotations of object instances. This time-consuming annotation process is sidestepped in weakly supervised…

Computer Vision and Pattern Recognition · Computer Science 2016-05-30 Ramazan Gokberk Cinbis, Jakob Verbeek, Cordelia Schmid

Spatio-Temporal Instance Learning: Action Tubes from Class Supervision

The goal of this work is spatio-temporal action localization in videos, using only the supervision from video-level class labels. The state-of-the-art casts this weakly-supervised action localization regime as a Multiple Instance Learning…

Computer Vision and Pattern Recognition · Computer Science 2018-11-26 Pascal Mettes, Cees G. M. Snoek

Weakly-supervised Temporal Action Localization by Uncertainty Modeling

Weakly-supervised temporal action localization aims to learn to detect temporal intervals of action classes with only video-level labels. To this end, it is crucial to separate frames of action classes from the background frames (i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2020-12-18 Pilhyeon Lee, Jinglu Wang, Yan Lu, Hyeran Byun

MINI-Net: Multiple Instance Ranking Network for Video Highlight Detection

We address the weakly supervised video highlight detection problem for learning to detect segments that are more attractive in training videos given their video event label but without expensive supervision of manually annotating highlight…

Computer Vision and Pattern Recognition · Computer Science 2020-08-14 Fa-Ting Hong, Xuanteng Huang, Wei-Hong Li, Wei-Shi Zheng

A Multiclass Multiple Instance Learning Method with Exact Likelihood

We study a multiclass multiple instance learning (MIL) problem where the labels only suggest whether any instance of a class exists or does not exist in a training sample or example. No further information, e.g., the number of instances of…

Machine Learning · Statistics 2019-03-15 Xi-Lin Li

Sm: enhanced localization in Multiple Instance Learning for medical imaging classification

Multiple Instance Learning (MIL) is widely used in medical imaging classification to reduce the labeling effort. While only bag labels are available for training, one typically seeks predictions at both bag and instance levels…

Computer Vision and Pattern Recognition · Computer Science 2024-11-19 Francisco M. Castro-Macías, Pablo Morales-Álvarez, Yunan Wu, Rafael Molina, Aggelos K. Katsaggelos

Multimodal Visual Concept Learning with Weakly Supervised Techniques

Despite the availability of a huge amount of video data accompanied by descriptive texts, it is not always easy to exploit the information contained in natural language in order to automatically recognize video concepts. Towards this goal,…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Giorgos Bouritsas, Petros Koutras, Athanasia Zlatintsi, Petros Maragos

Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling

The Weakly-Supervised Temporal Action Localization (WS-TAL) task aims to recognize and localize temporal starts and ends of action instances in an untrimmed video with only video-level label supervision. Due to lack of negative samples of…

Computer Vision and Pattern Recognition · Computer Science 2021-06-23 Xiang Wang, Zhiwu Qing, Ziyuan Huang, Yutong Feng, Shiwei Zhang, Jianwen Jiang, Mingqian Tang, Yuanjie Shao, Nong Sang