Related papers: Weakly-Supervised Temporal Localization via Occurr…

Data-Efficient Weakly Supervised Learning for Low-Resource Audio Event Detection Using Deep Learning

We propose a method to perform audio event detection under the common constraint that only limited training data are available. In training a deep learning system to perform audio event detection, two practical problems arise. Firstly, most…

Sound · Computer Science 2018-10-29 Veronica Morfi , Dan Stowell

Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events

Audio-visual representation learning is an important task from the perspective of designing machines with the ability to understand complex events. To this end, we propose a novel multimodal framework that instantiates multiple instance…

Computer Vision and Pattern Recognition · Computer Science 2018-07-10 Sanjeel Parekh , Slim Essid , Alexey Ozerov , Ngoc Q. K. Duong , Patrick Pérez , Gaël Richard

LoANs: Weakly Supervised Object Detection with Localizer Assessor Networks

Recently, deep neural networks have achieved remarkable performance on the task of object detection and recognition. The reason for this success is mainly grounded in the availability of large scale, fully annotated datasets, but the…

Computer Vision and Pattern Recognition · Computer Science 2018-11-16 Christian Bartz , Haojin Yang , Joseph Bethge , Christoph Meinel

Audio Event Detection using Weakly Labeled Data

Acoustic event detection is essential for content analysis and description of multimedia recordings. The majority of current literature on the topic learns the detectors through fully-supervised techniques employing strongly labeled data.…

Sound · Computer Science 2016-07-07 Anurag Kumar , Bhiksha Raj

Learning from Counting: Leveraging Temporal Classification for Weakly Supervised Object Localization and Detection

This paper reports a new solution of leveraging temporal classification to support weakly supervised object detection (WSOD). Specifically, we introduce raster scan-order techniques to serialize 2D images into 1D sequence data, and then…

Computer Vision and Pattern Recognition · Computer Science 2021-03-10 Chia-Yu Hsu , Wenwen Li

Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling

Weakly-supervised action localization aims to recognize and localize action instancese in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning(MIL), where the predictions of unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Guiqin Wang , Peng Zhao , Cong Zhao , Shusen Yang , Jie Cheng , Luziwei Leng , Jianxing Liao , Qinghai Guo

Weakly-supervised Action Localization with Background Modeling

We describe a latent approach that learns to detect actions in long sequences given training videos with only whole-video class labels. Our approach makes use of two innovations to attention-modeling in weakly-supervised learning. First,…

Computer Vision and Pattern Recognition · Computer Science 2019-08-20 Phuc Xuan Nguyen , Deva Ramanan , Charless C. Fowlkes

Weakly-supervised Temporal Action Localization by Uncertainty Modeling

Weakly-supervised temporal action localization aims to learn detecting temporal intervals of action classes with only video-level labels. To this end, it is crucial to separate frames of action classes from the background frames (i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2020-12-18 Pilhyeon Lee , Jinglu Wang , Yan Lu , Hyeran Byun

Transfer Learning by Ranking for Weakly Supervised Object Annotation

Most existing approaches to training object detectors rely on fully supervised learning, which requires the tedious manual annotation of object location in a training set. Recently there has been an increasing interest in developing weakly…

Computer Vision and Pattern Recognition · Computer Science 2017-05-03 Zhiyuan Shi , Parthipan Siva , Tao Xiang

Consistency-based Self-supervised Learning for Temporal Anomaly Localization

This work tackles Weakly Supervised Anomaly detection, in which a predictor is allowed to learn not only from normal examples but also from a few labeled anomalies made available during training. In particular, we deal with the localization…

Computer Vision and Pattern Recognition · Computer Science 2022-08-11 Aniello Panariello , Angelo Porrello , Simone Calderara , Rita Cucchiara

Weakly Supervised Localization using Deep Feature Maps

Object localization is an important computer vision problem with a variety of applications. The lack of large scale object-level annotations and the relative abundance of image-level labels makes a compelling case for weak supervision in…

Computer Vision and Pattern Recognition · Computer Science 2016-03-30 Archith J. Bency , Heesung Kwon , Hyungtae Lee , S. Karthikeyan , B. S. Manjunath

Understanding temporally weakly supervised training: A case study for keyword spotting

The currently most prominent algorithm to train keyword spotting (KWS) models with deep neural networks (DNNs) requires strong supervision i.e., precise knowledge of the spoken keyword location in time. Thus, most KWS approaches treat the…

Sound · Computer Science 2023-05-31 Heinrich Dinkel , Weiji Zhuang , Zhiyong Yan , Yongqing Wang , Junbo Zhang , Yujun Wang

Weakly-Supervised Temporal Action Localization by Inferring Salient Snippet-Feature

Weakly-supervised temporal action localization aims to locate action regions and identify action categories in untrimmed videos simultaneously by taking only video-level labels as the supervision. Pseudo label generation is a promising…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Wulian Yun , Mengshi Qi , Chuanming Wang , Huadong Ma

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and…

Computer Vision and Pattern Recognition · Computer Science 2020-01-23 Ashraful Islam , Richard J. Radke

Attention and Localization based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging

Audio tagging aims to perform multi-label classification on audio chunks and it is a newly proposed task in the Detection and Classification of Acoustic Scenes and Events 2016 (DCASE 2016) challenge. This task encourages research efforts to…

Sound · Computer Science 2017-03-20 Yong Xu , Qiuqiang Kong , Qiang Huang , Wenwu Wang , Mark D. Plumbley

CELNet: Evidence Localization for Pathology Images using Weakly Supervised Learning

Despite deep convolutional neural networks boost the performance of image classification and segmentation in digital pathology analysis, they are usually weak in interpretability for clinical applications or require heavy annotations to…

Computer Vision and Pattern Recognition · Computer Science 2020-02-10 Yongxiang Huang , Albert C. S. Chung

Weakly-supervised Joint Anomaly Detection and Classification

Anomaly activities such as robbery, explosion, accidents, etc. need immediate actions for preventing loss of human life and property in real world surveillance systems. Although the recent automation in surveillance systems are capable of…

Computer Vision and Pattern Recognition · Computer Science 2021-08-23 Snehashis Majhi , Srijan Das , Francois Bremond , Ratnakar Dash , Pankaj Kumar Sa

Weakly Supervised Semantic Segmentation using Web-Crawled Videos

We propose a novel algorithm for weakly supervised semantic segmentation based on image-level class labels only. In weakly supervised setting, it is commonly observed that trained model overly focuses on discriminative parts rather than the…

Computer Vision and Pattern Recognition · Computer Science 2018-01-09 Seunghoon Hong , Donghun Yeo , Suha Kwak , Honglak Lee , Bohyung Han

Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning

Object category localization is a challenging problem in computer vision. Standard supervised training requires bounding box annotations of object instances. This time-consuming annotation process is sidestepped in weakly supervised…

Computer Vision and Pattern Recognition · Computer Science 2016-05-30 Ramazan Gokberk Cinbis , Jakob Verbeek , Cordelia Schmid

Deep CNN Framework for Audio Event Recognition using Weakly Labeled Web Data

The development of audio event recognition systems require labeled training data, which are generally hard to obtain. One promising source of recordings of audio events is the large amount of multimedia data on the web. In particular, if…

Sound · Computer Science 2022-10-04 Anurag Kumar , Bhiksha Raj