English
Related papers

Related papers: Weakly-Supervised Temporal Localization via Occurr…

200 papers

We propose a method to perform audio event detection under the common constraint that only limited training data are available. In training a deep learning system to perform audio event detection, two practical problems arise. Firstly, most…

Sound · Computer Science 2018-10-29 Veronica Morfi , Dan Stowell

Audio-visual representation learning is an important task from the perspective of designing machines with the ability to understand complex events. To this end, we propose a novel multimodal framework that instantiates multiple instance…

Computer Vision and Pattern Recognition · Computer Science 2018-07-10 Sanjeel Parekh , Slim Essid , Alexey Ozerov , Ngoc Q. K. Duong , Patrick Pérez , Gaël Richard

Recently, deep neural networks have achieved remarkable performance on the task of object detection and recognition. The reason for this success is mainly grounded in the availability of large scale, fully annotated datasets, but the…

Computer Vision and Pattern Recognition · Computer Science 2018-11-16 Christian Bartz , Haojin Yang , Joseph Bethge , Christoph Meinel

Acoustic event detection is essential for content analysis and description of multimedia recordings. The majority of current literature on the topic learns the detectors through fully-supervised techniques employing strongly labeled data.…

Sound · Computer Science 2016-07-07 Anurag Kumar , Bhiksha Raj

This paper reports a new solution of leveraging temporal classification to support weakly supervised object detection (WSOD). Specifically, we introduce raster scan-order techniques to serialize 2D images into 1D sequence data, and then…

Computer Vision and Pattern Recognition · Computer Science 2021-03-10 Chia-Yu Hsu , Wenwen Li

Weakly-supervised action localization aims to recognize and localize action instancese in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning(MIL), where the predictions of unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Guiqin Wang , Peng Zhao , Cong Zhao , Shusen Yang , Jie Cheng , Luziwei Leng , Jianxing Liao , Qinghai Guo

We describe a latent approach that learns to detect actions in long sequences given training videos with only whole-video class labels. Our approach makes use of two innovations to attention-modeling in weakly-supervised learning. First,…

Computer Vision and Pattern Recognition · Computer Science 2019-08-20 Phuc Xuan Nguyen , Deva Ramanan , Charless C. Fowlkes

Weakly-supervised temporal action localization aims to learn detecting temporal intervals of action classes with only video-level labels. To this end, it is crucial to separate frames of action classes from the background frames (i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2020-12-18 Pilhyeon Lee , Jinglu Wang , Yan Lu , Hyeran Byun

Most existing approaches to training object detectors rely on fully supervised learning, which requires the tedious manual annotation of object location in a training set. Recently there has been an increasing interest in developing weakly…

Computer Vision and Pattern Recognition · Computer Science 2017-05-03 Zhiyuan Shi , Parthipan Siva , Tao Xiang

This work tackles Weakly Supervised Anomaly detection, in which a predictor is allowed to learn not only from normal examples but also from a few labeled anomalies made available during training. In particular, we deal with the localization…

Computer Vision and Pattern Recognition · Computer Science 2022-08-11 Aniello Panariello , Angelo Porrello , Simone Calderara , Rita Cucchiara

Object localization is an important computer vision problem with a variety of applications. The lack of large scale object-level annotations and the relative abundance of image-level labels makes a compelling case for weak supervision in…

Computer Vision and Pattern Recognition · Computer Science 2016-03-30 Archith J. Bency , Heesung Kwon , Hyungtae Lee , S. Karthikeyan , B. S. Manjunath

The currently most prominent algorithm to train keyword spotting (KWS) models with deep neural networks (DNNs) requires strong supervision i.e., precise knowledge of the spoken keyword location in time. Thus, most KWS approaches treat the…

Sound · Computer Science 2023-05-31 Heinrich Dinkel , Weiji Zhuang , Zhiyong Yan , Yongqing Wang , Junbo Zhang , Yujun Wang

Weakly-supervised temporal action localization aims to locate action regions and identify action categories in untrimmed videos simultaneously by taking only video-level labels as the supervision. Pseudo label generation is a promising…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Wulian Yun , Mengshi Qi , Chuanming Wang , Huadong Ma

Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and…

Computer Vision and Pattern Recognition · Computer Science 2020-01-23 Ashraful Islam , Richard J. Radke

Audio tagging aims to perform multi-label classification on audio chunks and it is a newly proposed task in the Detection and Classification of Acoustic Scenes and Events 2016 (DCASE 2016) challenge. This task encourages research efforts to…

Sound · Computer Science 2017-03-20 Yong Xu , Qiuqiang Kong , Qiang Huang , Wenwu Wang , Mark D. Plumbley

Despite deep convolutional neural networks boost the performance of image classification and segmentation in digital pathology analysis, they are usually weak in interpretability for clinical applications or require heavy annotations to…

Computer Vision and Pattern Recognition · Computer Science 2020-02-10 Yongxiang Huang , Albert C. S. Chung

Anomaly activities such as robbery, explosion, accidents, etc. need immediate actions for preventing loss of human life and property in real world surveillance systems. Although the recent automation in surveillance systems are capable of…

Computer Vision and Pattern Recognition · Computer Science 2021-08-23 Snehashis Majhi , Srijan Das , Francois Bremond , Ratnakar Dash , Pankaj Kumar Sa

We propose a novel algorithm for weakly supervised semantic segmentation based on image-level class labels only. In weakly supervised setting, it is commonly observed that trained model overly focuses on discriminative parts rather than the…

Computer Vision and Pattern Recognition · Computer Science 2018-01-09 Seunghoon Hong , Donghun Yeo , Suha Kwak , Honglak Lee , Bohyung Han

Object category localization is a challenging problem in computer vision. Standard supervised training requires bounding box annotations of object instances. This time-consuming annotation process is sidestepped in weakly supervised…

Computer Vision and Pattern Recognition · Computer Science 2016-05-30 Ramazan Gokberk Cinbis , Jakob Verbeek , Cordelia Schmid

The development of audio event recognition systems require labeled training data, which are generally hard to obtain. One promising source of recordings of audio events is the large amount of multimedia data on the web. In particular, if…

Sound · Computer Science 2022-10-04 Anurag Kumar , Bhiksha Raj
‹ Prev 1 2 3 10 Next ›