Related papers: Audio Event Detection using Weakly Labeled Data

Weakly Supervised Scalable Audio Content Analysis

Audio Event Detection is an important task for content analysis of multimedia data. Most of the current works on detection of audio events is driven through supervised learning approaches. We propose a weakly supervised learning framework…

Sound · Computer Science 2016-06-14 Anurag Kumar , Bhiksha Raj

Data-Efficient Weakly Supervised Learning for Low-Resource Audio Event Detection Using Deep Learning

We propose a method to perform audio event detection under the common constraint that only limited training data are available. In training a deep learning system to perform audio event detection, two practical problems arise. Firstly, most…

Sound · Computer Science 2018-10-29 Veronica Morfi , Dan Stowell

Deep CNN Framework for Audio Event Recognition using Weakly Labeled Web Data

The development of audio event recognition systems require labeled training data, which are generally hard to obtain. One promising source of recordings of audio events is the large amount of multimedia data on the web. In particular, if…

Sound · Computer Science 2022-10-04 Anurag Kumar , Bhiksha Raj

A Closer Look at Weak Label Learning for Audio Events

Audio content analysis in terms of sound events is an important research problem for a variety of applications. Recently, the development of weak labeling approaches for audio or sound event detection (AED) and availability of large scale…

Sound · Computer Science 2018-04-26 Ankit Shah , Anurag Kumar , Alexander G. Hauptmann , Bhiksha Raj

Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised Training of Sound Events With Partial Labels

Annotating time boundaries of sound events is labor-intensive, limiting the scalability of strongly supervised learning in audio detection. To reduce annotation costs, weakly-supervised learning with only clip-level labels has been widely…

Sound · Computer Science 2025-10-30 Keisuke Imoto

Audio Event and Scene Recognition: A Unified Approach using Strongly and Weakly Labeled Data

In this paper we propose a novel learning framework called Supervised and Weakly Supervised Learning where the goal is to learn simultaneously from weakly and strongly labeled data. Strongly labeled data can be simply understood as fully…

Machine Learning · Computer Science 2017-02-21 Anurag Kumar , Bhiksha Raj

Learning Sound Events From Webly Labeled Data

In the last couple of years, weakly labeled learning has turned out to be an exciting approach for audio event detection. In this work, we introduce webly labeled learning for sound events which aims to remove human supervision altogether…

Sound · Computer Science 2019-07-16 Anurag Kumar , Ankit Shah , Bhiksha Raj , Alex Hauptmann

Joint Acoustic and Class Inference for Weakly Supervised Sound Event Detection

Sound event detection is a challenging task, especially for scenes with multiple simultaneous events. While event classification methods tend to be fairly accurate, event localization presents additional challenges, especially when large…

Audio and Speech Processing · Electrical Eng. & Systems 2018-11-12 Sandeep Kothinti , Keisuke Imoto , Debmalya Chakrabarty , Gregory Sell , Shinji Watanabe , Mounya Elhilali

A Global-local Attention Framework for Weakly Labelled Audio Tagging

Weakly labelled audio tagging aims to predict the classes of sound events within an audio clip, where the onset and offset times of the sound events are not provided. Previous works have used the multiple instance learning (MIL) framework,…

Audio and Speech Processing · Electrical Eng. & Systems 2021-02-04 Helin Wang , Yuexian Zou , Wenwu Wang

Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events

Audio-visual representation learning is an important task from the perspective of designing machines with the ability to understand complex events. To this end, we propose a novel multimodal framework that instantiates multiple instance…

Computer Vision and Pattern Recognition · Computer Science 2018-07-10 Sanjeel Parekh , Slim Essid , Alexey Ozerov , Ngoc Q. K. Duong , Patrick Pérez , Gaël Richard

Joint Analysis of Acoustic Scenes and Sound Events with Weakly labeled Data

Considering that acoustic scenes and sound events are closely related to each other, in some previous papers, a joint analysis of acoustic scenes and sound events utilizing multitask learning (MTL)-based neural networks was proposed. In…

Sound · Computer Science 2022-07-12 Shunsuke Tsubaki , Keisuke Imoto , Nobutaka Ono

Weakly Labeled Sound Event Detection Using Tri-training and Adversarial Learning

This paper considers a semi-supervised learning framework for weakly labeled polyphonic sound event detection problems for the DCASE 2019 challenge's task4 by combining both the tri-training and adversarial learning. The goal of the task4…

Sound · Computer Science 2019-10-16 Hyoungwoo Park , Sungrack Yun , Jungyun Eum , Janghoon Cho , Kyuwoong Hwang

Overcoming label noise in audio event detection using sequential labeling

This paper addresses the noisy label issue in audio event detection (AED) by refining strong labels as sequential labels with inaccurate timestamps removed. In AED, strong labels contain the occurrence of a specific event and its timestamps…

Sound · Computer Science 2020-07-13 Jae-Bin Kim , Seongkyu Mun , Myungwoo Oh , Soyeon Choe , Yong-Hyeok Lee , Hyung-Min Park

Multi-level Attention Model for Weakly Supervised Audio Classification

In this paper, we propose a multi-level attention model to solve the weakly labelled audio classification problem. The objective of audio classification is to predict the presence or absence of audio events in an audio clip. Recently,…

Audio and Speech Processing · Electrical Eng. & Systems 2018-03-08 Changsong Yu , Karim Said Barsim , Qiuqiang Kong , Bin Yang

Sound event detection using weakly labeled dataset with stacked convolutional and recurrent neural network

This paper proposes a neural network architecture and training scheme to learn the start and end time of sound events (strong labels) in an audio recording given just the list of sound events existing in the audio without time information…

Sound · Computer Science 2017-10-10 Sharath Adavanne , Tuomas Virtanen

Self-supervised Attention Model for Weakly Labeled Audio Event Classification

We describe a novel weakly labeled Audio Event Classification approach based on a self-supervised attention model. The weakly labeled framework is used to eliminate the need for expensive data labeling procedure and self-supervised…

Audio and Speech Processing · Electrical Eng. & Systems 2019-08-09 Bongjun Kim , Shabnam Ghaffarzadegan

Large-Scale Weakly Labeled Semi-Supervised Sound Event Detection in Domestic Environments

This paper presents DCASE 2018 task 4. The task evaluates systems for the large-scale detection of sound events using weakly labeled data (without time boundaries). The target of the systems is to provide not only the event class but also…

Sound · Computer Science 2018-07-30 Romain Serizel , Nicolas Turpault , Hamid Eghbal-Zadeh , Ankit Parag Shah

A Joint Detection-Classification Model for Audio Tagging of Weakly Labelled Data

Audio tagging aims to assign one or several tags to an audio clip. Most of the datasets are weakly labelled, which means only the tags of the clip are known, without knowing the occurrence time of the tags. The labeling of an audio clip is…

Sound · Computer Science 2019-12-10 Qiuqiang Kong , Yong Xu , Wenwu Wang , Mark Plumbley

Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling

The Audio-Visual Video Parsing task aims to identify and temporally localize the events that occur in either or both the audio and visual streams of audible videos. It often performs in a weakly-supervised manner, where only video event…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Jinxing Zhou , Dan Guo , Yiran Zhong , Meng Wang

Guided learning for weakly-labeled semi-supervised sound event detection

We propose a simple but efficient method termed Guided Learning for weakly-labeled semi-supervised sound event detection (SED). There are two sub-targets implied in weakly-labeled SED: audio tagging and boundary detection. Instead of…

Machine Learning · Computer Science 2020-02-05 Liwei Lin , Xiangdong Wang , Hong Liu , Yueliang Qian