Related papers: Guided Learning Convolution System for DCASE 2019 …

200 papers

In this paper, we present a gated convolutional recurrent neural network based approach to solve task 4, large-scale weakly labelled semi-supervised sound event detection in domestic environments, of the DCASE 2018 challenge. Gated linear…

Sound · Computer Science 2018-10-17 Robert Harb, Franz Pernkopf

This report proposes a polyphonic sound event detection (SED) method for the DCASE 2020 Challenge Task 4. The proposed SED method is based on semi-supervised learning to deal with the different combination of training datasets such as…

Audio and Speech Processing · Electrical Eng. & Systems 2020-07-03 Nam Kyun Kim, Hong Kook Kim

In this paper, we describe in detail our systems for DCASE 2020 Task 4. The systems are based on the 1st-place system of DCASE 2019 Task 4, which adopts a weakly-supervised framework with an attention-based embedding-level pooling module and…

Sound · Computer Science 2020-11-03 Yuxin Huang, Liwei Lin, Shuo Ma, Xiangdong Wang, Hong Liu, Yueliang Qian, Min Liu, Kazushige Ouch

In this paper, a combinative approach using Nonnegative Matrix Factorization (NMF) and Convolutional Neural Network (CNN) is proposed for audio clip Sound Event Detection (SED). The main idea begins with the use of NMF to approximate strong…

Audio and Speech Processing · Electrical Eng. & Systems 2020-09-22 Chan Teck Kai, Chin Cheng Siong, Li Ye

In this paper, we propose a method for home activity monitoring. We demonstrate our model on the dataset of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2018 Challenge Task 5. This task aims to classify multi-channel…

Sound · Computer Science 2018-11-15 Yu-Han Shen, Ke-Xin He, Wei-Qiang Zhang

In this paper, we describe in detail our system for DCASE 2022 Task 4. The system combines two considerably different models: an end-to-end Sound Event Detection Transformer (SEDT) and a frame-wise model, Metric Learning and Focal Loss CNN…

State-of-the-art sound event detection (SED) methods usually employ a series of convolutional neural networks (CNNs) to extract useful features from the input audio signal, and then recurrent neural networks (RNNs) to model longer temporal…

Sound event detection (SED) entails identifying the type of sound and estimating its temporal boundaries from acoustic signals. These events are uniquely characterized by their spatio-temporal features, which are determined by the way they…

Audio and Speech Processing · Electrical Eng. & Systems 2023-05-19 Tanmay Khandelwal, Rohan Kumar Das

In this technical report, we present several methods for Task 4 of the Detection and Classification of Acoustic Scenes and Events 2017 (DCASE2017) challenge. This task evaluates systems for the large-scale detection of sound events using…

Sound · Computer Science 2017-11-28 Yong Xu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley

This report proposes a frequency dynamic convolution (FDY) with a large kernel attention (LKA)-convolutional recurrent neural network (CRNN) with a pre-trained bidirectional encoder representation from audio transformers (BEATs)…

Audio and Speech Processing · Electrical Eng. & Systems 2023-06-13 Ji Won Kim, Sang Won Son, Yoonah Song, Hong Kook Kim, Il Hoon Song, Jeong Eun Lim

In this paper, we propose a stacked convolutional and recurrent neural network (CRNN) with a 3D convolutional neural network (CNN) in the first layer for the multichannel sound event detection (SED) task. The 3D CNN enables the network to…

Sound · Computer Science 2018-01-30 Sharath Adavanne, Archontis Politis, Tuomas Virtanen

In this report, we propose three novel methods for developing a sound event detection (SED) model for the DCASE 2024 Challenge Task 4. First, we propose an auxiliary decoder attached to the final convolutional block to improve feature…

Audio and Speech Processing · Electrical Eng. & Systems 2024-06-25 Sang Won Son, Jongyeon Park, Hong Kook Kim, Sulaiman Vesal, Jeong Eun Lim

In this paper we present our system for the detection and classification of acoustic scenes and events (DCASE) 2020 Challenge Task 4: Sound event detection and separation in domestic environments. We introduce two new models: the…

Audio and Speech Processing · Electrical Eng. & Systems 2021-03-12 Janek Ebbers, Reinhold Haeb-Umbach

The main scientific question of this year's DCASE challenge, Task 4 - Sound Event Detection in Domestic Environments, is to investigate the types of data (strongly labeled synthetic data, weakly labeled data, unlabeled in domain data)…

Sound · Computer Science 2020-01-23 Teck Kai Chan, Cheng Siong Chin, Ye Li

This report presents our audio event detection system submitted for Task 2, "Detection of rare sound events", of DCASE 2017 challenge. The proposed system is based on convolutional neural networks (CNNs) and deep neural networks (DNNs)…

Sound · Computer Science 2017-10-19 Huy Phan, Martin Krawczyk-Becker, Timo Gerkmann, Alfred Mertins

Sound event detection (SED) is a task to detect sound events in an audio recording. One challenge of the SED task is that many datasets such as the Detection and Classification of Acoustic Scenes and Events (DCASE) datasets are weakly…

Sound · Computer Science 2020-08-25 Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley

In this technical report, the systems we submitted for subtask 4 of the DCASE 2021 challenge, regarding sound event detection, are described in detail. These models are closely related to the baseline provided for this problem, as they are…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-20 Wim Boes, Hugo Van hamme

In this paper, we present a method called HODGEPODGE for large-scale detection of sound events using weakly labeled, synthetic, and unlabeled data proposed in the Detection and Classification of Acoustic Scenes and Events…

Sound · Computer Science 2019-07-18 Ziqiang Shi, Liu Liu, Huibin Lin, Rujie Liu, Anyan Shi

We propose a simple but efficient method termed Guided Learning for weakly-labeled semi-supervised sound event detection (SED). There are two sub-targets implied in weakly-labeled SED: audio tagging and boundary detection. Instead of…

Machine Learning · Computer Science 2020-02-05 Liwei Lin, Xiangdong Wang, Hong Liu, Yueliang Qian

This report proposes a polyphonic sound event detection (SED) method for the DCASE 2021 Challenge Task 4. The proposed SED model consists of two stages: a mean-teacher model for providing target labels regarding weakly labeled or unlabeled…

Sound · Computer Science 2021-07-07 Nam Kyun Kim, Hong Kook Kim