English
Related papers

Related papers: Learning Multi-instrument Classification with Part…

200 papers

While the automatic recognition of musical instruments has seen significant progress, the task is still considered hard for music featuring multiple instruments as opposed to single instrument recordings. Datasets for polyphonic instrument…

Information Retrieval · Computer Science 2019-07-10 Siddharth Gururani , Mohit Sharma , Alexander Lerch

In the field of music information retrieval, the task of simultaneously identifying the presence or absence of multiple musical instruments in a polyphonic recording remains a hard problem. Previous works have seen some success in improving…

Audio and Speech Processing · Electrical Eng. & Systems 2020-06-23 Karn Watcharasupat , Siddharth Gururani , Alexander Lerch

Audio tagging is the task of predicting the presence or absence of sound classes within an audio clip. Previous work in audio tagging focused on relatively small datasets limited to recognising a small number of sound classes. We…

Sound · Computer Science 2019-12-11 Qiuqiang Kong , Changsong Yu , Turab Iqbal , Yong Xu , Wenwu Wang , Mark D. Plumbley

In this paper, we propose a multi-level attention model to solve the weakly labelled audio classification problem. The objective of audio classification is to predict the presence or absence of audio events in an audio clip. Recently,…

Audio and Speech Processing · Electrical Eng. & Systems 2018-03-08 Changsong Yu , Karim Said Barsim , Qiuqiang Kong , Bin Yang

Instrument recognition is a fundamental task in music information retrieval, yet little has been done to predict the presence of instruments in multi-instrument music for each time frame. This task is important for not only automatic…

Sound · Computer Science 2018-06-26 Yun-Ning Hung , Yi-Hsuan Yang

Multi-label classification is a widely encountered problem in daily life, where an instance can be associated with multiple classes. In theory, this is a supervised learning method that requires a large amount of labeling. However,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-02 XIn Zhang , Yuqi Song , Fei Zuo , Xiaofeng Wang

For many music analysis problems, we need to know the presence of instruments for each time frame in a multi-instrument musical piece. However, such a frame-level instrument recognition task remains difficult, mainly due to the lack of…

Sound · Computer Science 2019-02-19 Yun-Ning Hung , Yi-An Chen , Yi-Hsuan Yang

Deep ConvNets have shown great performance for single-label image classification (e.g. ImageNet), but it is necessary to move beyond the single-label classification task because pictures of everyday life are inherently multi-label.…

Computer Vision and Pattern Recognition · Computer Science 2019-02-27 Thibaut Durand , Nazanin Mehrasa , Greg Mori

Acoustic event detection is essential for content analysis and description of multimedia recordings. The majority of current literature on the topic learns the detectors through fully-supervised techniques employing strongly labeled data.…

Sound · Computer Science 2016-07-07 Anurag Kumar , Bhiksha Raj

Audio classification has seen great progress with the increasing availability of large-scale datasets. These large datasets, however, are often only partially labeled as collecting full annotations is a tedious and expensive process. This…

Sound · Computer Science 2021-11-29 Siddharth Gururani , Alexander Lerch

Although music is typically multi-label, many works have studied hierarchical music tagging with simplified settings such as single-label data. Moreover, there lacks a framework to describe various joint training methods under the…

Large-scale multi-label classification datasets are commonly, and perhaps inevitably, partially annotated. That is, only a small subset of labels are annotated per sample. Different methods for handling the missing labels induce different…

Computer Vision and Pattern Recognition · Computer Science 2021-10-22 Emanuel Ben-Baruch , Tal Ridnik , Itamar Friedman , Avi Ben-Cohen , Nadav Zamir , Asaf Noy , Lihi Zelnik-Manor

While deep learning has been incredibly successful in modeling tasks with large, carefully curated labeled datasets, its application to problems with limited labeled data remains a challenge. The aim of the present work is to improve the…

Audio and Speech Processing · Electrical Eng. & Systems 2019-10-29 Tyler Lee , Ting Gong , Suchismita Padhy , Andrew Rouditchenko , Anthony Ndirango

We propose a method to perform audio event detection under the common constraint that only limited training data are available. In training a deep learning system to perform audio event detection, two practical problems arise. Firstly, most…

Sound · Computer Science 2018-10-29 Veronica Morfi , Dan Stowell

Partial multi-label learning aims to extract knowledge from incompletely annotated data, which includes known correct labels, known incorrect labels, and unknown labels. The core challenge lies in accurately identifying the ambiguous…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Xin Wu , Fei Teng , Yue Feng , Kaibo Shi , Zhuosheng Lin , Ji Zhang , James Wang

Annotating time boundaries of sound events is labor-intensive, limiting the scalability of strongly supervised learning in audio detection. To reduce annotation costs, weakly-supervised learning with only clip-level labels has been widely…

Sound · Computer Science 2025-10-30 Keisuke Imoto

Compared with multi-class classification, multi-label classification that contains more than one class is more suitable in real life scenarios. Obtaining fully labeled high-quality datasets for multi-label classification problems, however,…

Computer Vision and Pattern Recognition · Computer Science 2022-10-26 Xin Zhang , Rabab Abdelfattah , Yuqi Song , Xiaofeng Wang

Music source separation (MSS) faces challenges due to the limited availability of correctly-labeled individual instrument tracks. With the push to acquire larger datasets to improve MSS performance, the inevitability of encountering…

Audio and Speech Processing · Electrical Eng. & Systems 2023-07-25 Junghyun Koo , Yunkee Chae , Chang-Bin Jeon , Kyogu Lee

Prior approaches to lead instrument detection primarily analyze mixture audio, limited to coarse classifications and lacking generalization ability. This paper presents a novel approach to lead instrument detection in multitrack music audio…

Sound · Computer Science 2025-03-06 Longshen Ou , Yu Takahashi , Ye Wang

Learning from noisy labels (LNL) is crucial in deep learning, in which one of the approaches is to identify clean-label samples from poorly-annotated datasets. Such an identification is challenging because the conventional LNL problem,…

Machine Learning · Computer Science 2025-09-26 Cuong Nguyen , Thanh-Toan Do , Gustavo Carneiro
‹ Prev 1 2 3 10 Next ›