Related papers: Learning Multi-instrument Classification with Part…

An Attention Mechanism for Musical Instrument Recognition

While the automatic recognition of musical instruments has seen significant progress, the task is still considered hard for music featuring multiple instruments as opposed to single instrument recordings. Datasets for polyphonic instrument…

Information Retrieval · Computer Science 2019-07-10 Siddharth Gururani , Mohit Sharma , Alexander Lerch

Visual Attention for Musical Instrument Recognition

In the field of music information retrieval, the task of simultaneously identifying the presence or absence of multiple musical instruments in a polyphonic recording remains a hard problem. Previous works have seen some success in improving…

Audio and Speech Processing · Electrical Eng. & Systems 2020-06-23 Karn Watcharasupat , Siddharth Gururani , Alexander Lerch

Weakly Labelled AudioSet Tagging with Attention Neural Networks

Audio tagging is the task of predicting the presence or absence of sound classes within an audio clip. Previous work in audio tagging focused on relatively small datasets limited to recognising a small number of sound classes. We…

Sound · Computer Science 2019-12-11 Qiuqiang Kong , Changsong Yu , Turab Iqbal , Yong Xu , Wenwu Wang , Mark D. Plumbley

Multi-level Attention Model for Weakly Supervised Audio Classification

In this paper, we propose a multi-level attention model to solve the weakly labelled audio classification problem. The objective of audio classification is to predict the presence or absence of audio events in an audio clip. Recently,…

Audio and Speech Processing · Electrical Eng. & Systems 2018-03-08 Changsong Yu , Karim Said Barsim , Qiuqiang Kong , Bin Yang

Frame-level Instrument Recognition by Timbre and Pitch

Instrument recognition is a fundamental task in music information retrieval, yet little has been done to predict the presence of instruments in multi-instrument music for each time frame. This task is important for not only automatic…

Sound · Computer Science 2018-06-26 Yun-Ning Hung , Yi-Hsuan Yang

Towards Imbalanced Large Scale Multi-label Classification with Partially Annotated Labels

Multi-label classification is a widely encountered problem in daily life, where an instance can be associated with multiple classes. In theory, this is a supervised learning method that requires a large amount of labeling. However,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-02 XIn Zhang , Yuqi Song , Fei Zuo , Xiaofeng Wang

Multitask learning for frame-level instrument recognition

For many music analysis problems, we need to know the presence of instruments for each time frame in a multi-instrument musical piece. However, such a frame-level instrument recognition task remains difficult, mainly due to the lack of…

Sound · Computer Science 2019-02-19 Yun-Ning Hung , Yi-An Chen , Yi-Hsuan Yang

Learning a Deep ConvNet for Multi-label Classification with Partial Labels

Deep ConvNets have shown great performance for single-label image classification (e.g. ImageNet), but it is necessary to move beyond the single-label classification task because pictures of everyday life are inherently multi-label.…

Computer Vision and Pattern Recognition · Computer Science 2019-02-27 Thibaut Durand , Nazanin Mehrasa , Greg Mori

Audio Event Detection using Weakly Labeled Data

Acoustic event detection is essential for content analysis and description of multimedia recordings. The majority of current literature on the topic learns the detectors through fully-supervised techniques employing strongly labeled data.…

Sound · Computer Science 2016-07-07 Anurag Kumar , Bhiksha Raj

Semi-Supervised Audio Classification with Partially Labeled Data

Audio classification has seen great progress with the increasing availability of large-scale datasets. These large datasets, however, are often only partially labeled as collecting full annotations is a tedious and expensive process. This…

Sound · Computer Science 2021-11-29 Siddharth Gururani , Alexander Lerch

An Attention-based Approach to Hierarchical Multi-label Music Instrument Classification

Although music is typically multi-label, many works have studied hierarchical music tagging with simplified settings such as single-label data. Moreover, there lacks a framework to describe various joint training methods under the…

Sound · Computer Science 2023-02-17 Zhi Zhong , Masato Hirano , Kazuki Shimada , Kazuya Tateishi , Shusuke Takahashi , Yuki Mitsufuji

Multi-label Classification with Partial Annotations using Class-aware Selective Loss

Large-scale multi-label classification datasets are commonly, and perhaps inevitably, partially annotated. That is, only a small subset of labels are annotated per sample. Different methods for handling the missing labels induce different…

Computer Vision and Pattern Recognition · Computer Science 2021-10-22 Emanuel Ben-Baruch , Tal Ridnik , Itamar Friedman , Avi Ben-Cohen , Nadav Zamir , Asaf Noy , Lihi Zelnik-Manor

Label-efficient audio classification through multitask learning and self-supervision

While deep learning has been incredibly successful in modeling tasks with large, carefully curated labeled datasets, its application to problems with limited labeled data remains a challenge. The aim of the present work is to improve the…

Audio and Speech Processing · Electrical Eng. & Systems 2019-10-29 Tyler Lee , Ting Gong , Suchismita Padhy , Andrew Rouditchenko , Anthony Ndirango

Data-Efficient Weakly Supervised Learning for Low-Resource Audio Event Detection Using Deep Learning

We propose a method to perform audio event detection under the common constraint that only limited training data are available. In training a deep learning system to perform audio event detection, two practical problems arise. Firstly, most…

Sound · Computer Science 2018-10-29 Veronica Morfi , Dan Stowell

Exploring Partial Multi-Label Learning via Integrating Semantic Co-occurrence Knowledge

Partial multi-label learning aims to extract knowledge from incompletely annotated data, which includes known correct labels, known incorrect labels, and unknown labels. The core challenge lies in accurately identifying the ambiguous…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Xin Wu , Fei Teng , Yue Feng , Kaibo Shi , Zhuosheng Lin , Ji Zhang , James Wang

Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised Training of Sound Events With Partial Labels

Annotating time boundaries of sound events is labor-intensive, limiting the scalability of strongly supervised learning in audio detection. To reduce annotation costs, weakly-supervised learning with only clip-level labels has been widely…

Sound · Computer Science 2025-10-30 Keisuke Imoto

An Effective Approach for Multi-label Classification with Missing Labels

Compared with multi-class classification, multi-label classification that contains more than one class is more suitable in real life scenarios. Obtaining fully labeled high-quality datasets for multi-label classification problems, however,…

Computer Vision and Pattern Recognition · Computer Science 2022-10-26 Xin Zhang , Rabab Abdelfattah , Yuqi Song , Xiaofeng Wang

Self-refining of Pseudo Labels for Music Source Separation with Noisy Labeled Data

Music source separation (MSS) faces challenges due to the limited availability of correctly-labeled individual instrument tracks. With the push to acquire larger datasets to improve MSS performance, the inevitability of encountering…

Audio and Speech Processing · Electrical Eng. & Systems 2023-07-25 Junghyun Koo , Yunkee Chae , Chang-Bin Jeon , Kyogu Lee

Lead Instrument Detection from Multitrack Music

Prior approaches to lead instrument detection primarily analyze mixture audio, limited to coarse classifications and lacking generalization ability. This paper presents a novel approach to lead instrument detection in multitrack music audio…

Sound · Computer Science 2025-03-06 Longshen Ou , Yu Takahashi , Ye Wang

Towards the Identifiability in Noisy Label Learning: A Multinomial Mixture Modelling Approach

Learning from noisy labels (LNL) is crucial in deep learning, in which one of the approaches is to identify clean-label samples from poorly-annotated datasets. Such an identification is challenging because the conventional LNL problem,…

Machine Learning · Computer Science 2025-09-26 Cuong Nguyen , Thanh-Toan Do , Gustavo Carneiro