Multi-label Open-set Audio Classification

Sripathi Sridhar; Mark Cartwright

Multi-label Open-set Audio Classification

Sound 2023-10-24 v1 Audio and Speech Processing

Authors: Sripathi Sridhar , Mark Cartwright

Abstract

Current audio classification models have small class vocabularies relative to the large number of sound event classes of interest in the real world. Thus, they provide a limited view of the world that may miss important yet unexpected or unknown sound events. To address this issue, open-set audio classification techniques have been developed to detect sound events from unknown classes. Although these methods have been applied to a multi-class context in audio, such as sound scene classification, they have yet to be investigated for polyphonic audio in which sound events overlap, requiring the use of multi-label models. In this study, we establish the problem of multi-label open-set audio classification by creating a dataset with varying unknown class distributions and evaluating baseline approaches built upon existing techniques.

Keywords

multi-label classification audio classification generalized category discovery

Cite

@article{arxiv.2310.13759,
  title  = {Multi-label Open-set Audio Classification},
  author = {Sripathi Sridhar and Mark Cartwright},
  journal= {arXiv preprint arXiv:2310.13759},
  year   = {2023}
}

Comments

Published at the Workshop on Detection and Classification of Acoustic Scenes and Events, 2023 (DCASE 2023)

Multi-label Open-set Audio Classification

Abstract

Keywords

Cite

Comments

Related papers