Related papers: Audio-based Musical Version Identification: Elemen…

Assessing Algorithmic Biases for Musical Version Identification

Version identification (VI) systems now offer accurate and scalable solutions for detecting different renditions of a musical composition, allowing the use of these systems in industrial applications and throughout the wider music…

Sound · Computer Science 2021-10-01 Furkan Yesiler , Marius Miron , Joan Serrà , Emilia Gómez

And what if two musical versions don't share melody, harmony, rhythm, or lyrics ?

Version identification (VI) has seen substantial progress over the past few years. On the one hand, the introduction of the metric learning paradigm has favored the emergence of scalable yet accurate VI systems. On the other hand, using…

Sound · Computer Science 2022-10-05 Mathilde Abrassart , Guillaume Doras

Accurate and Scalable Version Identification Using Musically-Motivated Embeddings

The version identification (VI) task deals with the automatic detection of recordings that correspond to the same underlying musical piece. Despite many efforts, VI is still an open problem, with much room for improvement, specially with…

Sound · Computer Science 2020-04-14 Furkan Yesiler , Joan Serrà , Emilia Gómez

Scalable Music Cover Retrieval Using Lyrics-Aligned Audio Embeddings

Music Cover Retrieval, also known as Version Identification, aims to recognize distinct renditions of the same underlying musical work, a task central to catalog management, copyright enforcement, and music retrieval. State-of-the-art…

Sound · Computer Science 2026-01-19 Joanne Affolter , Benjamin Martin , Elena V. Epure , Gabriel Meseguer-Brocal , Frédéric Kaplan

Less is more: Faster and better music version identification with embedding distillation

Version identification systems aim to detect different renditions of the same underlying musical composition (loosely called cover songs). By learning to encode entire recordings into plain vector embeddings, recent systems have made…

Sound · Computer Science 2020-10-08 Furkan Yesiler , Joan Serrà , Emilia Gómez

Investigating the efficacy of music version retrieval systems for setlist identification

The setlist identification (SLI) task addresses a music recognition use case where the goal is to retrieve the metadata and timestamps for all the tracks played in live music events. Due to various musical and non-musical changes in live…

Sound · Computer Science 2021-01-07 Furkan Yesiler , Emilio Molina , Joan Serrà , Emilia Gómez

Improving Musical Instrument Classification with Advanced Machine Learning Techniques

Musical instrument classification, a key area in Music Information Retrieval, has gained considerable interest due to its applications in education, digital music production, and consumer media. Recent advances in machine learning,…

Sound · Computer Science 2024-11-04 Joanikij Chulev

Towards Robust and Truly Large-Scale Audio-Sheet Music Retrieval

A range of applications of multi-modal music information retrieval is centred around the problem of connecting large collections of sheet music (images) to corresponding audio recordings, that is, identifying pairs of audio and score…

Sound · Computer Science 2023-09-22 Luis Carvalho , Gerhard Widmer

Improving Audio Question Answering with Variational Inference

Variational inference (VI) provides a principled framework for estimating posterior distributions over model parameters, enabling explicit modeling of weight uncertainty during optimization. By capturing this uncertainty, VI improves the…

Audio and Speech Processing · Electrical Eng. & Systems 2026-01-21 Haolin Chen

Visual Attention for Musical Instrument Recognition

In the field of music information retrieval, the task of simultaneously identifying the presence or absence of multiple musical instruments in a polyphonic recording remains a hard problem. Previous works have seen some success in improving…

Audio and Speech Processing · Electrical Eng. & Systems 2020-06-23 Karn Watcharasupat , Siddharth Gururani , Alexander Lerch

SeCo: Separating Unknown Musical Visual Sounds with Consistency Guidance

Recent years have witnessed the success of deep learning on the visual sound separation task. However, existing works follow similar settings where the training and testing datasets share the same musical instrument categories, which to…

Multimedia · Computer Science 2022-03-28 Xinchi Zhou , Dongzhan Zhou , Wanli Ouyang , Hang Zhou , Ziwei Liu , Di Hu

A Foundation Model for Music Informatics

This paper investigates foundation models tailored for music informatics, a domain currently challenged by the scarcity of labeled data and generalization issues. To this end, we conduct an in-depth comparative study among various…

Sound · Computer Science 2023-11-07 Minz Won , Yun-Ning Hung , Duc Le

Evaluating Fake Music Detection Performance Under Audio Augmentations

With the rapid advancement of generative audio models, distinguishing between human-composed and generated music is becoming increasingly challenging. As a response, models for detecting fake music have been proposed. In this work, we…

Sound · Computer Science 2025-07-15 Tomasz Sroka , Tomasz Wężowicz , Dominik Sidorczuk , Mateusz Modrzejewski

Revisiting Singing Voice Detection: a Quantitative Review and the Future Outlook

Since the vocal component plays a crucial role in popular music, singing voice detection has been an active research topic in music information retrieval. Although several proposed algorithms have shown high performances, we argue that…

Sound · Computer Science 2018-06-05 Kyungyun Lee , Keunwoo Choi , Juhan Nam

A Survey on Recent Deep Learning-driven Singing Voice Synthesis Systems

Singing voice synthesis (SVS) is a task that aims to generate audio signals according to musical scores and lyrics. With its multifaceted nature concerning music and language, producing singing voices indistinguishable from that of human…

Audio and Speech Processing · Electrical Eng. & Systems 2021-10-07 Yin-Ping Cho , Fu-Rong Yang , Yung-Chuan Chang , Ching-Ting Cheng , Xiao-Han Wang , Yi-Wen Liu

Music Score Expansion with Variable-Length Infilling

In this paper, we investigate using the variable-length infilling (VLI) model, which is originally proposed to infill missing segments, to "prolong" existing musical segments at musical boundaries. Specifically, as a case study, we expand…

Sound · Computer Science 2021-11-12 Chih-Pin Tan , Chin-Jui Chang , Alvin W. Y. Su , Yi-Hsuan Yang

Deep Single Shot Musical Instrument Identification using Scalograms

Musical Instrument Identification has for long had a reputation of being one of the most ill-posed problems in the field of Musical Information Retrieval(MIR). Despite several robust attempts to solve the problem, a timeline spanning over…

Sound · Computer Science 2021-08-10 Debdutta Chatterjee , Arindam Dutta , Dibakar Sil , Aniruddha Chandra

Music Era Recognition Using Supervised Contrastive Learning and Artist Information

Does popular music from the 60s sound different than that of the 90s? Prior study has shown that there would exist some variations of patterns and regularities related to instrumentation changes and growing loudness across multi-decadal…

Sound · Computer Science 2024-07-09 Qiqi He , Xuchen Song , Weituo Hao , Ju-Chiang Wang , Wei-Tsung Lu , Wei Li

Model-Based Deep Learning for Music Information Research

In this article, we investigate the notion of model-based deep learning in the realm of music information research (MIR). Loosely speaking, we refer to the term model-based deep learning for approaches that combine traditional…

Signal Processing · Electrical Eng. & Systems 2024-06-18 Gael Richard , Vincent Lostanlen , Yi-Hsuan Yang , Meinard Müller

The Role of Large Language Models in Musicology: Are We Ready to Trust the Machines?

In this work, we explore the use and reliability of Large Language Models (LLMs) in musicology. From a discussion with experts and students, we assess the current acceptance and concerns regarding this, nowadays ubiquitous, technology. We…

Sound · Computer Science 2024-09-04 Pedro Ramoneda , Emilia Parada-Cabaleiro , Benno Weck , Xavier Serra