Related papers: Audio-based Musical Version Identification: Elemen…

200 papers

Version identification (VI) systems now offer accurate and scalable solutions for detecting different renditions of a musical composition, allowing the use of these systems in industrial applications and throughout the wider music…

Sound · Computer Science 2021-10-01 Furkan Yesiler, Marius Miron, Joan Serrà, Emilia Gómez

Version identification (VI) has seen substantial progress over the past few years. On the one hand, the introduction of the metric learning paradigm has favored the emergence of scalable yet accurate VI systems. On the other hand, using…

Sound · Computer Science 2022-10-05 Mathilde Abrassart, Guillaume Doras

The version identification (VI) task deals with the automatic detection of recordings that correspond to the same underlying musical piece. Despite many efforts, VI is still an open problem, with much room for improvement, especially with…

Sound · Computer Science 2020-04-14 Furkan Yesiler, Joan Serrà, Emilia Gómez

Music Cover Retrieval, also known as Version Identification, aims to recognize distinct renditions of the same underlying musical work, a task central to catalog management, copyright enforcement, and music retrieval. State-of-the-art…

Version identification systems aim to detect different renditions of the same underlying musical composition (loosely called cover songs). By learning to encode entire recordings into plain vector embeddings, recent systems have made…

Sound · Computer Science 2020-10-08 Furkan Yesiler, Joan Serrà, Emilia Gómez

The setlist identification (SLI) task addresses a music recognition use case where the goal is to retrieve the metadata and timestamps for all the tracks played in live music events. Due to various musical and non-musical changes in live…

Sound · Computer Science 2021-01-07 Furkan Yesiler, Emilio Molina, Joan Serrà, Emilia Gómez

Musical instrument classification, a key area in Music Information Retrieval, has gained considerable interest due to its applications in education, digital music production, and consumer media. Recent advances in machine learning,…

Sound · Computer Science 2024-11-04 Joanikij Chulev

A range of applications of multi-modal music information retrieval is centred around the problem of connecting large collections of sheet music (images) to corresponding audio recordings, that is, identifying pairs of audio and score…

Sound · Computer Science 2023-09-22 Luis Carvalho, Gerhard Widmer

Variational inference (VI) provides a principled framework for estimating posterior distributions over model parameters, enabling explicit modeling of weight uncertainty during optimization. By capturing this uncertainty, VI improves the…

Audio and Speech Processing · Electrical Eng. & Systems 2026-01-21 Haolin Chen

In the field of music information retrieval, the task of simultaneously identifying the presence or absence of multiple musical instruments in a polyphonic recording remains a hard problem. Previous works have seen some success in improving…

Audio and Speech Processing · Electrical Eng. & Systems 2020-06-23 Karn Watcharasupat, Siddharth Gururani, Alexander Lerch

Recent years have witnessed the success of deep learning on the visual sound separation task. However, existing works follow similar settings where the training and testing datasets share the same musical instrument categories, which to…

Multimedia · Computer Science 2022-03-28 Xinchi Zhou, Dongzhan Zhou, Wanli Ouyang, Hang Zhou, Ziwei Liu, Di Hu

This paper investigates foundation models tailored for music informatics, a domain currently challenged by the scarcity of labeled data and generalization issues. To this end, we conduct an in-depth comparative study among various…

Sound · Computer Science 2023-11-07 Minz Won, Yun-Ning Hung, Duc Le

With the rapid advancement of generative audio models, distinguishing between human-composed and generated music is becoming increasingly challenging. As a response, models for detecting fake music have been proposed. In this work, we…

Sound · Computer Science 2025-07-15 Tomasz Sroka, Tomasz Wężowicz, Dominik Sidorczuk, Mateusz Modrzejewski

Since the vocal component plays a crucial role in popular music, singing voice detection has been an active research topic in music information retrieval. Although several proposed algorithms have shown high performances, we argue that…

Sound · Computer Science 2018-06-05 Kyungyun Lee, Keunwoo Choi, Juhan Nam

Singing voice synthesis (SVS) is a task that aims to generate audio signals according to musical scores and lyrics. With its multifaceted nature concerning music and language, producing singing voices indistinguishable from that of human…

Audio and Speech Processing · Electrical Eng. & Systems 2021-10-07 Yin-Ping Cho, Fu-Rong Yang, Yung-Chuan Chang, Ching-Ting Cheng, Xiao-Han Wang, Yi-Wen Liu

In this paper, we investigate using the variable-length infilling (VLI) model, which is originally proposed to infill missing segments, to "prolong" existing musical segments at musical boundaries. Specifically, as a case study, we expand…

Sound · Computer Science 2021-11-12 Chih-Pin Tan, Chin-Jui Chang, Alvin W. Y. Su, Yi-Hsuan Yang

Musical Instrument Identification has long had a reputation of being one of the most ill-posed problems in the field of Musical Information Retrieval (MIR). Despite several robust attempts to solve the problem, a timeline spanning over…

Sound · Computer Science 2021-08-10 Debdutta Chatterjee, Arindam Dutta, Dibakar Sil, Aniruddha Chandra

Does popular music from the 60s sound different from that of the 90s? Prior studies have shown that there exist some variations of patterns and regularities related to instrumentation changes and growing loudness across multi-decadal…

Sound · Computer Science 2024-07-09 Qiqi He, Xuchen Song, Weituo Hao, Ju-Chiang Wang, Wei-Tsung Lu, Wei Li

In this article, we investigate the notion of model-based deep learning in the realm of music information research (MIR). Loosely speaking, we refer to the term model-based deep learning for approaches that combine traditional…

Signal Processing · Electrical Eng. & Systems 2024-06-18 Gael Richard, Vincent Lostanlen, Yi-Hsuan Yang, Meinard Müller

In this work, we explore the use and reliability of Large Language Models (LLMs) in musicology. From a discussion with experts and students, we assess the current acceptance and concerns regarding this, nowadays ubiquitous, technology. We…

Sound · Computer Science 2024-09-04 Pedro Ramoneda, Emilia Parada-Cabaleiro, Benno Weck, Xavier Serra