Related papers: High-dimensional sequence transduction

Modeling Temporal Dependencies in High-Dimensional Sequences: Application to Polyphonic Music Generation and Transcription

We investigate the problem of modeling symbolic sequences of polyphonic music in a completely general piano-roll representation. We introduce a probabilistic model based on distribution estimators conditioned on a recurrent neural network…

Machine Learning · Computer Science 2012-07-03 Nicolas Boulanger-Lewandowski , Yoshua Bengio , Pascal Vincent

Computer Assisted Composition with Recurrent Neural Networks

Sequence modeling with neural networks has lead to powerful models of symbolic music data. We address the problem of exploiting these models to reach creative musical goals, by combining with human input. To this end we generalise previous…

Artificial Intelligence · Computer Science 2017-10-03 Christian Walder , Dongwoo Kim

Towards Interpretable Polyphonic Transcription with Invertible Neural Networks

We explore a novel way of conceptualising the task of polyphonic music transcription, using so-called invertible neural networks. Invertible models unify both discriminative and generative aspects in one function, sharing one set of…

Sound · Computer Science 2019-09-05 Rainer Kelz , Gerhard Widmer

Signal-domain representation of symbolic music for learning embedding spaces

A key aspect of machine learning models lies in their ability to learn efficient intermediate features. However, the input representation plays a crucial role in this process, and polyphonic musical scores remain a particularly complex type…

Machine Learning · Computer Science 2021-09-09 Mathieu Prang , Philippe Esling

An End-to-End Neural Network for Polyphonic Piano Music Transcription

We present a supervised neural network model for polyphonic piano music transcription. The architecture of the proposed model is analogous to speech recognition systems and comprises an acoustic model and a music language model. The…

Machine Learning · Statistics 2016-02-12 Siddharth Sigtia , Emmanouil Benetos , Simon Dixon

Supervised Symbolic Music Style Translation Using Synthetic Data

Research on style transfer and domain translation has clearly demonstrated the ability of deep learning-based algorithms to manipulate images in terms of artistic style. More recently, several attempts have been made to extend such…

Sound · Computer Science 2021-06-11 Ondřej Cífka , Umut Şimşekli , Gaël Richard

A Hybrid Recurrent Neural Network For Music Transcription

We investigate the problem of incorporating higher-level symbolic score-like information into Automatic Music Transcription (AMT) systems to improve their performance. We use recurrent neural networks (RNNs) and their variants as music…

Machine Learning · Computer Science 2014-11-07 Siddharth Sigtia , Emmanouil Benetos , Nicolas Boulanger-Lewandowski , Tillman Weyde , Artur S. d'Avila Garcez , Simon Dixon

Modelling Symbolic Music: Beyond the Piano Roll

In this paper, we consider the problem of probabilistically modelling symbolic music data. We introduce a representation which reduces polyphonic music to a univariate categorical sequence. In this way, we are able to apply state of the art…

Sound · Computer Science 2016-06-07 Christian Walder

TransFusion: Transcribing Speech with Multinomial Diffusion

Diffusion models have shown exceptional scaling properties in the image synthesis domain, and initial attempts have shown similar benefits for applying diffusion to unconditional text synthesis. Denoising diffusion models attempt to…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-17 Matthew Baas , Kevin Eloff , Herman Kamper

Realistic Gramophone Noise Synthesis using a Diffusion Model

This paper introduces a novel data-driven strategy for synthesizing gramophone noise audio textures. A diffusion probabilistic model is applied to generate highly realistic quasiperiodic noises. The proposed model is designed to generate…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-01 Eloi Moliner , Vesa Välimäki

A holistic approach to polyphonic music transcription with neural networks

We present a framework based on neural networks to extract music scores directly from polyphonic audio in an end-to-end fashion. Most previous Automatic Music Transcription (AMT) methods seek a piano-roll representation of the pitches, that…

Sound · Computer Science 2019-10-29 Miguel A. Román , Antonio Pertusa , Jorge Calvo-Zaragoza

Sequence Transduction with Recurrent Neural Networks

Many machine learning tasks can be expressed as the transformation---or \emph{transduction}---of input sequences into output sequences: speech recognition, machine translation, protein secondary structure prediction and text-to-speech to…

Neural and Evolutionary Computing · Computer Science 2012-11-16 Alex Graves

Improving Polyphonic Music Models with Feature-Rich Encoding

This paper explores sequential modelling of polyphonic music with deep neural networks. While recent breakthroughs have focussed on network architecture, we demonstrate that the representation of the sequence can make an equally significant…

Sound · Computer Science 2021-08-11 Omar Peracha

Cross-modal variational inference for bijective signal-symbol translation

Extraction of symbolic information from signals is an active field of research enabling numerous applications especially in the Musical Information Retrieval domain. This complex task, that is also related to other topics such as pitch…

Machine Learning · Statistics 2020-02-11 Axel Chemla--Romeu-Santos , Stavros Ntalampiras , Philippe Esling , Goffredo Haus , Gérard Assayag

Sequence Generation using Deep Recurrent Networks and Embeddings: A study case in music

Automatic generation of sequences has been a highly explored field in the last years. In particular, natural language processing and automatic music composition have gained importance due to the recent advances in machine learning and…

Sound · Computer Science 2020-12-03 Sebastian Garcia-Valencia , Alejandro Betancourt , Juan G. Lalinde-Pulido

Sequence-to-Sequence Piano Transcription with Transformers

Automatic Music Transcription has seen significant progress in recent years by training custom deep neural networks on large datasets. However, these models have required extensive domain-specific design of network architectures,…

Sound · Computer Science 2021-07-21 Curtis Hawthorne , Ian Simon , Rigel Swavely , Ethan Manilow , Jesse Engel

Learning to Transduce with Unbounded Memory

Recently, strong results have been demonstrated by Deep Recurrent Neural Networks on natural language transduction problems. In this paper we explore the representational power of these models using synthetic grammars designed to exhibit…

Neural and Evolutionary Computing · Computer Science 2015-11-04 Edward Grefenstette , Karl Moritz Hermann , Mustafa Suleyman , Phil Blunsom

Symbolic Music Generation with Diffusion Models

Score-based generative models and diffusion probabilistic models have been successful at generating high-quality samples in continuous domains such as images and audio. However, due to their Langevin-inspired sampling mechanisms, their…

Sound · Computer Science 2021-11-29 Gautam Mittal , Jesse Engel , Curtis Hawthorne , Ian Simon

Coupled Recurrent Models for Polyphonic Music Composition

This paper introduces a novel recurrent model for music composition that is tailored to the structure of polyphonic music. We propose an efficient new conditional probabilistic factorization of musical scores, viewing a score as a…

Sound · Computer Science 2019-11-28 John Thickstun , Zaid Harchaoui , Dean P. Foster , Sham M. Kakade

Audio Super Resolution using Neural Networks

We introduce a new audio processing technique that increases the sampling rate of signals such as speech or music using deep convolutional neural networks. Our model is trained on pairs of low and high-quality audio examples; at test-time,…

Sound · Computer Science 2017-08-03 Volodymyr Kuleshov , S. Zayd Enam , Stefano Ermon