Related papers: Conditional Drums Generation using Compound Word R…

DeepDrum: An Adaptive Conditional Neural Network

Considering music as a sequence of events with multiple complex dependencies, the Long Short-Term Memory (LSTM) architecture has proven very efficient in learning and reproducing musical styles. However, the generation of rhythms requires…

Sound · Computer Science 2019-01-23 Dimos Makris , Maximos Kaliakatsos-Papakostas , Katia Lida Kermanidis

High-Level Control of Drum Track Generation Using Learned Patterns of Rhythmic Interaction

Spurred by the potential of deep learning, computational music generation has gained renewed academic interest. A crucial issue in music generation is that of user control, especially in scenarios where the music generation process is…

Sound · Computer Science 2019-08-05 Stefan Lattner , Maarten Grachten

Drum Synthesis from Expressive Drum Grids via Neural Audio Codecs

Generating realistic drum audio directly from symbolic representations is a challenging task at the intersection of music perception and machine learning. We propose a system that transforms an expressive drum grid, a time-aligned MIDI…

Sound · Computer Science 2026-05-12 Konstantinos Soiledis , Maximos Kaliakatsos-Papakostas , Dimos Makris , Konstantinos Tsamis

Text Conditioned Symbolic Drumbeat Generation using Latent Diffusion Models

This study introduces a text-conditioned approach to generating drumbeats with Latent Diffusion Models (LDMs). It uses informative conditioning text extracted from training data filenames. By pretraining a text and drumbeat encoder through…

Sound · Computer Science 2024-08-07 Pushkar Jajoria , James McDermott

Generating Coherent Drum Accompaniment With Fills And Improvisations

Creating a complex work of art like music necessitates profound creativity. With recent advancements in deep learning and powerful models such as transformers, there has been huge progress in automatic music generation. In an accompaniment…

Sound · Computer Science 2022-09-02 Rishabh Dahale , Vaibhav Talwadker , Preeti Rao , Prateek Verma

Continuous Melody Generation via Disentangled Short-Term Representations and Structural Conditions

Automatic music generation is an interdisciplinary research topic that combines computational creativity and semantic analysis of music to create automatic machine improvisations. An important property of such a system is allowing the user…

Sound · Computer Science 2020-03-03 Ke Chen , Gus Xia , Shlomo Dubnov

DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks

Synthetic creation of drum sounds (e.g., in drum machines) is commonly performed using analog or digital synthesis, allowing a musician to sculpt the desired timbre modifying various parameters. Typically, such parameters control low-level…

Audio and Speech Processing · Electrical Eng. & Systems 2022-06-29 J. Nistal , S. Lattner , G. Richard

Setting the rhythm scene: deep learning-based drum loop generation from arbitrary language cues

Generative artificial intelligence models can be a valuable aid to music composition and live performance, both to aid the professional musician and to help democratize the music creation process for hobbyists. Here we present a novel…

Sound · Computer Science 2022-09-22 Ignacio J. Tripodi

Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure

The rise of deep learning technologies has quickly advanced many fields, including that of generative music systems. There exist a number of systems that allow for the generation of good sounding short snippets, yet, these generated…

Sound · Computer Science 2021-04-27 Zixun Guo , Dimos Makris , Dorien Herremans

Compound Word Transformer: Learning to Compose Full-Song Music over Dynamic Directed Hypergraphs

To apply neural sequence models such as the Transformers to music generation tasks, one has to represent a piece of music by a sequence of tokens drawn from a finite set of pre-defined vocabulary. Such a vocabulary usually involves tokens…

Sound · Computer Science 2021-01-08 Wen-Yi Hsiao , Jen-Yu Liu , Yin-Cheng Yeh , Yi-Hsuan Yang

Bass Accompaniment Generation via Latent Diffusion

The ability to automatically generate music that appropriately matches an arbitrary input track is a challenging task. We present a novel controllable system for generating single stems to accompany musical mixes of arbitrary length. At the…

Sound · Computer Science 2024-02-05 Marco Pasini , Maarten Grachten , Stefan Lattner

Talking Drums: Generating drum grooves with neural networks

Presented is a method of generating a full drum kit part for a provided kick-drum sequence. A sequence to sequence neural network model used in natural language translation was adopted to encode multiple musical styles and an online survey…

Sound · Computer Science 2017-06-30 P. Hutchings

Text-based LSTM networks for Automatic Music Composition

In this paper, we introduce new methods and discuss results of text-based LSTM (Long Short-Term Memory) networks for automatic music composition. The proposed network is designed to learn relationships within text documents that represent…

Artificial Intelligence · Computer Science 2016-04-20 Keunwoo Choi , George Fazekas , Mark Sandler

Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning

Deep generative models have recently achieved impressive performance in speech and music synthesis. However, compared to the generation of those domain-specific sounds, generating general sounds (such as siren, gunshots) has received less…

Audio and Speech Processing · Electrical Eng. & Systems 2021-10-07 Xubo Liu , Turab Iqbal , Jinzheng Zhao , Qiushi Huang , Mark D. Plumbley , Wenwu Wang

Controllable deep melody generation via hierarchical music structure representation

Recent advances in deep learning have expanded possibilities to generate music, but generating a customizable full piece of music with consistent long-term structure remains a challenge. This paper introduces MusicFrameworks, a hierarchical…

Sound · Computer Science 2021-09-03 Shuqi Dai , Zeyu Jin , Celso Gomes , Roger B. Dannenberg

JukeDrummer: Conditional Beat-aware Audio-domain Drum Accompaniment Generation via Transformer VQ-VAE

This paper proposes a model that generates a drum track in the audio domain to play along to a user-provided drum-free recording. Specifically, using paired data of drumless tracks and the corresponding human-made drum tracks, we train a…

Sound · Computer Science 2022-11-01 Yueh-Kao Wu , Ching-Yu Chiu , Yi-Hsuan Yang

Rethinking Recurrent Latent Variable Model for Music Composition

We present a model for capturing musical features and creating novel sequences of music, called the Convolutional Variational Recurrent Neural Network. To generate sequential data, the model uses an encoder-decoder architecture with latent…

Sound · Computer Science 2018-10-09 Eunjeong Stella Koh , Shlomo Dubnov , Dustin Wright

Sequence Generation using Deep Recurrent Networks and Embeddings: A study case in music

Automatic generation of sequences has been a highly explored field in the last years. In particular, natural language processing and automatic music composition have gained importance due to the recent advances in machine learning and…

Sound · Computer Science 2020-12-03 Sebastian Garcia-Valencia , Alejandro Betancourt , Juan G. Lalinde-Pulido

Assisted Sound Sample Generation with Musical Conditioning in Adversarial Auto-Encoders

Generative models have thrived in computer vision, enabling unprecedented image processes. Yet the results in audio remain less advanced. Our project targets real-time sound synthesis from a reduced set of high-level parameters, including…

Sound · Computer Science 2019-06-25 Adrien Bitton , Philippe Esling , Antoine Caillon , Martin Fouilleul

Music Generation Using an LSTM

Over the past several years, deep learning for sequence modeling has grown in popularity. To achieve this goal, LSTM network structures have proven to be very useful for making predictions for the next output in a series. For instance, a…

Sound · Computer Science 2022-03-24 Michael Conner , Lucas Gral , Kevin Adams , David Hunger , Reagan Strelow , Alexander Neuwirth