Related papers: Query-based Deep Improvisation

Learning source-aware representations of music in a discrete latent space

In recent years, neural network based methods have been proposed as a method that cangenerate representations from music, but they are not human readable and hardly analyzable oreditable by a human. To address this issue, we propose a novel…

Audio and Speech Processing · Electrical Eng. & Systems 2021-11-29 Jinsung Kim , Yeong-Seok Jeong , Woosung Choi , Jaehwa Chung , Soonyoung Jung

A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music

The Variational Autoencoder (VAE) has proven to be an effective model for producing semantically meaningful latent representations for natural data. However, it has thus far seen limited application to sequential data, and, as we…

Machine Learning · Computer Science 2019-11-12 Adam Roberts , Jesse Engel , Colin Raffel , Curtis Hawthorne , Douglas Eck

Modeling Melodic Feature Dependency with Modularized Variational Auto-Encoder

Automatic melody generation has been a long-time aspiration for both AI researchers and musicians. However, learning to generate euphonious melodies has turned out to be highly challenging. This paper introduces 1) a new variant of…

Artificial Intelligence · Computer Science 2018-11-02 Yu-An Wang , Yu-Kai Huang , Tzu-Chuan Lin , Shang-Yu Su , Yun-Nung Chen

A Classifying Variational Autoencoder with Application to Polyphonic Music Generation

The variational autoencoder (VAE) is a popular probabilistic generative model. However, one shortcoming of VAEs is that the latent variables cannot be discrete, which makes it difficult to generate data from different modes of a…

Machine Learning · Statistics 2017-11-21 Jay A. Hennig , Akash Umakantha , Ryan C. Williamson

Classical Music Prediction and Composition by means of Variational Autoencoders

This paper proposes a new model for music prediction based on Variational Autoencoders (VAEs). In this work, VAEs are used in a novel way in order to address two different problems: music representation into the latent space, and using this…

Sound · Computer Science 2019-06-25 Daniel Rivero , Enrique Fernandez-Blanco , Alejandro Pazos

Inspecting and Interacting with Meaningful Music Representations using VAE

Variational Autoencoders(VAEs) have already achieved great results on image generation and recently made promising progress on music generation. However, the generation process is still quite difficult to control in the sense that the…

Sound · Computer Science 2019-04-19 Ruihan Yang , Tianyao Chen , Yiyi Zhang , Gus Xia

Composer Style-specific Symbolic Music Generation Using Vector Quantized Discrete Diffusion Models

Emerging Denoising Diffusion Probabilistic Models (DDPM) have become increasingly utilised because of promising results they have achieved in diverse generative tasks with continuous data, such as image and sound synthesis. Nonetheless, the…

Sound · Computer Science 2024-09-05 Jincheng Zhang , György Fazekas , Charalampos Saitis

Towards democratizing music production with AI-Design of Variational Autoencoder-based Rhythm Generator as a DAW plugin

There has been significant progress in the music generation technique utilizing deep learning. However, it is still hard for musicians and artists to use these techniques in their daily music-making practice. This paper proposes a…

Audio and Speech Processing · Electrical Eng. & Systems 2020-04-06 Nao Tokui

Hidden Talents of the Variational Autoencoder

Variational autoencoders (VAE) represent a popular, flexible form of deep generative model that can be stochastically fit to samples from a given random process using an information-theoretic variational bound on the true underlying…

Machine Learning · Computer Science 2019-10-08 Bin Dai , Yu Wang , John Aston , Gang Hua , David Wipf

Sound Design Strategies for Latent Audio Space Explorations Using Deep Learning Architectures

The research in Deep Learning applications in sound and music computing have gathered an interest in the recent years; however, there is still a missing link between these new technologies and on how they can be incorporated into real-world…

Sound · Computer Science 2023-06-21 Kıvanç Tatar , Kelsey Cotton , Daniel Bisig

From Variational to Deterministic Autoencoders

Variational Autoencoders (VAEs) provide a theoretically-backed and popular framework for deep generative models. However, learning a VAE from data poses still unanswered theoretical questions and considerable practical challenges. In this…

Machine Learning · Computer Science 2020-06-01 Partha Ghosh , Mehdi S. M. Sajjadi , Antonio Vergari , Michael Black , Bernhard Schölkopf

MG-VAE: Deep Chinese Folk Songs Generation with Specific Regional Style

Regional style in Chinese folk songs is a rich treasure that can be used for ethnic music creation and folk culture research. In this paper, we propose MG-VAE, a music generative model based on VAE (Variational Auto-Encoder) that is capable…

Multimedia · Computer Science 2019-10-01 Jing Luo , Xinyu Yang , Shulei Ji , Juan Li

Is Disentanglement enough? On Latent Representations for Controllable Music Generation

Improving controllability or the ability to manipulate one or more attributes of the generated data has become a topic of interest in the context of deep generative models of music. Recent attempts in this direction have relied on learning…

Sound · Computer Science 2021-08-04 Ashis Pati , Alexander Lerch

The Transitive Information Theory and its Application to Deep Generative Models

Paradoxically, a Variational Autoencoder (VAE) could be pushed in two opposite directions, utilizing powerful decoder model for generating realistic images but collapsing the learned representation, or increasing regularization coefficient…

Machine Learning · Computer Science 2022-03-30 Trung Ngo , Najwa Laabid , Ville Hautamäki , Merja Heinäniemi

Vertical-Horizontal Structured Attention for Generating Music with Chords

In this paper, we propose a lightweight music-generating model based on variational autoencoder (VAE) with structured attention. Generating music is different from generating text because the melodies with chords give listeners…

Sound · Computer Science 2020-11-19 Yizhou Zhao , Liang Qiu , Wensi Ai , Feng Shi , Song-Chun Zhu

Disentangled Sequential Autoencoder

We present a VAE architecture for encoding and generating high dimensional sequential data, such as video or audio. Our deep generative model learns a latent representation of the data which is split into a static and dynamic part, allowing…

Machine Learning · Computer Science 2018-06-13 Yingzhen Li , Stephan Mandt

Deep learning based inverse method for layout design

Layout design with complex constraints is a challenging problem to solve due to the non-uniqueness of the solution and the difficulties in incorporating the constraints into the conventional optimization-based methods. In this paper, we…

Signal Processing · Electrical Eng. & Systems 2018-06-11 Yujie Zhang , Wenjing Ye

Diffusion bridges vector quantized Variational AutoEncoders

Vector Quantized-Variational AutoEncoders (VQ-VAE) are generative models based on discrete latent representations of the data, where inputs are mapped to a finite set of learned embeddings.To generate new samples, an autoregressive prior…

Machine Learning · Statistics 2022-08-04 Max Cohen , Guillaume Quispe , Sylvain Le Corff , Charles Ollion , Eric Moulines

Wavelets to the Rescue: Improving Sample Quality of Latent Variable Deep Generative Models

Variational Autoencoders (VAE) are probabilistic deep generative models underpinned by elegant theory, stable training processes, and meaningful manifold representations. However, they produce blurry images due to a lack of explicit…

Computer Vision and Pattern Recognition · Computer Science 2019-11-15 Prashnna K Gyawali , Rudra Saha , Linwei Wang , VSR Veeravasarapu , Maneesh Singh

Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements

We propose a novel system that takes as an input body movements of a musician playing a musical instrument and generates music in an unsupervised setting. Learning to generate multi-instrumental music from videos without labeling the…

Sound · Computer Science 2020-12-08 Kun Su , Xiulong Liu , Eli Shlizerman