Counterpoint by Convolution

Machine Learning | 2019-03-19 | v1 | Sound, Audio and Speech Processing, Machine Learning

Abstract

Machine learning models of music typically break up the task of composition into a chronological process, composing a piece of music in a single pass from beginning to end. In contrast, human composers write music in a nonlinear fashion, scribbling motifs here and there and often revisiting choices previously made. To better approximate this process, we train a convolutional neural network to complete partial musical scores, and explore the use of blocked Gibbs sampling as an analogue to rewriting. Neither the model nor the generative procedure is tied to a particular causal direction of composition. Our model is an instance of orderless NADE (Uria et al., 2014), which allows more direct ancestral sampling. However, we find that Gibbs sampling greatly improves sample quality, which we demonstrate to be due to some conditional distributions being poorly modeled. Moreover, we show that even the cheap approximate blocked Gibbs procedure from Yao et al. (2014) yields better samples than ancestral sampling, based on both log-likelihood and human evaluation.
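The rewriting idea in the abstract can be illustrated with a minimal sketch of blocked Gibbs sampling over a score. Everything named here is an assumption made for illustration and is not taken from the paper: the score is a small integer array of shape (voices, time steps), `toy_conditional` is a hypothetical stand-in for the trained convolutional network, and the fixed `block_prob` and the independent resampling of erased cells are just one simple way to make each step cheap and approximate.

```python
import numpy as np

def toy_conditional(pitches, mask, num_pitches):
    """Hypothetical stand-in for a trained model (not the paper's network).

    Returns an array of shape (voices, steps, num_pitches) holding one pitch
    distribution per cell, computed only from the cells that are not masked.
    """
    voices, steps = pitches.shape
    probs = np.ones((voices, steps, num_pitches))  # add-one smoothing
    for v in range(voices):
        visible = pitches[v][~mask[v]]             # context the model may see
        probs[v] += np.bincount(visible, minlength=num_pitches)
    return probs / probs.sum(axis=-1, keepdims=True)

def blocked_gibbs(pitches, model, num_pitches, num_sweeps=50,
                  block_prob=0.5, seed=0):
    """Repeatedly erase a random block of cells and resample it.

    Erased cells are drawn independently of one another given the cells
    that were kept, which is an approximation to an exact block update.
    """
    rng = np.random.default_rng(seed)
    pitches = pitches.copy()
    for _ in range(num_sweeps):
        mask = rng.random(pitches.shape) < block_prob   # cells to rewrite
        probs = model(pitches, mask, num_pitches)
        for v, t in zip(*np.nonzero(mask)):
            pitches[v, t] = rng.choice(num_pitches, p=probs[v, t])
    return pitches

# Usage: start from an arbitrary score and let the sampler rewrite it.
start = np.zeros((4, 8), dtype=int)
sample = blocked_gibbs(start, toy_conditional, num_pitches=12, num_sweeps=5)
```

Because no cell is generated before or after any other in a fixed order, the sketch reflects the point that the procedure is not tied to a causal direction of composition; any cell can be revisited in a later sweep.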

Cite

@article{arxiv.1903.07227,
  title  = {Counterpoint by Convolution},
  author = {Cheng-Zhi Anna Huang and Tim Cooijmans and Adam Roberts and Aaron Courville and Douglas Eck},
  journal= {arXiv preprint arXiv:1903.07227},
  year   = {2019}
}

Comments

Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017
