Proceedings of the International Conference on New Interfaces for Musical Expression. 2019, 325-330
Generating convincing music via deep neural networks is a challenging problem that shows promise for many applications including interactive musical creation. One part of this challenge is the problem of generating convincing accompaniment parts to a given melody, as could be used in an automatic accompaniment system. Despite much progress in this area, systems that can automatically learn to generate interesting and harmonically plausible accompaniments remain somewhat elusive. In this paper we explore systems where a user provides a sequence of notes, and a neural network model responds with an accompanying sequence of equal length. We consider two popular sequenceto- sequence models; one featuring standard unidirectional long short-term memory (LSTM) architecture, and the other featuring bidirectional LSTM. These are evaluated and compared via a qualitative study that features 106 respondents listening to eight random samples from our set of generated music, as well as two human samples. From the results we see a preference for the sequences generated by the bidirectional model as well as an indication that these sequences sound more human.
This item's license is: Attribution 4.0 International