Theme Transformer: Symbolic Music Generation with Theme-Conditioned Transformer

11/07/2021
by   Yi-Jen Shih, et al.
0

Attention-based Transformer models have been increasingly employed for automatic music generation. To condition the generation process of such a model with a user-specified sequence, a popular approach is to take that conditioning sequence as a priming sequence and ask a Transformer decoder to generate a continuation. However, this prompt-based conditioning cannot guarantee that the conditioning sequence would develop or even simply repeat itself in the generated continuation. In this paper, we propose an alternative conditioning approach, called theme-based conditioning, that explicitly trains the Transformer to treat the conditioning sequence as a thematic material that has to manifest itself multiple times in its generation result. This is achieved with two main technical contributions. First, we propose a deep learning-based approach that uses contrastive representation learning and clustering to automatically retrieve thematic materials from music pieces in the training data. Second, we propose a novel gated parallel attention module to be used in a sequence-to-sequence (seq2seq) encoder/decoder architecture to more effectively account for a given conditioning thematic material in the generation process of the Transformer decoder. We report on objective and subjective evaluations of variants of the proposed Theme Transformer and the conventional prompt-based baseline, showing that our best model can generate, to some extent, polyphonic pop piano music with repetition and plausible variations of a given condition.

READ FULL TEXT
research
10/06/2022

Melody Infilling with User-Provided Structural Context

This paper proposes a novel Transformer-based model for music score infi...
research
02/10/2023

GTR-CTRL: Instrument and Genre Conditioning for Guitar-Focused Music Generation with Transformers

Recently, symbolic music generation with deep learning techniques has wi...
research
02/09/2022

Conditional Drums Generation using Compound Word Representations

The field of automatic music composition has seen great progress in rece...
research
10/16/2019

Imperial College London Submission to VATEX Video Captioning Task

This paper describes the Imperial College London team's submission to th...
research
09/14/2020

Contrastive Triple Extraction with Generative Transformer

Triple extraction is an essential task in information extraction for nat...
research
12/21/2022

Generating music with sentiment using Transformer-GANs

The field of Automatic Music Generation has seen significant progress th...
research
09/01/2022

Generating Coherent Drum Accompaniment With Fills And Improvisations

Creating a complex work of art like music necessitates profound creativi...

Please sign up or login with your details

Forgot password? Click here to reset