The challenge of realistic music generation: modelling raw audio at scale

06/26/2018
by   Sander Dieleman, et al.
6

Realistic music generation is a challenging task. When building generative models of music that are learnt from data, typically high-level representations such as scores or MIDI are used that abstract away the idiosyncrasies of a particular performance. But these nuances are very important for our perception of musicality and realism, so in this work we embark on modelling music in the raw audio domain. It has been shown that autoregressive models excel at generating raw audio waveforms of speech, but when applied to music, we find them biased towards capturing local signal structure at the expense of modelling long-range correlations. This is problematic because music exhibits structure at many different timescales. In this work, we explore autoregressive discrete autoencoders (ADAs) as a means to enable autoregressive models to capture long-range correlations in waveforms. We find that they allow us to unconditionally generate piano music directly in the raw audio domain, which shows stylistic consistency across tens of seconds.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/26/2018

Conditioning Deep Generative Raw Audio Models for Structured Automatic Music

Existing automatic music generation approaches that feature deep learnin...
research
06/04/2019

MelNet: A Generative Model for Audio in the Frequency Domain

Capturing high-level structure in audio waveforms is challenging because...
research
01/12/2023

Rock Guitar Tablature Generation via Natural Language Processing

Deep learning has recently empowered and democratized generative modelin...
research
11/16/2018

Generating Black Metal and Math Rock: Beyond Bach, Beethoven, and Beatles

We use a modified SampleRNN architecture to generate music in modern gen...
research
01/12/2021

MP3net: coherent, minute-long music generation from raw audio with a simple convolutional GAN

We present a deep convolutional GAN which leverages techniques from MP3/...
research
07/21/2021

Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning

Deep generative models have recently achieved impressive performance in ...
research
07/20/2023

Progressive distillation diffusion for raw music generation

This paper aims to apply a new deep learning approach to the task of gen...

Please sign up or login with your details

Forgot password? Click here to reset