Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio Effects

11/04/2022
by   Junghyun Koo, et al.
0

We propose an end-to-end music mixing style transfer system that converts the mixing style of an input multitrack to that of a reference song. This is achieved with an encoder pre-trained with a contrastive objective to extract only audio effects related information from a reference music recording. All our models are trained in a self-supervised manner from an already-processed wet multitrack dataset with an effective data preprocessing method that alleviates the data scarcity of obtaining unprocessed dry data. We analyze the proposed encoder for the disentanglement capability of audio effects and also validate its performance for mixing style transfer through both objective and subjective evaluations. From the results, we show the proposed system not only converts the mixing style of multitrack audio close to a reference but is also robust with mixture-wise style transfer upon using a music source separation model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/17/2022

End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Mastering is an essential step in music production, but it is also a cha...
research
08/24/2022

Automatic music mixing with deep learning and out-of-domain data

Music mixing traditionally involves recording instruments in the form of...
research
02/10/2021

Self-Supervised VQ-VAE For One-Shot Music Style Transfer

Neural style transfer, allowing to apply the artistic style of one image...
research
05/31/2023

DC CoMix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer

Despite the huge successes made in neutral TTS, content-leakage remains ...
research
10/20/2020

Automatic multitrack mixing with a differentiable mixing console of neural audio effects

Applications of deep learning to automatic multitrack mixing are largely...
research
03/15/2023

Blind Estimation of Audio Processing Graph

Musicians and audio engineers sculpt and transform their sounds by conne...
research
05/21/2020

Pitchtron: Towards audiobook generation from ordinary people's voices

In this paper, we explore prosody transfer for audiobook generation unde...

Please sign up or login with your details

Forgot password? Click here to reset