Unsupervised Audio Source Separation via Spectrum Energy Preserved Wasserstein Learning

11/11/2017
by   Ning Zhang, et al.
0

Separating audio mixtures into individual tracks has been a long standing challenging task. We introduce a novel unsupervised audio source separation approach based on deep adversarial learning. Specifically, our loss function adopts the Wasserstein distance which directly measures the distribution distance between the separated sources and the real sources for each individual source. Moreover, a global regularization term is added to fulfill the spectrum energy preservation property regarding separation. Unlike state-of-the-art unsupervised models which often involve deliberately devised constraints or careful model selection, our approach need little prior model specification on the data, and can be straightforwardly learned in an end-to-end fashion. We show that the proposed method performs competitively on public benchmark against state-of-the-art unsupervised methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/23/2019

Model selection for deep audio source separation via clustering analysis

Audio source separation is the process of separating a mixture (e.g. a p...
research
10/11/2021

Source Mixing and Separation Robust Audio Steganography

Audio steganography aims at concealing secret information in carrier aud...
research
01/24/2022

Unsupervised Audio Source Separation Using Differentiable Parametric Source Models

Supervised deep learning approaches to underdetermined audio source sepa...
research
08/09/2023

Representation Learning for Audio Privacy Preservation using Source Separation and Robust Adversarial Learning

Privacy preservation has long been a concern in smart acoustic monitorin...
research
10/25/2021

Unsupervised Source Separation By Steering Pretrained Music Models

We showcase an unsupervised method that repurposes deep models trained f...
research
12/21/2019

Deep Audio Prior

Deep convolutional neural networks are known to specialize in distilling...
research
02/11/2021

Multichannel-based learning for audio object extraction

The current paradigm for creating and deploying immersive audio content ...

Please sign up or login with your details

Forgot password? Click here to reset