Improving Universal Sound Separation Using Sound Classification

11/18/2019
by   Efthymios Tzinis, et al.
0

Deep learning approaches have recently achieved impressive performance on both audio source separation and sound classification. Most audio source separation approaches focus only on separating sources belonging to a restricted domain of source classes, such as speech and music. However, recent work has demonstrated the possibility of "universal sound separation", which aims to separate acoustic sources from an open domain, regardless of their class. In this paper, we utilize the semantic information learned by sound classifier networks trained on a vast amount of diverse sounds to improve universal sound separation. In particular, we show that semantic embeddings extracted from a sound classifier can be used to condition a separation network, providing it with useful additional information. This approach is especially useful in an iterative setup, where source estimates from an initial separation stage and their corresponding classifier-derived embeddings are fed to a second separation network. By performing a thorough hyperparameter search consisting of over a thousand experiments, we find that classifier embeddings from clean sources provide nearly one dB of SNR gain, and our best iterative models achieve a significant fraction of this oracle performance, establishing a new state-of-the-art for universal sound separation.

READ FULL TEXT
research
05/11/2023

Universal Source Separation with Weakly Labelled Data

Universal source separation (USS) is a fundamental research task for com...
research
11/02/2020

What's All the FUSS About Free Universal Sound Separation Data?

We introduce the Free Universal Sound Separation (FUSS) dataset, a new c...
research
07/27/2023

Complete and separate: Conditional separation with missing target source attribute completion

Recent approaches in source separation leverage semantic information abo...
research
05/12/2023

Benchmarks and leaderboards for sound demixing tasks

Music demixing is the task of separating different tracks from the given...
research
07/15/2020

Separating Sounds from a Single Image

Recently, visual information has been widely used to aid the sound sourc...
research
12/21/2019

Deep Audio Prior

Deep convolutional neural networks are known to specialize in distilling...
research
06/10/2020

Listen to What You Want: Neural Network-based Universal Sound Selector

Being able to control the acoustic events (AEs) to which we want to list...

Please sign up or login with your details

Forgot password? Click here to reset