Single channel voice separation for unknown number of speakers under reverberant and noisy settings

11/04/2020
by Shlomo E. Chazan, et al.

We present a unified network for voice separation of an unknown number of speakers. The proposed approach is composed of several separation heads optimized together with a speaker classification branch. The separation is carried out in the time domain, together with parameter sharing between all separation heads. The classification branch estimates the number of speakers while each head is specialized in separating a different number of speakers. We evaluate the proposed model under both clean and noisy reverberant settings. Results suggest that the proposed approach is superior to the baseline model by a significant margin. Additionally, we present a new noisy and reverberant dataset of up to five different speakers speaking simultaneously.
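The routing idea in the abstract (a classification branch picks the speaker count, and the matching head produces that many outputs from shared features) can be illustrated with a toy sketch. Everything below is a hypothetical stand-in, not the authors' model: the function names, the supported counts (2 to 5, assumed from "up to five"), and the placeholder arithmetic inside each function are assumptions made only to show the control flow.

```python
# Toy illustration of "classifier selects a separation head".
# All names and the arithmetic are hypothetical placeholders, not the paper's network.

SPEAKER_COUNTS = (2, 3, 4, 5)  # assumed supported counts ("up to five" in the abstract)


def shared_features(mixture):
    """Stand-in for the trunk whose parameters all heads share."""
    return list(mixture)


def count_logits(features):
    """Stand-in classification branch: one score per supported speaker count."""
    energy = sum(x * x for x in features) / len(features)
    return [-abs(energy - c) for c in SPEAKER_COUNTS]


def separation_head(features, num_speakers):
    """Stand-in head: returns num_speakers time-domain estimates."""
    return [[x / num_speakers for x in features] for _ in range(num_speakers)]


def separate(mixture):
    """Estimate the speaker count, then run the head specialized for that count."""
    features = shared_features(mixture)
    logits = count_logits(features)
    best = max(range(len(SPEAKER_COUNTS)), key=lambda i: logits[i])
    k = SPEAKER_COUNTS[best]
    return k, separation_head(features, k)


if __name__ == "__main__":
    k, estimates = separate([2.0] * 8)
    print(k, len(estimates))
```

In a real system the trunk, heads, and classifier would be learned jointly; the sketch only shows that the number of output signals is decided at inference time by the classifier's prediction.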

research
02/29/2020

Voice Separation with an Unknown Number of Multiple Speakers

We present a new method for separating a mixed audio sequence, in which ...
research
04/05/2019

Recursive speech separation for unknown number of speakers

In this paper we propose a method of single-channel speaker-independent ...
research
04/18/2021

Many-Speakers Single Channel Speech Separation with Optimal Permutation Training

Single channel speech separation has experienced great progress in the l...
research
11/24/2020

Multi-Decoder DPRNN: High Accuracy Source Counting and Separation

We propose an end-to-end trainable approach to single-channel speech sep...
research
05/24/2022

SepIt: Approaching a Single Channel Speech Separation Bound

We present an upper bound for the Single Channel Speech Separation task,...
research
08/12/2020

Channel-wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music

This paper presents a new input format, channel-wise subband input (CWS)...
research
09/15/2020

When Automatic Voice Disguise Meets Automatic Speaker Verification

The technique of transforming voices in order to hide the real identity ...