Simultaneous Denoising and Dereverberation Using Deep Embedding Features

04/06/2020
by   Cunhang Fan, et al.
0

Monaural speech dereverberation is a very challenging task because no spatial cues can be used. When the additive noises exist, this task becomes more challenging. In this paper, we propose a joint training method for simultaneous speech denoising and dereverberation using deep embedding features, which is based on the deep clustering (DC). DC is a state-of-the-art method for speech separation that includes embedding learning and K-means clustering. As for our proposed method, it contains two stages: denoising and dereverberation. At the denoising stage, the DC network is leveraged to extract noise-free deep embedding features. These embedding features are generated from the anechoic speech and residual reverberation signals. They can represent the inferred spectral masking patterns of the desired signals, which are discriminative features. At the dereverberation stage, instead of using the unsupervised K-means clustering algorithm, another supervised neural network is utilized to estimate the anechoic speech from these deep embedding features. Finally, the denoising stage and dereverberation stage are optimized by the joint training method. Experimental results show that the proposed method outperforms the WPE and BLSTM baselines, especially in the low SNR condition.

READ FULL TEXT
research
07/23/2019

Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features

Deep clustering (DC) and utterance-level permutation invariant training ...
research
02/05/2020

Spatial and spectral deep attention fusion for multi-channel speech separation using deep embedding features

Multi-channel deep clustering (MDC) has acquired a good performance for ...
research
03/04/2013

Denoising Deep Neural Networks Based Voice Activity Detection

Recently, the deep-belief-networks (DBN) based voice activity detection ...
research
01/02/2019

Optical Fringe Patterns Filtering Based on Multi-Stage Convolution Neural Network

Optical fringe patterns are often contaminated by speckle noise, making ...
research
04/06/2022

Spectral Denoising for Microphone Classification

In this paper, we propose the use of denoising for microphone classifica...
research
09/18/2020

X-DC: Explainable Deep Clustering based on Learnable Spectrogram Templates

Deep neural networks (DNNs) have achieved substantial predictive perform...
research
10/24/2019

Multi-channel Speech Separation Using Deep Embedding Model with Multilayer Bootstrap Networks

Recently, deep clustering (DPCL) based speaker-independent speech separa...

Please sign up or login with your details

Forgot password? Click here to reset