Multichannel Source Separation and Speech Enhancement Using the Convolutive Transfer Function

11/21/2017
by   Xiaofei Li, et al.

This paper addresses the problem of audio source recovery from multichannel noisy convolutive mixtures for source separation and speech enhancement, assuming known mixing filters. We propose to conduct the source recovery in the short-time Fourier transform domain, based on the convolutive transfer function (CTF) approximation. Compared to the time-domain filters, the CTF has far fewer taps, and thus fewer near-common zeros among channels and lower computational complexity. This work proposes three source recovery methods: i) the multichannel inverse filtering method, i.e. the multiple input/output inverse theorem (MINT), is exploited in the CTF domain and for the multisource case; ii) a beamforming-like multichannel inverse filtering method is proposed, applying the single-source MINT and power minimization, which is suitable when the CTFs of only some of the sources are known; iii) a constrained Lasso method, in which the sources are recovered by minimizing their ℓ_1-norm to impose spectral sparsity, under the constraint that the ℓ_2-norm fitting cost between the microphone signals and the mixture model involving the unknown source signals is less than a tolerance. The noise can be reduced by setting the tolerance to the noise power. Experiments under various acoustic conditions are conducted to evaluate the three proposed methods. Comparisons among them and with baseline methods are presented.
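The multichannel inverse filtering idea behind method i) can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes real-valued toy filters drawn at random (the paper works with complex CTF coefficients per frequency bin), and the helper names `convolution_matrix` and `mint_inverse_filters` are hypothetical. The sketch solves, in the least-squares sense, for inverse filters whose convolutions with the known channel filters sum to an impulse.

```python
import numpy as np

def convolution_matrix(a, k):
    """Return the (len(a)+k-1) x k matrix C with C @ g == np.convolve(a, g)."""
    n = len(a)
    C = np.zeros((n + k - 1, k))
    for j in range(k):
        C[j:j + n, j] = a  # column j holds a shifted down by j samples
    return C

def mint_inverse_filters(filters, k, delay=0):
    """Least-squares MINT sketch: find g_i (length k) such that
    sum_i a_i * g_i approximates an impulse at index `delay`."""
    A = np.hstack([convolution_matrix(a, k) for a in filters])
    target = np.zeros(A.shape[0])
    target[delay] = 1.0
    g, *_ = np.linalg.lstsq(A, target, rcond=None)
    return np.split(g, len(filters))

# Toy data (an assumption for illustration): three channels, 8 taps each.
rng = np.random.default_rng(0)
filters = [rng.standard_normal(8) for _ in range(3)]
inverse = mint_inverse_filters(filters, k=7)

# Overall response of the filter-and-sum system; ideally a unit impulse.
overall = sum(np.convolve(a, g) for a, g in zip(filters, inverse))
```

With three channels of 8 taps and inverse filters of 7 taps, the linear system has 14 equations and 21 unknowns, so an exact inverse exists as long as the channel filters share no common zeros. Shorter filters, as the CTF provides, keep this system small.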


research
11/21/2017

Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function

This paper addresses the problem of speech separation and enhancement fr...
research
04/10/2019

Expectation-Maximization for Speech Source Separation Using Convolutive Transfer Function

This paper addresses the problem of under-determined speech source sepa...
research
12/20/2018

Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering

This paper addresses the problem of multichannel online dereverberation....
research
02/20/2020

Efficient Trainable Front-Ends for Neural Speech Enhancement

Many neural speech enhancement and source separation systems operate in ...
research
10/31/2022

Diffusion-based Generative Speech Source Separation

We propose DiffSep, a new single channel source separation method based ...
research
02/09/2023

Joint Acoustic Echo Cancellation and Speech Dereverberation Using Kalman filters

This paper proposes a joint acoustic echo cancellation (AEC) and speech ...
research
03/03/2023

Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints

Audio source separation is often achieved by estimating the magnitude sp...
