ML Estimation and CRBs for Reverberation, Speech and Noise PSDs in Rank-Deficient Noise-Field

by   Yaron Laufer, et al.

Speech communication systems are prone to performance degradation in reverberant and noisy acoustic environments. Dereverberation and noise reduction algorithms typically require several model parameters, e.g. the speech, reverberation and noise power spectral densities (PSDs). A commonly used assumption is that the noise PSD matrix is known. However, in practical acoustic scenarios, the noise PSD matrix is unknown and should be estimated along with the speech and reverberation PSDs. In this paper, we consider the case of rank-deficient noise PSD matrix, which arises when the noise signal consists of multiple directional interference sources, whose number is less than the number of microphones. We derive two closed-form maximum likelihood estimators (MLEs). The first is a non-blocking-based estimator which jointly estimates the speech, reverberation and noise PSDs, and the second is a blocking-based estimator, which first blocks the speech signal and then jointly estimates the reverberation and noise PSDs. Both estimators are analytically compared and analyzed, and mean square errors (MSEs) expressions are derived. Furthermore, Cramer-Rao Bounds (CRBs) on the estimated PSDs are derived. The proposed estimators are examined using both simulation and real reverberant and noisy signals, demonstrating the advantage of the proposed method compared to competing estimators.


page 1

page 10


Normalized Features for Improving the Generalization of DNN Based Speech Enhancement

Enhancing noisy speech is an important task to restore its quality and t...

Robust Joint Estimation of Multi-Microphone Signal Model Parameters

One of the biggest challenges in multi-microphone applications is the es...

Optimal Binaural LCMV Beamforming in Complex Acoustic Scenarios: Theoretical and Practical Insights

Binaural beamforming algorithms for head-mounted assistive listening dev...

Efficient ML Direction of Arrival Estimation assuming Unknown Sensor Noise Powers

This paper presents an efficient method for computing maximum likelihood...

Model-Based Speech Enhancement in the Modulation Domain

This paper presents algorithms for modulation-domain speech enhancement ...

Bias Correction and Modified Profile Likelihood under the Wishart Complex Distribution

This paper proposes improved methods for the maximum likelihood (ML) est...

Speech segmentation using multilevel hybrid filters

A novel approach for speech segmentation is proposed, based on Multileve...

Please sign up or login with your details

Forgot password? Click here to reset