Subband Weighting for Binaural Speech Source Localization

01/28/2020
by   Parth Suresh, et al.
0

We consider the task of speech source localization from a bin-aural recording using interaural time difference (ITD). A typical approach is to process binaural speech using gammatone filters and calculate frame-level ITD in each subband. The ITDs in each gammatone subband are statistically modeled using Gaussian mixture models (GMMs) for every direction during training. Given a binaural test-speech, the source is localized using maximum likelihood (ML) criterion. In this work, we pro-pose a subband weighting scheme where subband likelihoods are weighted based on their reliability. We measure the reliability of a subband using the average frame level localization error obtained for the respective subbands. These reliability values are used as the weights for each subband likelihood prior to combining the likelihoods for ML estimation. We also introduce non-linear warping of these weights to accommodate and analyse a larger space of possible subband weights. Experiments on Subject003 from the CIPIC database reveal that weighting the subbands is better than the unweighted scheme of combining likelihoods

READ FULL TEXT
research
04/12/2016

Noise Robust Speech Recognition Using Multi-Channel Based Channel Selection And ChannelWeighting

In this paper, we study several microphone channel selection and weighti...
research
10/31/2017

Nebula: F0 Estimation and Voicing Detection by Modeling the Statistical Properties of Feature Extractors

A F0 and voicing status estimation algorithm for speech analysis/synthes...
research
03/26/2022

Current Source Localization Using Deep Prior with Depth Weighting

This paper proposes a novel neuronal current source localization method ...
research
07/26/2016

Variational Mixture Models with Gamma or inverse-Gamma components

Mixture models with Gamma and or inverse-Gamma distributed mixture compo...
research
02/20/2023

A DNN based Normalized Time-frequency Weighted Criterion for Robust Wideband DoA Estimation

Deep neural networks (DNNs) have greatly benefited direction of arrival ...
research
03/01/2023

Understanding the Diffusion Objective as a Weighted Integral of ELBOs

Diffusion models in the literature are optimized with various objectives...
research
09/28/2018

Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments

This paper addresses the problem of online multiple-speaker localization...

Please sign up or login with your details

Forgot password? Click here to reset