Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks

04/05/2019
by   Ning Ma, et al.
0

Despite there being clear evidence for top-down (e.g., attentional) effects in biological spatial hearing, relatively few machine hearing systems exploit top-down model-based knowledge in sound localisation. This paper addresses this issue by proposing a novel framework for binaural sound localisation that combines model-based information about the spectral characteristics of sound sources and deep neural networks (DNNs). A target source model and a background source model are first estimated during a training phase using spectral features extracted from sound signals in isolation. When the identity of the background source is not available, a universal background model can be used. During testing, the source models are used jointly to explain the mixed observations and improve the localisation process by selectively weighting source azimuth posteriors output by a DNN-based localisation system. To address the possible mismatch between training and testing, a model adaptation process is further employed on-the-fly during testing, which adapts the background model parameters directly from the noisy observations in an iterative manner. The proposed system therefore combines model-based and data-driven information flow within a single computational framework. The evaluation task involved localisation of a target speech source in the presence of an interfering source and room reverberation. Our experiments show that by exploiting model-based information in this way, sound localisation performance can be improved substantially under various noisy and reverberant conditions.

READ FULL TEXT

page 1

page 5

page 8

page 9

page 10

research
04/05/2019

Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environments

This paper presents a novel machine-hearing system that exploits deep ne...
research
09/08/2021

A Survey of Sound Source Localization with Deep Learning Methods

This article is a survey on deep learning methods for single and multipl...
research
11/18/2020

Statistical model-based evaluation of neural networks

Using a statistical model-based data generation, we develop an experimen...
research
07/28/2020

Self-supervised Neural Audio-Visual Sound Source Localization via Probabilistic Spatial Modeling

Detecting sound source objects within visual observation is important fo...
research
10/13/2021

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection

Recording and annotating real sound events for a sound event localizatio...
research
02/16/2022

SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization

Multiple moving sound source localization in real-world scenarios remain...
research
02/11/2015

Gaussian Process Models for HRTF based Sound-Source Localization and Active-Learning

From a machine learning perspective, the human ability localize sounds c...

Please sign up or login with your details

Forgot password? Click here to reset