Gaussian Process Models for HRTF based Sound-Source Localization and Active-Learning

02/11/2015
by Yuancheng Luo, et al.

From a machine learning perspective, the human ability to localize sounds can be modeled as a non-parametric, non-linear regression problem between binaural spectral features of the sound received at the ears (input) and the corresponding sound-source directions (output). The input features can be summarized in terms of the individual's head-related transfer functions (HRTFs), which measure the spectral response between the listener's eardrum and an external point in 3D. From this viewpoint, two related problems are considered: how to achieve an optimal sampling of measurements for training sound-source localization (SSL) models, and how to use SSL models to infer the subject's HRTFs in listening tests. First, we develop a class of binaural SSL models based on Gaussian process regression and solve a forward selection problem that finds a subset of input-output samples that best generalizes to all SSL directions. Second, we use an active-learning approach that updates an online SSL model for inferring the subject's SSL errors via headphones and a graphical user interface. Experiments show that only a small fraction of the HRTFs is required for 5° localization accuracy and that the learned HRTFs are localized closer to their intended directions than non-individualized HRTFs.
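The first problem in the abstract, Gaussian process regression combined with forward selection of training samples, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the squared-exponential kernel, the fixed hyperparameters, the mean-squared-error selection criterion, the function names, and the synthetic 2D inputs that stand in for binaural spectral features. The paper's actual features, covariance functions, and selection objective may differ.

```python
import numpy as np

def rbf_kernel(A, B, length_scale=1.0):
    """Squared-exponential kernel between the rows of A and the rows of B."""
    sq = (np.sum(A**2, axis=1)[:, None]
          + np.sum(B**2, axis=1)[None, :]
          - 2.0 * A @ B.T)
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-0.5 * np.maximum(sq, 0.0) / length_scale**2)

def gp_predict(X_train, y_train, X_test, length_scale=1.0, noise=1e-3):
    """Posterior mean of a zero-mean GP regressor evaluated at X_test."""
    K = rbf_kernel(X_train, X_train, length_scale) + noise * np.eye(len(X_train))
    K_star = rbf_kernel(X_test, X_train, length_scale)
    return K_star @ np.linalg.solve(K, y_train)

def forward_select(X, y, n_select, length_scale=1.0, noise=1e-3):
    """Greedy forward selection (assumed criterion): at each step, add the
    sample whose inclusion gives the lowest mean squared error over ALL samples."""
    selected = []
    remaining = list(range(len(X)))
    for _ in range(n_select):
        best_idx, best_err = None, np.inf
        for i in remaining:
            trial = selected + [i]
            pred = gp_predict(X[trial], y[trial], X, length_scale, noise)
            err = np.mean((pred - y) ** 2)
            if err < best_err:
                best_idx, best_err = i, err
        selected.append(best_idx)
        remaining.remove(best_idx)
    return selected

# Synthetic stand-in data: 2D inputs and a smooth scalar target.
rng = np.random.default_rng(0)
X = rng.uniform(-3.0, 3.0, size=(40, 2))
y = np.sin(X[:, 0]) + 0.5 * np.cos(X[:, 1])

subset = forward_select(X, y, n_select=8)
pred = gp_predict(X[subset], y[subset], X)
print("selected indices:", subset)
print("MSE over all samples:", float(np.mean((pred - y) ** 2)))
```

This greedy loop refits the model for every candidate, which is simple to read but expensive for large sample sets. An exact solver would typically reuse factorizations between steps, and the choice of criterion determines which samples are favored.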


research
04/23/2020

Active Learning for Gaussian Process Considering Uncertainties with Application to Shape Control of Composite Fuselage

In the machine learning domain, active learning is an iterative data sel...
research
07/13/2019

The Use of Gaussian Processes in System Identification

Gaussian processes are used in machine learning to learn input-output ma...
research
05/05/2022

Uncertainty-Based Non-Parametric Active Peak Detection

Active, non-parametric peak detection is considered. As a use case, acti...
research
12/06/2018

Binaural Source Localization based on Modulation-Domain Features and Decision Pooling

In this work we apply Amplitude Modulation Spectrum (AMS) features to th...
research
05/11/2023

Sequential Experimental Design for Spectral Measurement: Active Learning Using a Parametric Model

In this study, we demonstrate a sequential experimental design for spect...
research
10/18/2022

Locally Smoothed Gaussian Process Regression

We develop a novel framework to accelerate Gaussian process regression (...
research
04/05/2019

Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks

Despite there being clear evidence for top-down (e.g., attentional) effe...
