Towards End-to-End Acoustic Localization using Deep Learning: from Audio Signal to Source Position Coordinates

07/29/2018
by   Juan Manuel Vera-Diaz, et al.
0

This paper presents a novel approach for indoor acoustic source localization using microphone arrays and based on a Convolutional Neural Network (CNN). The proposed solution is, to the best of our knowledge, the first published work in which the CNN is designed to directly estimate the three dimensional position of an acoustic source, using the raw audio signal as the input information avoiding the use of hand crafted audio features. Given the limited amount of available localization data, we propose in this paper a training strategy based on two steps. We first train our network using semi-synthetic data, generated from close talk speech recordings, and where we simulate the time delays and distortion suffered in the signal that propagates from the source to the array of microphones. We then fine tune this network using a small amount of real data. Our experimental results show that this strategy is able to produce networks that significantly improve existing localization methods based on SRP-PHAT strategies. In addition, our experiments show that our CNN method exhibits better resistance against varying gender of the speaker and different window sizes compared with the other methods.

READ FULL TEXT

page 1

page 5

page 8

page 14

research
08/08/2023

Dual input neural networks for positional sound source localization

In many signal processing applications, metadata may be advantageously u...
research
03/02/2021

DOANet: a deep dilated convolutional neural network approach for search and rescue with drone-embedded sound source localization

Drone-embedded sound source localization (SSL) has interesting applicati...
research
11/06/2020

Misalignment Recognition in Acoustic Sensor Networks using a Semi-supervised Source Estimation Method and Markov Random Fields

In this paper, we consider the problem of acoustic source localization b...
research
06/13/2013

Physeter catodon localization by sparse coding

This paper presents a spermwhale' localization architecture using jointl...
research
06/19/2023

Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming

Recently, many forms of audio industrial applications, such as sound mon...
research
10/25/2021

Automatic Impact-sounding Acoustic Inspection of Concrete Structure

Impact sounding signal has been shown to contain information about struc...

Please sign up or login with your details

Forgot password? Click here to reset