Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs

03/31/2022
by   David Diaz-Guerra, et al.
4

In this paper, we present a new model for Direction of Arrival (DOA) estimation of sound sources based on an Icosahedral Convolutional Neural Network (CNN) applied over SRP-PHAT power maps computed from the signals received by a microphone array. This icosahedral CNN is equivariant to the 60 rotational symmetries of the icosahedron, which represent a good approximation of the continuous space of spherical rotations, and can be implemented using standard 2D convolutional layers, having a lower computational cost than most of the spherical CNNs. In addition, instead of using fully connected layers after the icosahedral convolutions, we propose a new soft-argmax function that can be seen as a differentiable version of the argmax function and allows us to solve the DOA estimation as a regression problem interpreting the output of the convolutional layers as a probability distribution. We prove that using models that fit the equivariances of the problem allows us to outperform other state-of-the-art models with a lower computational cost and more robustness, obtaining root mean square localization errors lower than 10 even in scenarios with a reverberation time T_60 of 1.5 s.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 7

page 8

page 9

research
06/16/2020

Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks

In this paper, we present a new sound source DOA estimation and tracking...
research
05/09/2023

Accurate Real-Time Estimation of 2-Dimensional Direction of Arrival using a 3-Microphone Array

This paper presents a method for real-time estimation of 2-dimensional d...
research
09/27/2022

Scalable and Equivariant Spherical CNNs by Discrete-Continuous (DISCO) Convolutions

No existing spherical convolutional neural network (CNN) framework is bo...
research
03/12/2021

Dilated Fully Convolutional Neural Network for Depth Estimation from a Single Image

Depth prediction plays a key role in understanding a 3D scene. Several t...
research
12/13/2021

Extension of Convolutional Neural Network along Temporal and Vertical Directions for Precipitation Downscaling

Deep learning has been utilized for the statistical downscaling of clima...
research
04/17/2019

Regression and Classification for Direction-of-Arrival Estimation with Convolutional Recurrent Neural Networks

We present a novel learning-based approach to estimate the direction-of-...
research
11/20/2018

Multi-scale aggregation of phase information for reducing computational cost of CNN based DOA estimation

In a recent work on direction-of-arrival (DOA) estimation of multiple sp...

Please sign up or login with your details

Forgot password? Click here to reset