Small footprint Text-Independent Speaker Verification for Embedded Systems

11/03/2020
by Julien Balian, et al.

Deep neural network approaches to speaker verification have proven successful, but the computational requirements of typical state-of-the-art (SOTA) systems make them unsuited for embedded applications. In this work, we present a two-stage model architecture orders of magnitude smaller than common solutions (237.5K learning parameters, 11.5 MFLOPS) that reaches a competitive Equal Error Rate (EER) of 3.31% on the verification test set. We demonstrate that our solution can run on small devices typical of IoT systems, such as the Raspberry Pi 3B, with a latency below 200 ms on a 5 s utterance. Additionally, we evaluate our model on the acoustically challenging VOiCES corpus. We report a limited increase in EER of 2.6 percentage points with respect to the best-scoring model of the 2019 VOiCES from a Distance Challenge, against a 25.6-fold reduction in the number of learning parameters.
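The results above are reported as an Equal Error Rate, the operating point at which the false-acceptance rate equals the false-rejection rate. As a minimal sketch of how such a figure could be computed from trial scores (not the authors' evaluation code; the function name `equal_error_rate` and the toy scores are illustrative assumptions), one can sweep every observed score as a decision threshold:

```python
def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by trying each observed score as a threshold.

    target_scores: similarity scores for same-speaker trials.
    nontarget_scores: similarity scores for different-speaker trials.
    Assumption: a trial is accepted when its score is >= the threshold.
    """
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    best_gap, eer = float("inf"), None
    for thr in thresholds:
        # False acceptance: a different-speaker trial scores at or above the threshold.
        far = sum(s >= thr for s in nontarget_scores) / len(nontarget_scores)
        # False rejection: a same-speaker trial scores below the threshold.
        frr = sum(s < thr for s in target_scores) / len(target_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            # Keep the threshold where the two error rates are closest.
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Hypothetical toy scores, for illustration only.
toy_targets = [0.9, 0.8, 0.7, 0.4]
toy_nontargets = [0.6, 0.3, 0.2, 0.1]
print(f"EER = {100 * equal_error_rate(toy_targets, toy_nontargets):.1f}%")
```

With few trials this threshold sweep gives only a coarse estimate; evaluation toolkits usually interpolate between neighbouring operating points.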

research
04/06/2021

Binary Neural Network for Speaker Verification

Although deep neural networks are successful for many tasks in the speec...
research
08/20/2020

Speaker-Utterance Dual Attention for Speaker and Utterance Verification

In this paper, we study a novel technique that exploits the interaction ...
research
02/03/2022

MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances

The time delay neural network (TDNN) represents one of the state-of-the-...
research
03/13/2018

Deep CNN based feature extractor for text-prompted speaker recognition

Deep learning is still not a very common tool in speaker verification fi...
research
11/26/2019

A discriminative condition-aware backend for speaker verification

We present a scoring approach for speaker verification that mimics the s...
research
05/19/2018

Sparse Architectures for Text-Independent Speaker Verification Using Deep Neural Networks

Network pruning is of great importance due to the elimination of the uni...
research
02/23/2020

Comparing the Parameter Complexity of Hypernetworks and the Embedding-Based Alternative

In the context of learning to map an input I to a function h_I:X→R, we c...
