NPLDA: A Deep Neural PLDA Model for Speaker Verification

02/10/2020
by   Shreyas Ramoji, et al.
0

The state-of-art approach for speaker verification consists of a neural network based embedding extractor along with a backend generative model such as the Probabilistic Linear Discriminant Analysis (PLDA). In this work, we propose a neural network approach for backend modeling in speaker recognition. The likelihood ratio score of the generative PLDA model is posed as a discriminative similarity function and the learnable parameters of the score function are optimized using a verification cost. The proposed model, termed as neural PLDA (NPLDA), is initialized using the generative PLDA model parameters. The loss function for the NPLDA model is an approximation of the minimum detection cost function (DCF). The speaker recognition experiments using the NPLDA model are performed on the speaker verificiation task in the VOiCES datasets as well as the SITW challenge dataset. In these experiments, the NPLDA model optimized using the proposed loss function improves significantly over the state-of-art PLDA based speaker verification system.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/11/2020

Neural PLDA Modeling for End-to-End Speaker Verification

While deep learning models have made significant advances in supervised ...
research
10/10/2020

Remarks on Optimal Scores for Speaker Recognition

In this article, we first establish the theory of optimal scores for spe...
research
02/22/2016

Blind score normalization method for PLDA based speaker recognition

Probabilistic Linear Discriminant Analysis (PLDA) has become state-of-th...
research
09/05/2021

Efficient Attention Branch Network with Combined Loss Function for Automatic Speaker Verification Spoof Detection

Many endeavors have sought to develop countermeasure techniques as enhan...
research
01/09/2021

Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification

The task for speaker verification (SV) is to decide an utterance is spok...
research
08/25/2018

Multiobjective Optimization Training of PLDA for Speaker Verification

Most current state-of-the-art text-independent speaker verification syst...
research
04/07/2021

Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

Generative probability models are widely used for speaker verification (...

Please sign up or login with your details

Forgot password? Click here to reset