Recursive Whitening Transformation for Speaker Recognition on Language Mismatched Condition

08/03/2017
by   Suwon Shon, et al.
0

Recently in speaker recognition, performance degradation due to the channel domain mismatched condition has been actively addressed. However, the mismatches arising from language is yet to be sufficiently addressed. This paper proposes an approach which employs recursive whitening transformation to mitigate the language mismatched condition. The proposed method is based on the multiple whitening transformation, which is intended to remove un-whitened residual components in the dataset associated with i-vector length normalization. The experiments were conducted on the Speaker Recognition Evaluation 2016 trials of which the task is non-English speaker recognition using development dataset consist of both a large scale out-of-domain (English) dataset and an extremely low-quantity in-domain (non-English) dataset. For performance comparison, we develop a state-of- the-art system using deep neural network and bottleneck feature, which is based on a phonetically aware model. From the experimental results, along with other prior studies, effectiveness of the proposed method on language mismatched condition is validated.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/03/2017

KU-ISPL Speaker Recognition Systems under Language mismatch condition for NIST 2016 Speaker Recognition Evaluation

Korea University Intelligent Signal Processing Lab. (KU-ISPL) developed ...
research
02/22/2016

Blind score normalization method for PLDA based speaker recognition

Probabilistic Linear Discriminant Analysis (PLDA) has become state-of-th...
research
04/03/2015

A Unified Deep Neural Network for Speaker and Language Recognition

Learned feature representations and sub-phoneme posteriors from Deep Neu...
research
08/03/2017

Autoencoder based Domain Adaptation for Speaker Recognition under Insufficient Channel Information

In real-life conditions, mismatch between development and test domain de...
research
04/07/2020

Deep Normalization for Speaker Vectors

Deep speaker embedding has demonstrated state-of-the-art performance in ...
research
10/31/2019

CN-CELEB: a challenging Chinese speaker recognition dataset

Recently, researchers set an ambitious goal of conducting speaker recogn...
research
03/04/2022

On the relevance of language in speaker recognition

This paper presents a new database collected from a bilingual speakers s...

Please sign up or login with your details

Forgot password? Click here to reset