PAS: Partial Additive Speech Data Augmentation Method for Noise Robust Speaker Verification

07/20/2023
by   Wonbin Kim, et al.
0

Background noise reduces speech intelligibility and quality, making speaker verification (SV) in noisy environments a challenging task. To improve the noise robustness of SV systems, additive noise data augmentation method has been commonly used. In this paper, we propose a new additive noise method, partial additive speech (PAS), which aims to train SV systems to be less affected by noisy environments. The experimental results demonstrate that PAS outperforms traditional additive noise in terms of equal error rates (EER), with relative improvements of 4.64 ECAPA-TDNN. We also show the effectiveness of proposed method by analyzing attention modules and visualizing speaker embeddings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/25/2019

Unsupervised Feature Enhancement for speaker verification

The task of making speaker verification systems robust to adverse scenar...
research
06/29/2020

Data augmentation versus noise compensation for x- vector speaker recognition systems in noisy environments

The explosion of available speech data and new speaker modeling methods ...
research
07/12/2020

Data augmentation enhanced speaker enrollment for text-dependent speaker verification

Data augmentation is commonly used for generating additional data from t...
research
03/13/2020

End-to-end Recurrent Denoising Autoencoder Embeddings for Speaker Identification

Speech 'in-the-wild' is a handicap for speaker recognition systems due t...
research
05/03/2018

Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification

The performance of speaker-related systems usually degrades heavily in p...
research
02/15/2019

An improved uncertainty propagation method for robust i-vector based speaker recognition

The performance of automatic speaker recognition systems degrades when f...
research
02/19/2021

Unit selection synthesis based data augmentation for fixed phrase speaker verification

Data augmentation is commonly used to help build a robust speaker verifi...

Please sign up or login with your details

Forgot password? Click here to reset