research ∙ 07/29/2022
Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition
For Mandarin end-to-end (E2E) automatic speech recognition (ASR) tasks, ...
research ∙ 04/08/2022
Transducer-based language embedding for spoken language identification
The acoustic and linguistic features are important cues for the spoken l...
research ∙ 03/31/2022
Partial Coupling of Optimal Transport for Spoken Language Identification
In order to reduce domain discrepancy to improve the performance of cros...
research ∙ 04/07/2021
Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification
Generative probability models are widely used for speaker verification (...
research ∙ 01/09/2021
Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification
The task for speaker verification (SV) is to decide whether an utterance is spok...
research ∙ 12/24/2020
Unsupervised neural adaptation model based on optimal transport for spoken language identification
Due to the mismatch of statistical distributions of acoustic speech betw...
research ∙ 12/27/2019