Peng Shen

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Yu Tsao
127 publications
Sheng Li
85 publications
Xugang Lu
24 publications
Hisashi Kawai
21 publications

research

∙ 07/29/2022

Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition

For Mandarin end-to-end (E2E) automatic speech recognition (ASR) tasks, ...

0 Peng Shen, et al. ∙

research

∙ 04/08/2022

Transducer-based language embedding for spoken language identification

The acoustic and linguistic features are important cues for the spoken l...

0 Peng Shen, et al. ∙

research

∙ 03/31/2022

Partial Coupling of Optimal Transport for Spoken Language Identification

In order to reduce domain discrepancy to improve the performance of cros...

0 Xugang Lu, et al. ∙

research

∙ 04/07/2021

Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

Generative probability models are widely used for speaker verification (...

0 Xugang Lu, et al. ∙

research

∙ 01/09/2021

Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification

The task for speaker verification (SV) is to decide an utterance is spok...

0 Xugang Lu, et al. ∙

research

∙ 12/24/2020

Unsupervised neural adaptation model based on optimal transport for spoken language identification

Due to the mismatch of statistical distributions of acoustic speech betw...

0 Xugang Lu, et al. ∙

research

∙ 12/27/2019

Deep progressive multi-scale attention for acoustic event classification

Convolutional neural network (CNN) is an indispensable building block fo...

0 Xugang Lu, et al. ∙

Success!

An error occurred

Peng Shen

Featured Co-authors

Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition

Transducer-based language embedding for spoken language identification

Partial Coupling of Optimal Transport for Spoken Language Identification

Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification

Unsupervised neural adaptation model based on optimal transport for spoken language identification

Deep progressive multi-scale attention for acoustic event classification

Sign in with Google

Consider DeepAI Pro