Multitask Learning for Grapheme-to-Phoneme Conversion of Anglicisms in German Speech Recognition

05/26/2021
by Julia Pritzen, et al.

Loanwords, such as Anglicisms, are a challenge in German speech recognition. Because their pronunciation is irregular compared to that of native German words, automatically generated pronunciation dictionaries often include faulty phoneme sequences for Anglicisms. In this work, we propose a multitask sequence-to-sequence approach for grapheme-to-phoneme conversion to improve the phonetization of Anglicisms. We extended a grapheme-to-phoneme model with a classifier that distinguishes Anglicisms from native German words. With this approach, the model learns to generate pronunciations differently depending on the classification result. We used our model to create supplementary Anglicism pronunciation dictionaries that are added to an existing German speech recognition model. On a dedicated Anglicism evaluation set, we improved the recognition of Anglicisms compared to a baseline model, reducing the word error rate by 1 [...] learning can help solve the challenge of loanwords in German speech recognition.
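The multitask idea in the abstract, a grapheme-to-phoneme decoder trained jointly with an Anglicism classifier, can be illustrated with a minimal sketch of a joint training objective. The abstract does not say how the two losses are combined; the weighted sum below, the `class_weight` parameter, and all function names are assumptions made for illustration, not the authors' implementation.

```python
import math

def cross_entropy(probs, target_index):
    """Negative log-likelihood of the target class under a probability vector."""
    return -math.log(probs[target_index])

def sequence_loss(step_probs, target_phonemes):
    """Mean cross-entropy over the decoder's output steps (the G2P task)."""
    losses = [cross_entropy(p, t) for p, t in zip(step_probs, target_phonemes)]
    return sum(losses) / len(losses)

def multitask_loss(step_probs, target_phonemes, class_probs, is_anglicism,
                   class_weight=0.5):
    """Joint objective: G2P loss plus a weighted classification loss.

    class_weight is a hypothetical hyperparameter; the source does not
    specify how the two tasks are balanced.
    """
    g2p = sequence_loss(step_probs, target_phonemes)
    # Class index 1 = Anglicism, 0 = native German word (assumed encoding).
    cls = cross_entropy(class_probs, 1 if is_anglicism else 0)
    return g2p + class_weight * cls

# Toy example: two decoder steps over a three-phoneme vocabulary.
step_probs = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]
target_phonemes = [0, 1]
class_probs = [0.3, 0.7]  # [P(native), P(Anglicism)]
loss = multitask_loss(step_probs, target_phonemes, class_probs, is_anglicism=True)
print(f"joint loss: {loss:.4f}")
```

Because both terms are minimized together, gradients from the classifier shape the shared representation that the decoder uses, which is one plausible way a model could learn to phonetize Anglicisms differently from native words.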


research
06/15/2021

Dialectal Speech Recognition and Translation of Swiss German Speech to Standard German Text: Microsoft's Submission to SwissText 2021

This paper describes the winning approach in the Shared Task 3 at SwissT...
research
10/27/2020

Multitask Training with Text Data for End-to-End Speech Recognition

We propose a multitask training method for attention-based end-to-end sp...
research
07/31/2023

Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings

The Grapheme-to-Phoneme (G2P) task aims to convert orthographic input in...
research
06/28/2016

Generation and Pruning of Pronunciation Variants to Improve ASR Accuracy

Speech recognition, especially name recognition, is widely used in phone...
research
03/21/2020

A Joint Approach to Compound Splitting and Idiomatic Compound Detection

Applications such as machine translation, speech recognition, and inform...
research
03/31/2020

A Swiss German Dictionary: Variation in Speech and Writing

We introduce a dictionary containing forms of common words in various Sw...
research
06/15/2021

Modeling morphology with Linear Discriminative Learning: considerations and design choices

This study addresses a series of methodological questions that arise whe...
