XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech

05/31/2023
by Linh The Nguyen, et al.

We present XPhoneBERT, the first multilingual model pre-trained to learn phoneme representations for the downstream text-to-speech (TTS) task. XPhoneBERT has the same model architecture as BERT-base and is trained using the RoBERTa pre-training approach on 330M phoneme-level sentences from nearly 100 languages and locales. Experimental results show that employing XPhoneBERT as an input phoneme encoder significantly boosts the performance of a strong neural TTS model in terms of naturalness and prosody, and also helps produce fairly high-quality speech with limited training data. We publicly release our pre-trained XPhoneBERT in the hope that it will facilitate future research and downstream TTS applications for multiple languages. The XPhoneBERT model is available at https://github.com/VinAIResearch/XPhoneBERT
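
The abstract says only that XPhoneBERT is trained with the RoBERTa pre-training approach. As a rough illustration of what that usually involves, the sketch below applies masked-language-model corruption to a phoneme sequence, re-sampling the mask on every pass (dynamic masking). The 15% selection rate and the 80/10/10 split between mask, random replacement, and unchanged token are the standard BERT/RoBERTa recipe and are assumed here, not confirmed by the abstract. The function name `mask_phonemes`, the `<mask>` symbol, and the toy phoneme sequence are hypothetical.

```python
import random

MASK = "<mask>"  # hypothetical mask symbol, not taken from the released tokenizer


def mask_phonemes(tokens, vocab, rng, mask_prob=0.15):
    """Corrupt a phoneme sequence for masked-language-model training.

    Returns (corrupted, labels). labels[i] holds the original token at
    positions the model must predict, and None everywhere else.
    """
    corrupted, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            labels.append(tok)  # this position contributes to the loss
            roll = rng.random()
            if roll < 0.8:
                corrupted.append(MASK)  # 80%: replace with the mask symbol
            elif roll < 0.9:
                corrupted.append(rng.choice(vocab))  # 10%: random phoneme
            else:
                corrupted.append(tok)  # 10%: keep the original phoneme
        else:
            corrupted.append(tok)
            labels.append(None)  # position ignored by the loss
    return corrupted, labels


# Toy phoneme sequence (assumption), with "▁" marking word boundaries.
phonemes = ["ð", "ɪ", "s", "▁", "ɪ", "z", "▁", "ɐ", "▁", "t", "ɛ", "s", "t"]
vocab = sorted(set(phonemes))
rng = random.Random(0)

# Dynamic masking: a fresh mask pattern is drawn each time the
# sentence is visited, rather than being fixed once up front.
for epoch in range(2):
    corrupted, labels = mask_phonemes(phonemes, vocab, rng)
    print(epoch, " ".join(corrupted))
```

In practice the corruption would be applied to batches of token IDs rather than to Python lists of strings; the list-based version is only meant to make the selection logic easy to read.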

research
05/20/2020

BERTweet: A pre-trained language model for English Tweets

We present BERTweet, the first public large-scale pre-trained language m...
research
11/12/2022

AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities

In this work, we present a conceptually simple and effective method to t...
research
04/15/2021

Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems

Developing Text Normalization (TN) systems for Text-to-Speech (TTS) on n...
research
12/21/2022

3D Highlighter: Localizing Regions on 3D Shapes via Text Descriptions

We present 3D Highlighter, a technique for localizing semantic regions o...
research
10/05/2017

BPEmb: Tokenization-free Pre-trained Subword Embeddings in 275 Languages

We present BPEmb, a collection of pre-trained subword unit embeddings in...
research
10/25/2018

Learning Neural Emotion Analysis from 100 Observations: The Surprising Effectiveness of Pre-Trained Word Representations

Deep Learning has drastically reshaped virtually all areas of NLP. Yet o...
research
09/20/2023

ModelGiF: Gradient Fields for Model Functional Distance

The last decade has witnessed the success of deep learning and the surge...
