LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus

05/30/2023
by   Yuma Koizumi, et al.
0

This paper introduces a new speech dataset called “LibriTTS-R” designed for text-to-speech (TTS) use. It is derived by applying speech restoration to the LibriTTS corpus, which consists of 585 hours of speech data at 24 kHz sampling rate from 2,456 speakers and the corresponding texts. The constituent samples of LibriTTS-R are identical to those of LibriTTS, with only the sound quality improved. Experimental results show that the LibriTTS-R ground-truth samples showed significantly improved sound quality compared to those in LibriTTS. In addition, neural end-to-end TTS trained with LibriTTS-R achieved speech naturalness on par with that of the ground-truth samples. The corpus is freely available for download from <http://www.openslr.org/141/>.

READ FULL TEXT

page 3

page 4

research
04/05/2019

LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech

This paper introduces a new speech corpus called "LibriTTS" designed for...
research
10/28/2017

JSUT corpus: free large-scale Japanese speech corpus for end-to-end speech synthesis

Thanks to improvements in machine learning techniques including deep lea...
research
02/28/2023

ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus

At present, Text-to-speech (TTS) systems that are trained with high-qual...
research
11/19/2020

Universal MelGAN: A Robust Neural Vocoder for High-Fidelity Waveform Generation in Multiple Domains

We propose Universal MelGAN, a vocoder that synthesizes high-fidelity sp...
research
07/29/2023

ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus

We introduce the ÌròyìnSpeech corpus – a new dataset influenced by a des...
research
10/14/2019

Restoring ancient text using deep learning: a case study on Greek epigraphy

Ancient history relies on disciplines such as epigraphy, the study of an...
research
04/12/2022

Enhancement of Pitch Controllability using Timbre-Preserving Pitch Augmentation in FastPitch

The recently developed pitch-controllable text-to-speech (TTS) model, i....

Please sign up or login with your details

Forgot password? Click here to reset