Thomas Merritt

research

∙ 07/31/2023

Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech

Neural text-to-speech systems are often optimized on L1/L2 losses, which...

Guangyan Zhang, et al.

research

∙ 07/04/2022

GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion

In this paper, we propose GlowVC: a multilingual multi-speaker flow-base...

Magdalena Proszewska, et al.

research

∙ 06/28/2022

Expressive, Variable, and Controllable Duration Modelling in TTS

Duration modelling has become an important research problem once more wi...

Ammar Abbas, et al.

research

∙ 03/15/2022

Text-free non-parallel many-to-many voice conversion using normalising flows

Non-parallel voice conversion (VC) is typically achieved using lossy rep...

Thomas Merritt, et al.

research

∙ 06/24/2021

Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech

Whilst recent neural text-to-speech (TTS) approaches produce high-qualit...

Raahil Shah, et al.

research

∙ 11/11/2020

Low-resource expressive text-to-speech using data augmentation

While recent neural text-to-speech (TTS) systems perform remarkably well...

Goeric Huybrechts, et al.

research

∙ 04/04/2019

In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data

Neural text-to-speech synthesis (NTTS) models have shown significant pro...

Nishant Prateek, et al.

research

∙ 11/15/2018

Effect of data reduction on sequence-to-sequence neural TTS

Recent speech synthesis systems based on sampling from autoregressive ne...

Javier Latorre, et al.

research

∙ 11/15/2018

Comprehensive evaluation of statistical speech waveform synthesis

Statistical TTS systems that directly predict the speech waveform have r...

Thomas Merritt, et al.

research

∙ 11/15/2018

Robust universal neural vocoding

This paper introduces a robust universal neural vocoder trained with 74 ...

Jaime Lorenzo-Trueba, et al.

research

∙ 07/28/2018

Analysing Shortcomings of Statistical Parametric Speech Synthesis

Output from statistical parametric speech synthesis (SPSS) remains notic...

Gustav Eje Henter, et al.
