We present a noisy channel generative model of two sequences, for exampl...
This work explores the task of synthesizing speech in nonexistent
human-...
We describe a sequence-to-sequence neural network which can directly gen...
Non-saturating generative adversarial network (GAN) training is widely u...
Despite the ability to produce human-level speech for in-domain text,
at...
We present a novel generative model that combines state-of-the-art neura...
Recent work has explored sequence-to-sequence latent variable models for...
We present an extension to the Tacotron speech synthesis architecture th...
In this work, we propose "global style tokens" (GSTs), a bank of embeddi...
Prosodic modeling is a core problem in speech synthesis. The key challen...
In this work, we perform an empirical comparison among the CTC,
RNN-Tran...
Replacing hand-engineered pipelines with end-to-end deep learning system...
We show that an end-to-end deep learning approach can be used to recogni...