Daisy Stanton

research

∙ 12/06/2022

Learning the joint distribution of two sequences using little or no paired data

We present a noisy channel generative model of two sequences, for exampl...

0 Soroosh Mariooryad, et al. ∙

research

∙ 11/07/2021

Speaker Generation

This work explores the task of synthesizing speech in nonexistent human-...

0 Daisy Stanton, et al. ∙

research

∙ 10/15/2020

Non-saturating GAN training as divergence minimization

Non-saturating generative adversarial network (GAN) training is widely u...

0 Matt Shannon, et al. ∙

research

∙ 10/23/2019

Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Despite the ability to produce human-level speech for in-domain text, at...

0 Eric Battenberg, et al. ∙

research

∙ 10/03/2019

Semi-Supervised Generative Modeling for Controllable Speech Synthesis

We present a novel generative model that combines state-of-the-art neura...

0 Raza Habib, et al. ∙

research

∙ 06/08/2019

Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

Recent work has explored sequence-to-sequence latent variable models for...

0 Eric Battenberg, et al. ∙

research

∙ 08/04/2018

Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis

Global Style Tokens (GSTs) are a recently-proposed method to learn laten...

0 Daisy Stanton, et al. ∙

research

∙ 03/24/2018

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron

We present an extension to the Tacotron speech synthesis architecture th...

0 RJ Skerry-Ryan, et al. ∙

research

∙ 03/23/2018

Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

In this work, we propose "global style tokens" (GSTs), a bank of embeddi...

0 Yuxuan Wang, et al. ∙

research

∙ 11/01/2017

Uncovering Latent Style Factors for Expressive Speech Synthesis

Prosodic modeling is a core problem in speech synthesis. The key challen...

0 Yuxuan Wang, et al. ∙

research

∙ 03/29/2017

Tacotron: Towards End-to-End Speech Synthesis

A text-to-speech synthesis system typically consists of multiple stages,...

0 Yuxuan Wang, et al. ∙

Daisy Stanton

Featured Co-authors

Sign in with Google

Consider DeepAI Pro