Deyi Tuo

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Jie Chen
218 publications
Dong Yu
160 publications
Feng Liu
124 publications
Helen Meng
108 publications
Peng Liu
107 publications
Jun Chen
98 publications
Zhiyong Wu
70 publications
Kun Xu
66 publications
Dan Su
60 publications
Chao Weng
40 publications
Xixin Wu
28 publications

research

∙ 08/31/2023

Towards Improving the Expressiveness of Singing Voice Synthesis with BERT Derived Semantic Information

This paper presents an end-to-end high-quality singing voice synthesis (...

0 Shaohuan Zhou, et al. ∙

research

∙ 08/31/2023

Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information

For text-to-speech (TTS) synthesis, prosodic structure prediction (PSP) ...

0 Jie Chen, et al. ∙

research

∙ 06/15/2023

CoverHunter: Cover Song Identification with Refined Attention and Alignments

Abstract: Cover song identification (CSI) focuses on finding the same mu...

0 Feng Liu, et al. ∙

research

∙ 03/24/2022

Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion

Non-parallel data voice conversion (VC) have achieved considerable break...

0 Xintao Zhao, et al. ∙

research

∙ 03/23/2022

FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement

Previously proposed FullSubNet has achieved outstanding performance in D...

0 Jun Chen, et al. ∙

research

∙ 06/20/2020

Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams

Generating 3D speech-driven talking head has received more and more atte...

0 Huirong Huang, et al. ∙

research

∙ 09/04/2019

DurIAN: Duration Informed Attention Network For Multimodal Synthesis

In this paper, we present a generic and robust multimodal synthesis syst...

0 Chengzhu Yu, et al. ∙

Success!

An error occurred

Deyi Tuo

Featured Co-authors

Towards Improving the Expressiveness of Singing Voice Synthesis with BERT Derived Semantic Information

Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information

CoverHunter: Cover Song Identification with Refined Attention and Alignments

Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion

FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement

Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams

DurIAN: Duration Informed Attention Network For Multimodal Synthesis

Sign in with Google

Consider DeepAI Pro