Shogo Seki | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Li Li
178 publications
Tomoki Toda
66 publications
Hirokazu Kameoka
35 publications
Takuhiro Kaneko
29 publications
Kazuya Takeda
27 publications
Kou Tanaka
20 publications
Nobukatsu Hojo
16 publications

research

∙ 08/14/2023

iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN

The inverse short-time Fourier transform network (iSTFTNet) has garnered...

0 Takuhiro Kaneko, et al. ∙

research

∙ 03/24/2023

Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis

In speech synthesis, a generative adversarial network (GAN), training a ...

0 Takuhiro Kaneko, et al. ∙

research

∙ 03/04/2022

iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform

In recent text-to-speech synthesis and voice conversion systems, a mel-s...

6 Takuhiro Kaneko, et al. ∙

research

∙ 10/06/2020

VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics

In this paper, we propose a non-parallel any-to-many voice conversion (V...

0 Hirokazu Kameoka, et al. ∙

research

∙ 09/29/2018

Generalized Multichannel Variational Autoencoder for Underdetermined Source Separation

This paper deals with a multichannel audio source separation problem und...

0 Shogo Seki, et al. ∙

Success!

An error occurred