The inverse short-time Fourier transform network (iSTFTNet) has garnered...
In speech synthesis, a generative adversarial network (GAN), training a
...
In recent text-to-speech synthesis and voice conversion systems, a
mel-s...
In this paper, we propose a non-parallel any-to-many voice conversion (V...
This paper deals with a multichannel audio source separation problem und...