State-of-the-art non-autoregressive text-to-speech (TTS) models based on...
Deep equilibrium models (DEQs) have proven to be very powerful for learn...
Self-supervised learning (SSL) has recently shown remarkable results in
...
Parallel text-to-speech (TTS) models have recently enabled fast and
high...
Recent works have revealed the vulnerability of automatic speech recogni...
Voice conversion has gained increasing popularity in many applications o...
Given the semantic descriptions of classes, Zero-Shot Learning (ZSL) aim...