Speech enhancement (SE) is usually required as a front end to improve th...
With the advance in self-supervised learning for audio and visual modali...
Wav2vec2.0 is a popular self-supervised pre-training framework for learn...
In this paper, we propose a weakly supervised multilingual representatio...