Self-supervised learning (SSL) methods such as WavLM have shown promisin...
Personalized speech enhancement (PSE) models achieve promising results
c...
Personalized speech enhancement (PSE), a process of estimating a clean t...
Existing multi-channel continuous speech separation (CSS) models are hea...
This paper investigates how to improve the runtime speed of personalized...
The Deep Noise Suppression (DNS) challenge is designed to foster innovat...
In this work, we develop new self-learning techniques with an attention-...
Multi-talker conversational speech processing has drawn many interests f...
With the recent surge of video conferencing tools usage, providing
high-...
Personalized speech enhancement (PSE) models utilize additional cues, su...
Continuous speech separation (CSS) aims to separate overlapping voices f...
In this paper, a new learning algorithm for Federated Learning (FL) is
i...
Visual emotion expression plays an important role in audiovisual speech
...
In this paper, a Federated Learning (FL) simulation platform is introduc...
Modern Automatic Speech Recognition (ASR) systems can achieve high
perfo...
The presence of a corresponding talking face has been shown to significa...