Continuous Speech Separation with Conformer

08/13/2020
by Sanyuan Chen, et al.

Continuous speech separation plays a vital role in complicated speech-related tasks such as conversation transcription. The separation model extracts a single-speaker signal from mixed speech. In this paper, we use transformer and conformer in lieu of recurrent neural networks in the separation system, as we believe capturing global information with a self-attention-based method is crucial for speech separation. Evaluated on the LibriCSS dataset, the conformer separation model achieves state-of-the-art results, with a relative 23.5% word error rate (WER) reduction from bi-directional LSTM (BLSTM) in the utterance-wise evaluation and a 15.4% WER reduction in the continuous evaluation.
