This paper describes the NPU-MSXF system for the IWSLT 2023 speech-to-sp...
Direct speech-to-speech translation (S2ST) has gradually become popular ...
We propose a novel Text-to-Image Generation Network, Adaptive Layout
Ref...
Direct speech-to-speech translation (S2ST) is an attractive research top...
Electrocardiogram (ECG) is a widely used non-invasive diagnostic tool fo...
Leveraging context information is an intuitive idea to improve performan...
Transformer-based models have demonstrated their effectiveness in automa...
Compositional Zero-Shot Learning (CZSL) aims to recognize unseen composi...
Conversational automatic speech recognition (ASR) is a task to recognize...
Modern deep learning methods have achieved great success in machine lear...
Conversational speech recognition is regarded as a challenging task due ...
In this study, we present recent developments on ESPnet: End-to-End Spee...