Large language models (LLMs) have demonstrated their abilities to solve
v...
Pre-trained speech encoders have been central to pushing state-of-the-ar...
It remains an open question how simultaneous interpretation (SI) data aff...
Over the past few decades, multimodal emotion recognition has made remar...
Multimodal emotion recognition leverages complementary information acros...
Pre-trained speech Transformers have facilitated great success across va...
Relation extraction typically aims to extract semantic relationships bet...
Pre-trained speech Transformers in speech translation (ST) have facilita...
Training end-to-end speech translation (ST) systems requires sufficientl...
End-to-end speech-to-text translation models are often initialized with
...
Research on multimodal emotion recognition is hindered by the lack of labelled...
Most existing simultaneous machine translation (SiMT) systems are traine...
Emotion recognition in conversation (ERC) is a crucial component in affe...
This paper is a brief report for the MUSE2020 challenge. We present our solu...
Obtaining training data for multi-document summarization (MDS) is time
c...