Large language models have been shown to achieve remarkable performance ...
Pretrained general-purpose language models can achieve state-of-the-art ...
We present a hierarchical VAE that, for the first time, outperforms the ...
Recent work has demonstrated substantial gains on many NLP tasks and ben...
We study empirical scaling laws for language model performance on the cr...
Transformers are powerful sequence models, but require time and memory t...
In this work, we perform an empirical comparison among the CTC, RNN-Tran...
Replacing hand-engineered pipelines with end-to-end deep learning system...
Keyword spotting (KWS) constitutes a major component of human-technology...
In training speech recognition systems, labeling audio clips can be expe...