State space models (SSMs) have recently shown promising results on
small...
Neural network pruning can be effectively applied to compress automatic
...
There is growing interest in unifying the streaming and full-context
aut...
It is well known that many machine learning systems demonstrate bias tow...
Representation learning from unlabeled data has been of major interest i...
This paper improves the streaming transformer transducer for speech
reco...
In this work, to measure the accuracy and efficiency for a latency-contr...
In this work, we exploit speech enhancement for improving a recurrent ne...
End-to-end automatic speech recognition (ASR) models with a single neura...
In this work, we first show that on the widely used LibriSpeech benchmar...
Videos uploaded on social media are often accompanied with textual
descr...
Deep acoustic models typically receive features in the first layer of th...
We propose and evaluate transformer-based acoustic models (AMs) for hybr...
Towards developing high-performing ASR for low-resource languages, appro...
We explore training attention-based encoder-decoder ASR for low-resource...
In topic identification (topic ID) on real-world unstructured audio, an ...
We describe the system our team used during NIST's LoReHLT (Low Resource...
Modern topic identification (topic ID) systems for speech use automatic
...
Acoustic unit discovery (AUD) is a process of automatically identifying ...