Word-piece models (WPMs) are commonly used subword units in state-of-the...
We explore unifying a neural segmenter with two-pass cascaded encoder AS...
Automatic speech recognition (ASR) systems typically rely on an external...
Language identification is critical for many downstream tasks in automat...
On-device end-to-end (E2E) models have shown improvements over a convent...
In voice-enabled applications, a predetermined hotword isusually used to...
While a streaming voice assistant system has been used in many applicati...
Improving the performance of end-to-end ASR models on long utterances ra...
The recurrent neural network transducer (RNN-T) has recently become the
...
End-to-end (E2E) models have shown to outperform state-of-the-art
conven...
Streaming automatic speech recognition (ASR) aims to emit each hypothesi...
Thus far, end-to-end (E2E) models have not been shown to outperform
stat...
The emerging field of neural speech recognition (NSR) using
electrocorti...
In this paper, we propose "personal VAD", a system to detect the voice
a...
Given the recent surge in developments of deep learning, this article
pr...
End-to-end (E2E) models, which directly predict output character sequenc...