We show that training a multi-headed self-attention-based deep network t...
The recently proposed Joint Energy-based Model (JEM) interprets
discrimi...
We present a method to remove unknown convolutive noise introduced to sp...
How important are different temporal speech modulations for speech
recog...
Conventional Frequency Domain Linear Prediction (FDLP) technique models ...
We propose a technique to compute spectrograms using Frequency Domain Li...
Performance degradation of an Automatic Speech Recognition (ASR) system ...
The multi-stream paradigm of audio processing, in which several sources ...
Attention-based methods and Connectionist Temporal Classification (CTC)
...
Measuring performance of an automatic speech recognition (ASR) system wi...
Quality of data plays an important role in most deep learning tasks. In ...
Automatic Speech Recognition (ASR) using multiple microphone arrays has
...
Attention-based methods and Connectionist Temporal Classification (CTC)
...
A stream attention framework has been applied to the posterior probabili...