Explanations for Automatic Speech Recognition

02/27/2023
by Xiaoliang Wu, et al.

We address quality assessment for neural-network-based automatic speech recognition (ASR) by providing explanations that increase our understanding of the system and ultimately help build trust in it. Compared to simple classification labels, explaining transcriptions is more challenging: judging their correctness is not straightforward, and transcriptions, as variable-length sequences, are not handled by existing interpretable machine learning models. We provide an explanation for an ASR transcription as a subset of audio frames that is both a minimal and sufficient cause of the transcription. To do this, we adapt two existing explainable AI (XAI) techniques from image classification: Statistical Fault Localisation (SFL) and Causal. Additionally, we use an adapted version of Local Interpretable Model-Agnostic Explanations (LIME) for ASR as a baseline in our experiments. We evaluate the quality of the explanations generated by the proposed techniques over three different ASR systems (Google API, the baseline model of Sphinx, and Deepspeech) and 100 audio samples from the Commonvoice dataset.
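To make the idea of a "minimal and sufficient" subset of frames concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes a hypothetical `toy_asr` function in place of a real recogniser, treats masking as zeroing a frame, counts a transcription as preserved only on exact string match, and ranks frames with an Ochiai-style score borrowed from fault localisation. All function names and parameters are illustrative.

```python
import math
import random


def toy_asr(frames):
    """Hypothetical stand-in for an ASR system (not a real API).

    It emits "hello" when frames 2 and 3 are audible and "world" when
    frame 7 is audible; every other frame is irrelevant.
    """
    words = []
    if frames[2] != 0.0 and frames[3] != 0.0:
        words.append("hello")
    if frames[7] != 0.0:
        words.append("world")
    return " ".join(words)


def keep_only(frames, kept):
    """Silence (zero out) every frame whose index is not in `kept`."""
    return [x if i in kept else 0.0 for i, x in enumerate(frames)]


def rank_frames(frames, transcribe, n_mutants=200, seed=0):
    """Rank frames by an Ochiai-style score over randomly masked mutants."""
    rng = random.Random(seed)
    target = transcribe(frames)
    n = len(frames)
    kept_pass = [0] * n    # frame kept, transcription preserved
    kept_fail = [0] * n    # frame kept, transcription changed
    masked_pass = [0] * n  # frame masked, transcription preserved
    for _ in range(n_mutants):
        kept = {i for i in range(n) if rng.random() < 0.5}
        preserved = transcribe(keep_only(frames, kept)) == target
        for i in range(n):
            if i in kept and preserved:
                kept_pass[i] += 1
            elif i in kept:
                kept_fail[i] += 1
            elif preserved:
                masked_pass[i] += 1

    def score(i):
        denom = math.sqrt(
            (kept_pass[i] + masked_pass[i]) * (kept_pass[i] + kept_fail[i])
        )
        return kept_pass[i] / denom if denom else 0.0

    return sorted(range(n), key=score, reverse=True)


def explain(frames, transcribe):
    """Return a sufficient subset of frame indices, pruned to be 1-minimal."""
    target = transcribe(frames)
    chosen = set()
    # Sufficiency: add frames in rank order until the transcription is preserved.
    for i in rank_frames(frames, transcribe):
        chosen.add(i)
        if transcribe(keep_only(frames, chosen)) == target:
            break
    # Minimality: drop any frame whose removal leaves the transcription intact.
    for i in sorted(chosen):
        if transcribe(keep_only(frames, chosen - {i})) == target:
            chosen.discard(i)
    return sorted(chosen)


frames = [0.1 * (i + 1) for i in range(10)]
explanation = explain(frames, toy_asr)
print(explanation)
```

For this toy recogniser the sketch prints `[2, 3, 7]`, the only frames the transcription depends on. With a real ASR system the exact-match test would likely need to be replaced by a similarity measure between transcriptions, since small wording changes are common.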
