Comparing Supervised Models And Learned Speech Representations For Classifying Intelligibility Of Disordered Speech On Selected Phrases

07/08/2021
by Subhashini Venugopalan, et al.

Automatic classification of disordered speech can provide an objective tool for identifying the presence and severity of speech impairment. Classification approaches can also help identify hard-to-recognize speech samples to teach ASR systems about the variable manifestations of impaired speech. Here, we develop and compare different deep learning techniques to classify the intelligibility of disordered speech on selected phrases. We collected samples from a diverse set of 661 speakers with a variety of self-reported disorders speaking 29 words or phrases, which were rated by speech-language pathologists for their overall intelligibility using a five-point Likert scale. We then evaluated classifiers developed using three approaches: (1) a convolutional neural network (CNN) trained for the task, (2) classifiers trained on non-semantic speech representations from CNNs that used an unsupervised objective [1], and (3) classifiers trained on the acoustic (encoder) embeddings from an ASR system trained on typical speech [2]. We found that the ASR encoder's embeddings considerably outperform the other two approaches at detecting and classifying disordered speech. Further analysis shows that the ASR embeddings cluster speech by the spoken phrase, while the non-semantic embeddings cluster speech by speaker. Also, longer phrases are more indicative of intelligibility deficits than single words.
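Approaches (2) and (3) share one pattern: a frozen model produces embeddings, and a small classifier is trained on top of them to predict the intelligibility rating. The sketch below illustrates that pattern only. It is not the authors' pipeline: the embeddings are synthetic random vectors, mean pooling over frames is an assumption, the nearest-centroid classifier is a simple stand-in for whatever classifier the paper uses, and every function name is hypothetical.

```python
import random

def mean_pool(frames):
    """Average per-frame embedding vectors into one utterance-level vector."""
    dim = len(frames[0])
    return [sum(f[d] for f in frames) / len(frames) for d in range(dim)]

def fit_centroids(vectors, labels):
    """Compute one centroid per intelligibility label from training vectors."""
    grouped = {}
    for v, y in zip(vectors, labels):
        grouped.setdefault(y, []).append(v)
    return {y: mean_pool(vs) for y, vs in grouped.items()}

def predict(centroids, vector):
    """Return the label whose centroid is nearest in squared Euclidean distance."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda y: sq_dist(centroids[y], vector))

def synthetic_utterance(rng, label, n_frames=20, dim=8):
    """Fake frame embeddings whose mean depends on the label (illustration only)."""
    return [[rng.gauss(float(label), 0.5) for _ in range(dim)]
            for _ in range(n_frames)]

rng = random.Random(0)
# Ratings on a five-point scale, encoded here as 0..4 (assumed encoding).
labels = [rng.randrange(5) for _ in range(200)]
vectors = [mean_pool(synthetic_utterance(rng, y)) for y in labels]

# Simple split for the sketch; a real study would split by speaker.
train_x, test_x = vectors[:150], vectors[150:]
train_y, test_y = labels[:150], labels[150:]

centroids = fit_centroids(train_x, train_y)
correct = sum(predict(centroids, v) == y for v, y in zip(test_x, test_y))
accuracy = correct / len(test_y)
print(f"held-out accuracy on synthetic data: {accuracy:.2f}")
```

Because the synthetic classes are well separated by construction, the accuracy printed here says nothing about real disordered speech; the point is only the shape of the workflow (embed, pool, fit a light classifier, evaluate on held-out data).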

research
03/13/2023

Speech Intelligibility Classifiers from 550k Disordered Speech Samples

We developed dysarthric speech intelligibility classifiers on 551,176 di...
research
07/31/2019

Personalizing ASR for Dysarthric and Accented Speech with Limited Data

Automatic speech recognition (ASR) systems have dramatically improved ov...
research
07/11/2023

Speech Diarization and ASR with GMM

In this research paper, we delve into the topics of Speech Diarization a...
research
06/16/2022

Nonwords Pronunciation Classification in Language Development Tests for Preschool Children

This work aims to automatically evaluate whether the language developmen...
research
05/24/2022

Deep Learning-based automated classification of Chinese Speech Sound Disorders

This article describes a system for analyzing acoustic data to assist in...
research
03/21/2022

Multi-class versus One-class classifier in spontaneous speech analysis oriented to Alzheimer Disease diagnosis

Most of medical developments require the ability to identify samples tha...
research
04/18/2018

Unspeech: Unsupervised Speech Context Embeddings

We introduce "Unspeech" embeddings, which are based on unsupervised lear...
