Speech Tasks Relevant to Sleepiness Determined with Deep Transfer Learning

11/29/2021
by Bang Tran, et al.

Excessive sleepiness in attention-critical contexts can lead to adverse events, such as car crashes. Detecting and monitoring sleepiness can help prevent these events. In this paper, we use speech from 1,828 participants in the Voiceome dataset to develop a deep transfer learning model that detects sleepiness from Hidden-Unit BERT (HuBERT) speech representations. Speech is an under-utilized source of data in sleepiness detection, but because it is easy, cost-effective, and non-invasive to collect, it is a promising resource. We applied two complementary techniques to seek converging evidence regarding the importance of individual speech tasks. The first, masking, evaluated task importance by combining all speech tasks, masking selected responses in the speech, and observing systematic changes in model accuracy. The second, separate training, compared the accuracy of multiple models that shared the same architecture but were each trained on a different subset of speech tasks. Our evaluation shows that the best-performing model uses the memory recall task and the categorical naming task from the Boston Naming Test, which achieved accuracies of 80.07% and 81.13%, respectively.
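
The masking technique described above can be illustrated with a minimal sketch. This is not the authors' implementation: the toy threshold classifier stands in for the HuBERT-based model, the per-task feature vectors and their values are made up for illustration, and zeroing a task's features is only one assumed way of masking a response. The helper names (`accuracy`, `mask_task`, `task_importance`) are hypothetical.

```python
from typing import Callable, Dict, List

# One sample maps a speech-task name to a feature vector for that task's response.
Sample = Dict[str, List[float]]


def accuracy(predict: Callable[[Sample], int],
             samples: List[Sample], labels: List[int]) -> float:
    """Fraction of samples whose predicted label matches the true label."""
    correct = sum(1 for s, y in zip(samples, labels) if predict(s) == y)
    return correct / len(labels)


def mask_task(sample: Sample, task: str) -> Sample:
    """Return a copy of the sample with one task's features zeroed (assumed masking)."""
    return {name: [0.0] * len(feats) if name == task else list(feats)
            for name, feats in sample.items()}


def task_importance(predict: Callable[[Sample], int],
                    samples: List[Sample], labels: List[int]) -> Dict[str, float]:
    """Accuracy drop caused by masking each task in turn; larger means more important."""
    baseline = accuracy(predict, samples, labels)
    return {task: baseline - accuracy(predict,
                                      [mask_task(s, task) for s in samples],
                                      labels)
            for task in samples[0]}


# Toy data (illustrative values only, not from the Voiceome dataset).
samples = [
    {"memory_recall": [0.9], "categorical_naming": [0.1]},
    {"memory_recall": [0.2], "categorical_naming": [0.8]},
    {"memory_recall": [0.7], "categorical_naming": [0.3]},
    {"memory_recall": [0.1], "categorical_naming": [0.9]},
]
labels = [1, 0, 1, 0]


def toy_predict(sample: Sample) -> int:
    """Stand-in classifier that only looks at the memory recall features."""
    return int(sum(sample["memory_recall"]) > 0.5)


importance = task_importance(toy_predict, samples, labels)
print(importance)
```

In this toy setting, masking the task the classifier depends on lowers accuracy, while masking the other task leaves it unchanged. That systematic change in accuracy is the kind of signal the masking technique looks for.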


