The Effect of Heterogeneous Data for Alzheimer's Disease Detection from Speech

by Aparna Balagopalan, et al.

Speech datasets for identifying Alzheimer's disease (AD) are generally restricted to participants performing a single task, e.g., describing an image shown to them. As a result, models trained on linguistic features derived from such datasets may not generalize across tasks. Building on prior work demonstrating that same-task data from healthy participants helps improve AD detection on a single-task dataset of pathological speech, we augment an AD-specific dataset consisting of subjects describing a picture with multi-task healthy data. We demonstrate that normative data from multiple speech-based tasks helps improve AD detection by up to 9%. Visualization of decision boundaries reveals that models trained on a combination of structured picture descriptions and unstructured conversational speech have the least out-of-task error and show the most potential to generalize to multiple tasks. We analyze the impact of the age of the added samples and whether they affect fairness in classification. We also provide explanations for a possible inductive bias effect across tasks using model-agnostic feature anchors. This work highlights the need for heterogeneous datasets for encoding changes in multiple facets of cognition and for developing a task-independent AD detection model.


Automatic Identification of Alzheimer's Disease using Lexical Features extracted from Language Samples

Objective: this study has a twofold goal. First, it aims to improve the ...

On the importance of normative data in speech-based assessment

Data sets for identifying Alzheimer's disease (AD) are often relatively ...

ML-Based Analysis to Identify Speech Features Relevant in Predicting Alzheimer's Disease

Alzheimer's disease (AD) is a neurodegenerative disease that affects nea...

Cross-Lingual Transfer Learning for Alzheimer's Detection From Spontaneous Speech

Alzheimer's disease (AD) is a progressive neurodegenerative disease most...

Evaluating Picture Description Speech for Dementia Detection using Image-text Alignment

Using picture description speech for dementia detection has been studied...

Idea density for predicting Alzheimer's disease from transcribed speech

Idea Density (ID) measures the rate at which ideas or elementary predica...
