GeoECG: Data Augmentation via Wasserstein Geodesic Perturbation for Robust Electrocardiogram Prediction

by   Jiacheng Zhu, et al.
University of Illinois at Urbana-Champaign
Carnegie Mellon University

There has been an increased interest in applying deep neural networks to automatically interpret and analyze the 12-lead electrocardiogram (ECG). The current paradigms with machine learning methods are often limited by the amount of labeled data. This phenomenon is particularly problematic for clinically-relevant data, where labeling at scale can be time-consuming and costly in terms of the specialized expertise and human effort required. Moreover, deep learning classifiers may be vulnerable to adversarial examples and perturbations, which could have catastrophic consequences, for example, when applied in the context of medical treatment, clinical trials, or insurance claims. In this paper, we propose a physiologically-inspired data augmentation method to improve performance and increase the robustness of heart disease detection based on ECG signals. We obtain augmented samples by perturbing the data distribution towards other classes along the geodesic in Wasserstein space. To better utilize domain-specific knowledge, we design a ground metric that recognizes the difference between ECG signals based on physiologically determined features. Learning from 12-lead ECG signals, our model is able to distinguish five categories of cardiac conditions. Our results demonstrate improvements in accuracy and robustness, reflecting the effectiveness of our data augmentation method.


page 13

page 14


Optimal Transport based Data Augmentation for Heart Disease Diagnosis and Prediction

In this paper, we focus on a new method of data augmentation to solve th...

Data Augmentation with GAN increases the Performance of Arrhythmia Classification for an Unbalanced Dataset

Due to the data shortage problem, which is one of the major problems in ...

Data Augmentation for Electrocardiograms

Neural network models have demonstrated impressive performance in predic...

Graph Structure Based Data Augmentation Method

In this paper, we propose a novel graph-based data augmentation method t...

Adversarial Examples for Electrocardiograms

Among all physiological signals, electrocardiogram (ECG) has seen some o...

MedalCare-XL: 16,900 healthy and pathological 12 lead ECGs obtained through electrophysiological simulations

Mechanistic cardiac electrophysiology models allow for personalized simu...

Pretraining ECG Data with Adversarial Masking Improves Model Generalizability for Data-Scarce Tasks

Medical datasets often face the problem of data scarcity, as ground trut...

Please sign up or login with your details

Forgot password? Click here to reset