A Kernel to Exploit Informative Missingness in Multivariate Time Series from EHRs

02/27/2020
by Karl Øyvind Mikalsen, et al.

A large fraction of electronic health records (EHRs) consists of clinical measurements collected over time, such as lab tests and vital signs, which provide important information about a patient's health status. These sequences of clinical measurements are naturally represented as time series, characterized by multiple variables and large amounts of missing data, which complicate the analysis. In this work, we propose a novel kernel that exploits both the information in the observed values and the information hidden in the missingness patterns of multivariate time series (MTS) originating, e.g., from EHRs. The kernel, called TCK_IM, is designed using an ensemble learning strategy in which the base models are novel mixed-mode Bayesian mixture models that can effectively exploit informative missingness without resorting to imputation methods. Moreover, the ensemble approach ensures robustness to hyperparameters, so TCK_IM is particularly well suited to settings that lack labels, a known challenge in medical applications. Experiments on three real-world clinical datasets demonstrate the effectiveness of the proposed kernel.
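The ensemble idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `fit_mixed_mixture` and `ensemble_kernel` are hypothetical, "mixed mode" is read here as a Gaussian component on observed values combined with a Bernoulli component on the missingness indicators (an assumption), and plain maximum-likelihood EM with light smoothing stands in for the Bayesian treatment. Each ensemble member is fitted on a random subset of variables and a random time segment, and the kernel is the average inner product of cluster posteriors.

```python
import numpy as np

def fit_mixed_mixture(X, n_clusters, n_iter, rng):
    """EM for a simplified mixed-mode mixture (hypothetical helper).

    X has shape (N, D) with NaN marking missing entries. Observed values
    get diagonal Gaussian components; the missingness mask gets Bernoulli
    components. Returns cluster posteriors of shape (N, n_clusters).
    """
    R = ~np.isnan(X)                      # True where a value was observed
    Rf = R.astype(float)
    Xz = np.where(R, X, 0.0)              # zeros at missing entries (never imputed into the likelihood)
    post = rng.dirichlet(np.ones(n_clusters), size=X.shape[0])  # random soft init
    for _ in range(n_iter):
        # M-step: update mixing weights, Gaussian and Bernoulli parameters.
        n_c = post.sum(axis=0)                            # (C,)
        n_obs = post.T @ Rf                               # (C, D) weighted observed counts
        pi = (n_c + 1e-9) / (n_c + 1e-9).sum()
        mu = (post.T @ Xz) / (n_obs + 1e-9)
        var = (post.T @ Xz ** 2) / (n_obs + 1e-9) - mu ** 2
        var = np.maximum(var, 1e-3)                       # variance floor
        beta = (n_obs + 1.0) / (n_c[:, None] + 2.0)       # smoothed P(observed)
        # E-step: Gaussian log-likelihood over observed entries only,
        # plus Bernoulli log-likelihood of the missingness mask.
        inv_var = 1.0 / var
        log_gauss = -0.5 * (
            Rf @ np.log(2.0 * np.pi * var).T
            + (Xz ** 2) @ inv_var.T
            - 2.0 * Xz @ (mu * inv_var).T
            + Rf @ (mu ** 2 * inv_var).T
        )
        log_bern = Rf @ np.log(beta).T + (1.0 - Rf) @ np.log(1.0 - beta).T
        log_post = np.log(pi) + log_gauss + log_bern
        log_post -= log_post.max(axis=1, keepdims=True)   # numerical stability
        post = np.exp(log_post)
        post /= post.sum(axis=1, keepdims=True)
    return post

def ensemble_kernel(X, n_members=20, max_clusters=4, n_iter=20, seed=0):
    """Average of posterior inner products over randomized mixture fits.

    X has shape (N, T, V) with NaN marking missing entries.
    """
    rng = np.random.default_rng(seed)
    N, T, V = X.shape
    K = np.zeros((N, N))
    for _ in range(n_members):
        n_clusters = int(rng.integers(2, max_clusters + 1))
        n_vars = int(rng.integers(1, V + 1))
        var_idx = rng.choice(V, size=n_vars, replace=False)   # random variable subset
        t0 = int(rng.integers(0, T))                          # random time segment [t0, t1)
        t1 = int(rng.integers(t0 + 1, T + 1))
        sub = X[:, t0:t1][:, :, var_idx].reshape(N, -1)
        post = fit_mixed_mixture(sub, n_clusters, n_iter, rng)
        K += post @ post.T
    return K / n_members
```

Because every member contributes a Gram matrix of probability vectors, the result is symmetric and positive semidefinite with entries in [0, 1], so it can be passed to any kernel method (for example, a precomputed-kernel classifier or spectral clustering) without imputing the missing values first.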


research
07/10/2019

Time series cluster kernels to exploit informative missingness and incomplete label information

The time series cluster kernel (TCK) provides a powerful tool for analys...
research
03/21/2018

An Unsupervised Multivariate Time Series Kernel Approach for Identifying Patients with Surgical Site Infection from Blood Samples

A large fraction of the electronic health records consists of clinical m...
research
08/12/2019

Mixture-based Multiple Imputation Models for Clinical Data with a Temporal Dimension

The problem of missing values in multivariable time series is a key chal...
research
06/13/2016

Modeling Missing Data in Clinical Time Series with RNNs

We demonstrate a simple strategy to cope with missing data in sequential...
research
04/30/2019

Multi-resolution Networks For Flexible Irregular Time Series Modeling (Multi-FIT)

Missing values, irregularly collected samples, and multi-resolution sign...
research
01/05/2021

Data-Driven Copy-Paste Imputation for Energy Time Series

A cornerstone of the worldwide transition to smart grids are smart meter...
