sEHR-CE: Language modelling of structured EHR data for efficient and generalizable patient cohort expansion

11/30/2022
by   Anna Munoz-Farre, et al.
0

Electronic health records (EHR) offer unprecedented opportunities for in-depth clinical phenotyping and prediction of clinical outcomes. Combining multiple data sources is crucial to generate a complete picture of disease prevalence, incidence and trajectories. The standard approach to combining clinical data involves collating clinical terms across different terminology systems using curated maps, which are often inaccurate and/or incomplete. Here, we propose sEHR-CE, a novel framework based on transformers to enable integrated phenotyping and analyses of heterogeneous clinical datasets without relying on these mappings. We unify clinical terminologies using textual descriptors of concepts, and represent individuals' EHR as sections of text. We then fine-tune pre-trained language models to predict disease phenotypes more accurately than non-text and single terminology approaches. We validate our approach using primary and secondary care data from the UK Biobank, a large-scale research study. Finally, we illustrate in a type 2 diabetes use case how sEHR-CE identifies individuals without diagnosis that share clinical characteristics with patients.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/20/2023

CPLLM: Clinical Prediction with Large Language Models

We present Clinical Prediction with Large Language Models (CPLLM), a met...
research
11/09/2021

Multi-Task Prediction of Clinical Outcomes in the Intensive Care Unit using Flexible Multimodal Transformers

Recent deep learning research based on Transformer model architectures h...
research
02/28/2022

VaultDB: A Real-World Pilot of Secure Multi-Party Computation within a Clinical Research Network

Electronic health records represent a rich and growing source of clinica...
research
09/14/2022

PainPoints: A Framework for Language-based Detection of Chronic Pain and Expert-Collaborative Text-Summarization

Chronic pain is a pervasive disorder which is often very disabling and i...
research
09/09/2022

Modelling Patient Trajectories Using Multimodal Information

Electronic Health Records (EHRs) aggregate diverse information at the pa...
research
07/15/2020

Predicting Clinical Diagnosis from Patients Electronic Health Records Using BERT-based Neural Networks

In this paper we study the problem of predicting clinical diagnoses from...
research
01/26/2020

Secondary Use of Electronic Health Record: Opportunities and Challenges

In present technological era, healthcare providers generate huge amount ...

Please sign up or login with your details

Forgot password? Click here to reset