Extracting COVID-19 Diagnoses and Symptoms From Clinical Text: A New Annotated Corpus and Neural Event Extraction Framework

12/02/2020
by   Kevin Lybarger, et al.
0

Coronavirus disease 2019 (COVID-19) is a global pandemic. Although much has been learned about the novel coronavirus since its emergence, there are many open questions related to tracking its spread, describing symptomology, predicting the severity of infection, and forecasting healthcare utilization. Free-text clinical notes contain critical information for resolving these questions. Data-driven, automatic information extraction models are needed to use this text-encoded information in large-scale studies. This work presents a new clinical corpus, referred to as the COVID-19 Annotated Clinical Text (CACT) Corpus, which comprises 1,472 notes with detailed annotations characterizing COVID-19 diagnoses, testing, and clinical presentation. We introduce a span-based event extraction model that jointly extracts all annotated phenomena, achieving high performance in identifying COVID-19 and symptom events with associated assertion values (0.83-0.97 F1 for events and 0.73-0.79 F1 for assertions). In a secondary use application, we explored the prediction of COVID-19 test results using structured patient data (e.g. vital signs and laboratory results) and automatically extracted symptom information. The automatically extracted symptoms improve prediction performance, beyond structured data alone.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/11/2020

Annotating Social Determinants of Health Using Active Learning, and Characterizing Determinants Using Neural Event Extraction

Social determinants of health (SDOH) affect health outcomes, and knowled...
research
06/03/2020

Extracting COVID-19 Events from Twitter

We present a corpus of 7,500 tweets annotated with COVID-19 events, incl...
research
08/17/2022

Extracting Medication Changes in Clinical Narratives using Pre-trained Language Models

An accurate and detailed account of patient medications, including medic...
research
02/02/2020

Assessment of Amazon Comprehend Medical: Medication Information Extraction

In November 27, 2018, Amazon Web Services (AWS) released Amazon Comprehe...
research
05/19/2023

Eye-SpatialNet: Spatial Information Extraction from Ophthalmology Notes

We introduce an annotated corpus of 600 ophthalmology notes labeled with...
research
03/10/2021

Identifying ARDS using the Hierarchical Attention Network with Sentence Objectives Framework

Acute respiratory distress syndrome (ARDS) is a life-threatening conditi...

Please sign up or login with your details

Forgot password? Click here to reset