An Automatic SOAP Classification System Using Weakly Supervision And Transfer Learning

11/26/2022
by   Sunjae Kwon, et al.
0

In this paper, we introduce a comprehensive framework for developing a machine learning-based SOAP (Subjective, Objective, Assessment, and Plan) classification system without manually SOAP annotated training data or with less manually SOAP annotated training data. The system is composed of the following two parts: 1) Data construction, 2) A neural network-based SOAP classifier, and 3) Transfer learning framework. In data construction, since a manual construction of a large size training dataset is expensive, we propose a rule-based weak labeling method utilizing the structured information of an EHR note. Then, we present a SOAP classifier composed of a pre-trained language model and bi-directional long-short term memory with conditional random field (Bi-LSTM-CRF). Finally, we propose a transfer learning framework that re-uses the trained parameters of the SOAP classifier trained with the weakly labeled dataset for datasets collected from another hospital. The proposed weakly label-based learning model successfully performed SOAP classification (89.99 F1-score) on the notes collected from the target hospital. Otherwise, in the notes collected from other hospitals and departments, the performance dramatically decreased. Meanwhile, we verified that the transfer learning framework is advantageous for inter-hospital adaptation of the model increasing the models' performance in every cases. In particular, the transfer learning approach was more efficient when the manually annotated data size was smaller. We showed that SOAP classification models trained with our weakly labeling algorithm can perform SOAP classification without manually annotated data on the EHR notes from the same hospital. The transfer learning framework helps SOAP classification model's inter-hospital migration with a minimal size of the manually annotated dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/25/2021

Natural Language Processing Accurately Categorizes Indications, Findings and Pathology Reports from Multicenter Colonoscopy

Colonoscopy is used for colorectal cancer (CRC) screening. Extracting de...
research
05/09/2022

Transfer Learning Based Efficient Traffic Prediction with Limited Training Data

Efficient prediction of internet traffic is an essential part of Self Or...
research
04/08/2023

Interpretable Multi Labeled Bengali Toxic Comments Classification using Deep Learning

This paper presents a deep learning-based pipeline for categorizing Beng...
research
01/18/2021

TLU-Net: A Deep Learning Approach for Automatic Steel Surface Defect Detection

Visual steel surface defect detection is an essential step in steel shee...
research
08/19/2021

How Hateful are Movies? A Study and Prediction on Movie Subtitles

In this research, we investigate techniques to detect hate speech in mov...
research
09/06/2016

Using Natural Language Processing to Screen Patients with Active Heart Failure: An Exploration for Hospital-wide Surveillance

In this paper, we proposed two different approaches, a rule-based approa...
research
08/20/2021

Estimation of Playable Piano Fingering by Pitch-difference Fingering Matching Model

The existing piano fingering labeling statistical models usually conside...

Please sign up or login with your details

Forgot password? Click here to reset