Making the Most Out of the Limited Context Length: Predictive Power Varies with Clinical Note Type and Note Section

07/13/2023
by Hongyi Zheng, et al.

Recent advances in large language models have led to renewed interest in natural language processing in healthcare using the free text of clinical notes. One distinguishing characteristic of clinical notes is that they span long periods of time across multiple long documents. This structure creates a new design choice: when the context length of a language model predictor is limited, which part of the clinical notes should be chosen as the input? Existing studies either select the inputs using domain knowledge or simply truncate them. We propose a framework for identifying the sections with high predictive power. Using MIMIC-III, we show that 1) the distribution of predictive power differs between nursing notes and discharge notes, and 2) combining different types of notes could improve performance when the context length is large. Our findings suggest that a carefully selected sampling function could enable more efficient information extraction from clinical notes.
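
To make the idea of a sampling function concrete, the following is a minimal sketch of selecting a contiguous window of tokens from a note under a fixed context budget. It is an illustration only, not the framework proposed in the paper: the names `sample_window` and `combine_note_types`, the `position` parameter, and the even split of the budget across note types are all assumptions introduced here.

```python
from typing import Dict, List


def sample_window(tokens: List[str], budget: int, position: float = 0.0) -> List[str]:
    """Return a contiguous window of at most `budget` tokens.

    Hypothetical helper: position=0.0 keeps the start of the note
    (plain truncation), position=1.0 keeps the end, and values in
    between slide the window proportionally through the note.
    """
    if not 0.0 <= position <= 1.0:
        raise ValueError("position must be between 0.0 and 1.0")
    if budget <= 0:
        return []
    if len(tokens) <= budget:
        return list(tokens)
    # Number of tokens that cannot fit; the window start moves within it.
    slack = len(tokens) - budget
    start = int(position * slack)
    return tokens[start:start + budget]


def combine_note_types(
    notes: Dict[str, List[str]],
    budget: int,
    positions: Dict[str, float],
) -> List[str]:
    """Split the budget evenly across note types and concatenate the windows.

    The even split is an assumption made for this sketch; other
    allocations are equally possible.
    """
    if not notes:
        return []
    share = budget // len(notes)
    combined: List[str] = []
    for note_type, tokens in notes.items():
        combined.extend(sample_window(tokens, share, positions.get(note_type, 0.0)))
    return combined


if __name__ == "__main__":
    nursing = "n0 n1 n2 n3 n4 n5 n6 n7 n8 n9".split()
    discharge = "d0 d1 d2 d3 d4 d5".split()
    # Keep the end of the nursing note and the start of the discharge note.
    selected = combine_note_types(
        {"nursing": nursing, "discharge": discharge},
        budget=8,
        positions={"nursing": 1.0, "discharge": 0.0},
    )
    print(selected)
```

Treating the window position as a parameter, rather than always truncating from the start, is what allows different note types to use different parts of their text, which is the kind of choice the abstract describes.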
