On the Impact of Random Seeds on the Fairness of Clinical Classifiers

04/13/2021
by   Silvio Amir, et al.

Recent work has shown that fine-tuning large networks is surprisingly sensitive to the choice of random seed. We explore the implications of this phenomenon for model fairness across demographic groups in clinical prediction tasks over electronic health records (EHR) in MIMIC-III, the standard dataset in clinical NLP research. Apparent subgroup performance varies substantially across seeds that yield similar overall performance, although there is no evidence of a trade-off between overall and subgroup performance. However, we also find that the small sample sizes inherent to looking at intersections of minority groups and somewhat rare conditions limit our ability to accurately estimate disparities. Further, we find that jointly optimizing for high overall performance and low disparities does not yield statistically significant improvements. Our results suggest that fairness work using MIMIC-III should carefully account for variations in apparent differences that may arise from stochasticity and small sample sizes.
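
The small-sample point can be illustrated with a toy simulation. The sketch below is not the authors' method and does not use MIMIC-III. It assumes a hypothetical classifier that is equally accurate on a large majority group and a small minority group, and it uses the seed only to drive simulated predictions, not actual fine-tuning. The function name `simulate_gap`, the group sizes, and the accuracy value are all illustrative assumptions.

```python
import random
import statistics


def simulate_gap(seed, n_majority=5000, n_minority=50, accuracy=0.8):
    """Return (overall accuracy, majority-minus-minority accuracy gap) for one seed.

    Both groups share the same true accuracy (an assumption of this toy setup),
    so any measured gap is pure sampling noise.
    """
    rng = random.Random(seed)
    # Each simulated prediction is correct with probability `accuracy`.
    maj_correct = sum(rng.random() < accuracy for _ in range(n_majority))
    min_correct = sum(rng.random() < accuracy for _ in range(n_minority))
    overall = (maj_correct + min_correct) / (n_majority + n_minority)
    gap = maj_correct / n_majority - min_correct / n_minority
    return overall, gap


# Repeat the simulated evaluation over many seeds.
results = [simulate_gap(seed) for seed in range(200)]
overall_sd = statistics.stdev([overall for overall, _ in results])
gap_sd = statistics.stdev([gap for _, gap in results])

print(f"std of overall accuracy across seeds: {overall_sd:.4f}")
print(f"std of subgroup gap across seeds:     {gap_sd:.4f}")
```

Under these assumptions the overall accuracy is stable across seeds while the measured subgroup gap fluctuates much more, even though no true disparity exists. This is the kind of apparent difference that a single-seed evaluation on a small subgroup could mistake for a real one.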


