Developing an ICU scoring system with interaction terms using a genetic algorithm

04/22/2016
by Chee Chun Gan, et al.

ICU mortality scoring systems attempt to predict patient mortality using predictive models built on various clinical predictors. Examples of such systems are APACHE, SAPS and MPM. However, most such scoring systems do not actively search for and include interaction terms, even though physicians intuitively take such interactions into account when making a diagnosis. One barrier to including such terms in predictive models is the difficulty of applying most variable selection methods to high-dimensional datasets. Here, a genetic algorithm (GA) framework for variable selection with logistic regression models is used to search for two-way interaction terms in a clinical dataset of adult ICU patients, with a separate model built for each category of diagnosis on admission to the ICU. The models showed good discrimination across all categories, with a weighted average AUC of 0.84 (>0.90 for several categories), and the genetic algorithm found several significant interaction terms, which may give health practitioners greater insight into mortality prediction. The GA-selected models outperformed stepwise selection and random forest models, and the GA approach offers greater flexibility in variable selection because it can optimize over any modeler-defined model performance metric rather than a specific variable importance metric.
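
The general idea of GA-driven variable selection over main effects and two-way interaction terms can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the chromosome encoding, the GA operators, or the fitness function, so everything below is an assumption made for illustration. The sketch uses synthetic data rather than ICU records, a simple bit-mask chromosome, tournament selection with uniform crossover, a plain gradient-ascent logistic regression, and hold-out AUC as the modeler-defined fitness metric. All function names (`build_terms`, `design`, `fit_logistic`, `auc`, `fitness`, `run_ga`, `make_data`) are hypothetical.

```python
import math
import random
from itertools import combinations

def build_terms(n_features):
    """Candidate terms: main effects (i,) followed by all two-way interactions (i, j)."""
    mains = [(i,) for i in range(n_features)]
    return mains + list(combinations(range(n_features), 2))

def design(X, terms, mask):
    """Build model columns for the selected terms; an interaction is a product of features."""
    return [[math.prod(x[i] for i in t) for t, on in zip(terms, mask) if on] for x in X]

def fit_logistic(X, y, epochs=60, lr=0.5):
    """Batch gradient ascent on the log-likelihood; w[0] is the intercept."""
    w = [0.0] * (len(X[0]) + 1)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for row, target in zip(X, y):
            z = w[0] + sum(wi * xi for wi, xi in zip(w[1:], row))
            err = target - 1.0 / (1.0 + math.exp(-max(-30.0, min(30.0, z))))
            grad[0] += err
            for k, xi in enumerate(row):
                grad[k + 1] += err * xi
        w = [wi + lr * g / len(X) for wi, g in zip(w, grad)]
    return w

def auc(scores, labels):
    """Chance that a random positive outscores a random negative (ties count half)."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    if not pos or not neg:
        return 0.5
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

def fitness(mask, terms, train, valid):
    """Assumed fitness: AUC on a hold-out set for the model defined by the mask."""
    if not any(mask):
        return 0.5
    (Xt, yt), (Xv, yv) = train, valid
    w = fit_logistic(design(Xt, terms, mask), yt)
    scores = [w[0] + sum(wi * xi for wi, xi in zip(w[1:], row))
              for row in design(Xv, terms, mask)]
    return auc(scores, yv)

def run_ga(terms, train, valid, rng, pop_size=10, generations=6, p_mut=0.1):
    """Bit-mask GA: elitism, size-2 tournaments, uniform crossover, bit-flip mutation."""
    cache = {}
    def score(m):
        key = tuple(m)
        if key not in cache:
            cache[key] = fitness(m, terms, train, valid)
        return cache[key]
    pop = [[rng.random() < 0.5 for _ in terms] for _ in range(pop_size)]
    for _ in range(generations):
        children = [max(pop, key=score)]  # keep the best chromosome unchanged
        while len(children) < pop_size:
            a = max(rng.sample(pop, 2), key=score)
            b = max(rng.sample(pop, 2), key=score)
            # Pick each gene from either parent, then flip it with probability p_mut.
            child = [(x if rng.random() < 0.5 else y) != (rng.random() < p_mut)
                     for x, y in zip(a, b)]
            children.append(child)
        pop = children
    best = max(pop, key=score)
    return best, score(best)

def make_data(rng, n, n_features=4):
    """Synthetic outcome driven by x0 and the x1*x2 interaction (illustrative only)."""
    X = [[rng.uniform(-1, 1) for _ in range(n_features)] for _ in range(n)]
    y = []
    for x in X:
        logit = 2.0 * x[0] + 3.0 * x[1] * x[2]
        y.append(1 if rng.random() < 1 / (1 + math.exp(-logit)) else 0)
    return X, y

rng = random.Random(0)
terms = build_terms(4)
best_mask, best_auc = run_ga(terms, make_data(rng, 100), make_data(rng, 50), rng)
selected = [t for t, on in zip(terms, best_mask) if on]
print(selected, round(best_auc, 3))
```

Because the fitness function is an ordinary Python callable, swapping hold-out AUC for another performance metric only requires changing `fitness`, which is the kind of flexibility the abstract attributes to the GA approach. The sketch does not force an interaction's main effects into the model; whether the paper imposes such a constraint is not stated in the abstract.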

research
04/22/2016

An improved chromosome formulation for genetic algorithms applied to variable selection with the inclusion of interaction terms

Genetic algorithms are a well-known method for tackling the problem of v...
research
01/16/2022

A review and recommendations on variable selection methods in regression models for binary data

The selection of essential variables in logistic regression is vital bec...
research
04/27/2020

Using reference models in variable selection

Variable selection, or more generally, model reduction is an important a...
research
06/06/2019

Enhancing Multi-model Inference with Natural Selection

Multi-model inference covers a wide range of modern statistical applicat...
research
01/22/2018

Predictor Variable Prioritization in Nonlinear Models: A Genetic Association Case Study

The central aim in this paper is to address variable selection questions...
research
11/17/2017

Variable selection with genetic algorithms using repeated cross-validation of PLS regression models as fitness measure

Genetic algorithms are a widely used method in chemometrics for extracti...
