Model updating after interventions paradoxically introduces bias

10/22/2020
by   James Liley, et al.
0

Machine learning is increasingly being used to generate prediction models for use in a number of real-world settings, from credit risk assessment to clinical decision support. Recent discussions have highlighted potential problems in the updating of a predictive score for a binary outcome when an existing predictive score forms part of the standard workflow, driving interventions. In this setting, the existing score induces an additional causative pathway which leads to miscalibration when the original score is replaced. We propose a general causal framework to describe and address this problem, and demonstrate an equivalent formulation as a partially observed Markov decision process. We use this model to demonstrate the impact of such `naive updating' when performed repeatedly. Namely, we show that successive predictive scores may converge to a point where they predict their own effect, or may eventually oscillate between two values, and we argue that neither outcome is desirable. Furthermore, we demonstrate that even if model-fitting procedures improve, actual performance may worsen. We complement these findings with a discussion of several potential routes to overcome these problems.

READ FULL TEXT

page 11

page 14

research
10/08/2021

Stacking interventions for equitable outcomes

Predictive risk scores estimating probabilities for a binary outcome on ...
research
02/13/2022

Optimal sizing of a holdout set for safe predictive model updating

Risk models in medical statistics and healthcare machine learning are in...
research
04/10/2023

Linking a predictive model to causal effect estimation

A predictive model makes outcome predictions based on some given feature...
research
12/19/2022

Counterfactual Risk Assessments under Unmeasured Confounding

Statistical risk assessments inform consequential decisions, such as pre...
research
01/02/2021

Characterizing Fairness Over the Set of Good Models Under Selective Labels

Algorithmic risk assessments are increasingly used to make and inform de...
research
11/30/2020

Predictive case control designs for modification learning

Prediction models for clinical outcomes may be developed using a source ...
research
03/12/2021

Geodesic B-Score for Improved Assessment of Knee Osteoarthritis

Three-dimensional medical imaging enables detailed understanding of oste...

Please sign up or login with your details

Forgot password? Click here to reset