Adapting Fairness Interventions to Missing Values

by   Raymond Feng, et al.
Harvard University

Missing values in real-world data pose a significant and unique challenge to algorithmic fairness. Different demographic groups may be unequally affected by missing data, and the standard procedure for handling missing values where first data is imputed, then the imputed data is used for classification – a procedure referred to as "impute-then-classify" – can exacerbate discrimination. In this paper, we analyze how missing values affect algorithmic fairness. We first prove that training a classifier from imputed data can significantly worsen the achievable values of group fairness and average accuracy. This is because imputing data results in the loss of the missing pattern of the data, which often conveys information about the predictive label. We present scalable and adaptive algorithms for fair classification with missing values. These algorithms can be combined with any preexisting fairness-intervention algorithm to handle all possible missing patterns while preserving information encoded within the missing patterns. Numerical experiments with state-of-the-art fairness interventions demonstrate that our adaptive algorithms consistently achieve higher fairness and accuracy than impute-then-classify across different datasets.


page 1

page 2

page 3

page 4


Fairness without Imputation: A Decision Tree Approach for Fair Prediction with Missing Values

We investigate the fairness concerns of training a machine learning mode...

Aleatoric and Epistemic Discrimination in Classification

Machine learning (ML) models can underperform on certain population grou...

Fairness and Missing Values

The causes underlying unfair decision making are complex, being internal...

Minimax rate of consistency for linear models with missing values

Missing values arise in most real-world data sets due to the aggregation...

Arbitrariness Lies Beyond the Fairness-Accuracy Frontier

Machine learning tasks may admit multiple competing models that achieve ...

Fairness-aware Multi-view Clustering

In the era of big data, we are often facing the challenge of data hetero...

A comparative study of fairness-enhancing interventions in machine learning

Computers are increasingly used to make decisions that have significant ...

Please sign up or login with your details

Forgot password? Click here to reset