The Importance of Modeling Data Missingness in Algorithmic Fairness: A Causal Perspective

12/21/2020
by   Naman Goel, et al.
0

Training datasets for machine learning often have some form of missingness. For example, to learn a model for deciding whom to give a loan, the available training data includes individuals who were given a loan in the past, but not those who were not. This missingness, if ignored, nullifies any fairness guarantee of the training procedure when the model is deployed. Using causal graphs, we characterize the missingness mechanisms in different real-world scenarios. We show conditions under which various distributions, used in popular fairness algorithms, can or can not be recovered from the training data. Our theoretical results imply that many of these algorithms can not guarantee fairness in practice. Modeling missingness also helps to identify correct design principles for fair algorithms. For example, in multi-stage settings where decisions are made in multiple screening rounds, we use our framework to derive the minimal distributions required to design a fair algorithm. Our proposed algorithm decentralizes the decision-making process and still achieves similar performance to the optimal algorithm that requires centralization and non-recoverable distributions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/03/2017

Fair Pipelines

This work facilitates ensuring fairness of machine learning in the real ...
research
02/25/2022

On Learning and Testing of Counterfactual Fairness through Data Preprocessing

Machine learning has become more important in real-life decision-making ...
research
09/22/2022

SCALES: From Fairness Principles to Constrained Decision-Making

This paper proposes SCALES, a general framework that translates well-est...
research
06/10/2020

Fair Data Integration

The use of machine learning (ML) in high-stakes societal decisions has e...
research
07/27/2023

Fair Machine Unlearning: Data Removal while Mitigating Disparities

As public consciousness regarding the collection and use of personal inf...
research
01/29/2021

Beyond traditional assumptions in fair machine learning

This thesis scrutinizes common assumptions underlying traditional machin...
research
02/08/2023

Fairness in Matching under Uncertainty

The prevalence and importance of algorithmic two-sided marketplaces has ...

Please sign up or login with your details

Forgot password? Click here to reset