Fairness-guided SMT-based Rectification of Decision Trees and Random Forests

11/22/2020
by   Jiang Zhang, et al.
0

Data-driven decision making is gaining prominence with the popularity of various machine learning models. Unfortunately, real-life data used in machine learning training may capture human biases, and as a result the learned models may lead to unfair decision making. In this paper, we provide a solution to this problem for decision trees and random forests. Our approach converts any decision tree or random forest into a fair one with respect to a specific data set, fairness criteria, and sensitive attributes. The FairRepair tool, built based on our approach, is inspired by automated program repair techniques for traditional programs. It uses an SMT solver to decide which paths in the decision tree could have their outcomes flipped to improve the fairness of the model. Our experiments on the well-known adult dataset from UC Irvine demonstrate that FairRepair scales to realistic decision trees and random forests. Furthermore, FairRepair provides formal guarantees about soundness and completeness of finding a repair. Since our fairness-guided repair technique repairs decision trees and random forests obtained from a given (unfair) data-set, it can help to identify and rectify biases in decision-making in an organisation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/04/2021

Fair Training of Decision Tree Classifiers

We study the problem of formally verifying individual fairness of decisi...
research
12/21/2017

Fair Forests: Regularized Tree Induction to Minimize Model Bias

The potential lack of fairness in the outputs of machine learning algori...
research
03/25/2019

Learning Optimal and Fair Decision Trees for Non-Discriminative Decision-Making

In recent years, automated data-driven decision-making systems have enjo...
research
10/10/2018

Equality Constrained Decision Trees: For the Algorithmic Enforcement of Group Fairness

Fairness, through its many forms and definitions, has become an importan...
research
10/16/2021

Streaming Decision Trees and Forests

Machine learning has successfully leveraged modern data and provided com...
research
03/15/2017

Cost-complexity pruning of random forests

Random forests perform bootstrap-aggregation by sampling the training sa...
research
01/10/2022

A Study on Mitigating Hard Boundaries of Decision-Tree-based Uncertainty Estimates for AI Models

Outcomes of data-driven AI models cannot be assumed to be always correct...

Please sign up or login with your details

Forgot password? Click here to reset