Fairness Testing of Deep Image Classification with Adequacy Metrics

11/17/2021
by Peixin Zhang, et al.

As deep image classification applications, e.g., face recognition, become increasingly prevalent in our daily lives, concerns about their fairness are growing. It is thus crucial to comprehensively test the fairness of these applications before deployment. Existing fairness testing methods suffer from two limitations: 1) applicability, i.e., they apply only to structured data or text and do not handle the high-dimensional, abstract domain sampling at the semantic level that image classification applications require; 2) functionality, i.e., they generate unfair samples without providing a testing criterion to characterize the adequacy of the model's fairness. To fill this gap, we propose DeepFAIT, a systematic fairness testing framework specifically designed for deep image classification applications. DeepFAIT consists of several components that enable effective fairness testing of such applications: 1) a neuron selection strategy to identify the fairness-related neurons; 2) a set of multi-granularity adequacy metrics to evaluate the model's fairness; 3) a test selection algorithm for fixing the fairness issues efficiently. We have conducted experiments on widely adopted large-scale face recognition applications, i.e., VGGFace and FairFace. The experimental results confirm that our approach can effectively identify the fairness-related neurons, characterize the model's fairness, and select the most valuable test cases to mitigate the model's fairness issues.
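
The abstract does not define the neuron selection strategy or the adequacy metrics, so the following is only a rough illustration of the general idea, not DeepFAIT's actual method. It assumes that each test image has a counterpart differing only in a sensitive attribute, that per-neuron activations for both are already available as plain lists, and that "fairness-related" can be approximated by a large mean activation gap between the two. The function names `select_neurons` and `gap_coverage`, the ranking rule, and the threshold are all hypothetical.

```python
from typing import List, Sequence


def select_neurons(orig: Sequence[Sequence[float]],
                   flipped: Sequence[Sequence[float]],
                   k: int) -> List[int]:
    """Return indices of the k neurons with the largest mean activation gap.

    orig[i][j] and flipped[i][j] are the activations of neuron j for the
    i-th image and for its counterpart with the sensitive attribute changed.
    This ranking rule is an assumption made for illustration only.
    """
    n_neurons = len(orig[0])
    mean_gap = [
        sum(abs(o[j] - f[j]) for o, f in zip(orig, flipped)) / len(orig)
        for j in range(n_neurons)
    ]
    ranked = sorted(range(n_neurons), key=lambda j: mean_gap[j], reverse=True)
    return ranked[:k]


def gap_coverage(orig: Sequence[Sequence[float]],
                 flipped: Sequence[Sequence[float]],
                 neurons: Sequence[int],
                 threshold: float) -> float:
    """Fraction of the given neurons whose gap exceeds threshold on some pair.

    A coverage-style adequacy score in the spirit of neuron coverage; the
    definition is hypothetical and not taken from the paper.
    """
    if not neurons:
        raise ValueError("neurons must be non-empty")
    covered = sum(
        1 for j in neurons
        if any(abs(o[j] - f[j]) > threshold for o, f in zip(orig, flipped))
    )
    return covered / len(neurons)


# Toy activations: two image pairs, three neurons.
orig = [[0.1, 0.9, 0.5], [0.2, 0.8, 0.4]]
flipped = [[0.1, 0.2, 0.8], [0.3, 0.1, 0.4]]
selected = select_neurons(orig, flipped, k=2)
score = gap_coverage(orig, flipped, selected, threshold=0.5)
```

Under these assumptions, a test suite that pushes more of the selected neurons past the threshold would count as exercising the model's attribute-sensitive behavior more thoroughly; the framework's own multi-granularity metrics may be defined quite differently.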
