Fairness Constraints in Semi-supervised Learning

by Tao Zhang, et al.

Fairness in machine learning has received considerable attention. However, most studies on fair learning focus on either supervised or unsupervised learning; very few consider semi-supervised settings. Yet, in reality, most machine learning tasks rely on large datasets that contain both labeled and unlabeled data. One of the key issues in fair learning is the balance between fairness and accuracy. Previous studies argue that increasing the size of the training set can yield a better trade-off. We believe that enlarging the training set with unlabeled data may achieve a similar result. Hence, we develop a framework for fair semi-supervised learning, formulated as an optimization problem. The objective includes a classifier loss to optimize accuracy, a label propagation loss to optimize predictions on unlabeled data, and fairness constraints over labeled and unlabeled data to optimize the fairness level. The framework is implemented with logistic regression and support vector machines under the fairness metrics of disparate impact and disparate mistreatment. We theoretically analyze the source of discrimination in semi-supervised learning via a bias, variance and noise decomposition. Extensive experiments show that our method achieves fair semi-supervised learning and reaches a better trade-off between accuracy and fairness than fair supervised learning.
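To make the shape of such an objective concrete, the following is a minimal sketch, not the authors' formulation. It assumes a logistic regression classifier and stands in for the disparate impact constraint with a widely used proxy: the covariance between a binary sensitive attribute and the signed distance to the decision boundary, computed over labeled and unlabeled points together. The label propagation term is deliberately omitted, the constraint is relaxed into a penalty, and every name here (`boundary_covariance`, `penalized_objective`, the weight `lam`) is hypothetical.

```python
import numpy as np

def logistic_loss(w, X, y):
    """Mean logistic loss for labels y in {0, 1}, computed stably."""
    z = X @ w
    return np.mean(np.logaddexp(0.0, z) - y * z)

def boundary_covariance(w, X, s):
    """Covariance proxy: sensitive attribute s vs. signed distance X @ w.

    Assumption: this proxy substitutes for a disparate impact constraint;
    the abstract does not specify the exact form of the constraint.
    """
    return np.mean((s - s.mean()) * (X @ w))

def disparate_impact_ratio(y_pred, s):
    """Ratio (min / max) of positive-prediction rates of groups s=0 and s=1."""
    rate0 = y_pred[s == 0].mean()
    rate1 = y_pred[s == 1].mean()
    hi = max(rate0, rate1)
    return 1.0 if hi == 0 else min(rate0, rate1) / hi

def penalized_objective(w, X_lab, y_lab, s_lab, X_unl, s_unl, lam=1.0):
    """Classifier loss on labeled data plus a fairness penalty on all data.

    Assumption: the fairness constraint is relaxed to an absolute-value
    penalty with hypothetical weight lam; label propagation is left out.
    """
    X_all = np.vstack([X_lab, X_unl])
    s_all = np.concatenate([s_lab, s_unl])
    fairness = abs(boundary_covariance(w, X_all, s_all))
    return logistic_loss(w, X_lab, y_lab) + lam * fairness

# Tiny illustrative data (made up for the sketch, not from the paper).
X_lab = np.array([[1.0, 0.5], [0.2, -1.0], [-0.7, 0.3], [1.5, 1.0]])
y_lab = np.array([1.0, 0.0, 0.0, 1.0])
s_lab = np.array([0, 1, 1, 0])
X_unl = np.array([[0.4, 0.1], [-1.2, 0.8]])
s_unl = np.array([0, 1])

w0 = np.zeros(2)
obj_at_zero = penalized_objective(w0, X_lab, y_lab, s_lab, X_unl, s_unl)
di = disparate_impact_ratio(np.array([1, 1, 0, 0, 1, 0]),
                            np.array([0, 0, 0, 1, 1, 1]))
```

The point of the sketch is that the unlabeled points enter only through the fairness term: it needs features and sensitive attributes but no labels, which is one way unlabeled data can sharpen the fairness estimate without touching the classifier loss. In practice the objective would be passed to a constrained or penalized optimizer.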




Fairness in Semi-supervised Learning: Unlabeled Data Help to Reduce Discrimination

A growing specter in the rise of machine learning is whether the decisio...

Leveraging Semi-Supervised Learning for Fairness using Neural Networks

There has been a growing concern about the fairness of decision-making s...

The Peaking Phenomenon in Semi-supervised Learning

For the supervised least squares classifier, when the number of training...

Laplacian Support Vector Machines Trained in the Primal

In the last few years, due to the growing ubiquity of unlabeled data, mu...

Alternating Projections for Learning with Expectation Constraints

We present an objective function for learning with unlabeled data that u...

Teaching the Old Dog New Tricks: Supervised Learning with Constraints

Methods for taking into account external knowledge in Machine Learning m...

Optimally Combining Classifiers for Semi-Supervised Learning

This paper considers semi-supervised learning for tabular data. It is wi...
