Fair Clustering with Multiple Colors

02/18/2020
by   Matteo Böhm, et al.

In a fair clustering instance, we are given a data set A in which every point is assigned some color. Colors correspond to various protected attributes such as sex, ethnicity, or age. A clustering is fair if the membership of points in a cluster is uncorrelated with the coloring of the points. Of particular interest is the case where all colors are equally represented. For exactly two colors, Chierichetti, Kumar, Lattanzi and Vassilvitskii (NIPS 2017) showed that various k-clustering objectives admit a constant-factor approximation. Since then, a number of follow-up works have attempted to extend this result to the multi-color case, though so far the only known results either yield no constant-factor approximation, apply only to special clustering objectives such as k-center, yield bicriteria approximations, or require k to be constant. In this paper, we present a simple reduction from unconstrained k-clustering to fair k-clustering for a large range of clustering objectives including k-median, k-means, and k-center. The reduction loses only a constant factor in the approximation guarantee, marking the first true constant-factor approximation for many of these problems.
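To make the fairness condition concrete, the following is a minimal sketch of a checker for the equally-represented case, assuming that "fair" there means every cluster contains each color exactly equally often. The function name `is_exactly_fair` and its argument layout are hypothetical and do not come from the paper; the sketch only tests a given clustering and does not implement the paper's reduction.

```python
from collections import Counter


def is_exactly_fair(labels, colors):
    """Check the assumed exact-balance condition for a clustering.

    labels[i] is the cluster of point i and colors[i] is its color.
    Returns True if every cluster contains every color that occurs in
    the data set, and all colors occur equally often within it.
    """
    if len(labels) != len(colors):
        raise ValueError("labels and colors must have the same length")

    all_colors = set(colors)

    # Count how often each color occurs in each cluster.
    per_cluster = {}
    for cluster, color in zip(labels, colors):
        per_cluster.setdefault(cluster, Counter())[color] += 1

    for counts in per_cluster.values():
        # Every color must be present in the cluster ...
        if set(counts) != all_colors:
            return False
        # ... and all colors must have the same count.
        if len(set(counts.values())) != 1:
            return False
    return True


if __name__ == "__main__":
    colors = ["red", "green", "blue", "red", "green", "blue"]
    print(is_exactly_fair([0, 0, 0, 1, 1, 1], colors))
    print(is_exactly_fair([0, 0, 1, 1, 1, 1], colors))
```

In the usage example, the first clustering puts one point of each color into each cluster, while the second leaves cluster 0 without a blue point, so only the first passes the check.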
