Coresets for Clustering with General Assignment Constraints

01/20/2023
by   Lingxiao Huang, et al.
0

Designing small-sized coresets, which approximately preserve the costs of the solutions for large datasets, has been an important research direction for the past decade. We consider coreset construction for a variety of general constrained clustering problems. We introduce a general class of assignment constraints, including capacity constraints on cluster centers, and assignment structure constraints for data points (modeled by a convex body ℬ). We give coresets for constrained clustering problems with such general assignment constraints, significantly generalizing known coreset results for constrained clustering. Notable implications of our general theorem include the first ϵ-coreset for capacitated and fair k-Median with m outliers in Euclidean spaces whose size is Õ(m + k^2 ϵ^-4), generalizing and improving upon the prior bounds in [Braverman et al., FOCS'22; Huang et al., ICLR'23] (for capacitated k-Median, the coreset size bound obtained in [Braverman et al., FOCS'22] is Õ(k^3 ϵ^-6), and for k-Median with m outliers, the coreset size bound obtained in [Huang et al., ICLR'23] is Õ(m + k^3 ϵ^-5)), and the first ϵ-coreset of size poly(k ϵ^-1) for fault-tolerant clustering for metric spaces with bounded covering exponent.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/02/2023

FPT Approximations for Capacitated/Fair Clustering with Outliers

Clustering problems such as k-Median, and k-Means, are motivated from ap...
research
07/20/2020

On Coresets for Fair Clustering in Metric and Euclidean Spaces and Their Applications

Fair clustering is a constrained variant of clustering where the goal is...
research
06/20/2019

Coresets for Clustering with Fairness Constraints

In a recent work, Chierichetti et al. studied the following "fair" varia...
research
10/19/2022

Near-optimal Coresets for Robust Clustering

We consider robust clustering problems in ℝ^d, specifically k-clustering...
research
09/05/2022

The Power of Uniform Sampling for Coresets

Motivated by practical generalizations of the classic k-median and k-mea...
research
02/27/2023

On Coresets for Clustering in Small Dimensional Euclidean Spaces

We consider the problem of constructing small coresets for k-Median in E...
research
02/10/2023

Neural Capacitated Clustering

Recent work on deep clustering has found new promising methods also for ...

Please sign up or login with your details

Forgot password? Click here to reset