Parameterized Approximation for Robust Clustering in Discrete Geometric Spaces

05/12/2023
by   Fateme Abbasi, et al.
0

We consider the well-studied Robust (k, z)-Clustering problem, which generalizes the classic k-Median, k-Means, and k-Center problems. Given a constant z≥ 1, the input to Robust (k, z)-Clustering is a set P of n weighted points in a metric space (M,δ) and a positive integer k. Further, each point belongs to one (or more) of the m many different groups S_1,S_2,…,S_m. Our goal is to find a set X of k centers such that max_i ∈ [m]∑_p ∈ S_i w(p) δ(p,X)^z is minimized. This problem arises in the domains of robust optimization [Anthony, Goyal, Gupta, Nagarajan, Math. Oper. Res. 2010] and in algorithmic fairness. For polynomial time computation, an approximation factor of O(log m/loglog m) is known [Makarychev, Vakilian, COLT 2021], which is tight under a plausible complexity assumption even in the line metrics. For FPT time, there is a (3^z+ϵ)-approximation algorithm, which is tight under GAP-ETH [Goyal, Jaiswal, Inf. Proc. Letters, 2023]. Motivated by the tight lower bounds for general discrete metrics, we focus on geometric spaces such as the (discrete) high-dimensional Euclidean setting and metrics of low doubling dimension, which play an important role in data analysis applications. First, for a universal constant η_0 >0.0006, we devise a 3^z(1-η_0)-factor FPT approximation algorithm for discrete high-dimensional Euclidean spaces thereby bypassing the lower bound for general metrics. We complement this result by showing that even the special case of k-Center in dimension Θ(log n) is (√(3/2)- o(1))-hard to approximate for FPT algorithms. Finally, we complete the FPT approximation landscape by designing an FPT (1+ϵ)-approximation scheme (EPAS) for the metric of sub-logarithmic doubling dimension.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/30/2022

On the Fixed-Parameter Tractability of Capacitated Clustering

We study the complexity of the classic capacitated k-median and k-means ...
research
04/06/2023

Parameterized Approximation Schemes for Clustering with General Norm Objectives

This paper considers the well-studied algorithmic regime of designing a ...
research
11/03/2022

Connected k-Center and k-Diameter Clustering

Motivated by an application from geodesy, we introduce a novel clusterin...
research
03/16/2022

Tight Lower Bounds for Approximate Exact k-Center in ℝ^d

In the discrete k-center problem, we are given a metric space (P,) where...
research
03/02/2023

Coresets for Clustering in Geometric Intersection Graphs

Designing coresets–small-space sketches of the data preserving cost of t...
research
09/04/2018

Faster Balanced Clusterings in High Dimension

The problem of constrained clustering has attracted significant attentio...
research
02/27/2023

On Coresets for Clustering in Small Dimensional Euclidean Spaces

We consider the problem of constructing small coresets for k-Median in E...

Please sign up or login with your details

Forgot password? Click here to reset