Locally Private k-Means Clustering with Constant Multiplicative Approximation and Near-Optimal Additive Error

05/31/2021
by Anamay Chaturvedi, et al.

Given a data set of n points in d'-dimensional Euclidean space, the k-means problem asks for a set of k points (called centers) that minimizes the sum, over the data points, of the squared ℓ_2 distance from each point to its nearest center. Recent work on this problem in the locally private setting achieves constant multiplicative approximation with additive error Õ(n^{1/2 + a} · k · max{√d, √k}) and proves a lower bound of Ω(√n) on the additive error for any solution with a constant number of rounds. In this work we bridge the gap between the exponents of n in the upper and lower bounds on the additive error with two new algorithms. Given any α > 0, our first algorithm achieves a multiplicative approximation guarantee that is at most a (1+α) factor greater than that of any non-private k-means clustering algorithm, with k^{Õ(1/α^2)} · √(d' n) · log n additive error. Given any c > √2, our second algorithm achieves O(k^{1 + Õ(1/(2c^2-1))} · √(d' n) · log n) additive error with constant multiplicative approximation. Both algorithms go beyond the Ω(n^{1/2 + a}) factor that occurs in the additive error for arbitrarily small parameters a in previous work. The second algorithm in particular shows for the first time that it is possible to solve the locally private k-means problem in a constant number of rounds with constant-factor multiplicative approximation and a polynomial dependence on k in the additive error that is arbitrarily close to linear.
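To make the objective and the form of the guarantee concrete, the following is a minimal, non-private sketch of the k-means cost and of what "multiplicative approximation with additive error" means as an inequality. It is not either of the paper's algorithms and involves no privacy mechanism; the function names `kmeans_cost` and `satisfies_guarantee` and the toy data are assumptions chosen for illustration.

```python
from typing import Sequence

Point = Sequence[float]


def squared_distance(p: Point, q: Point) -> float:
    """Squared Euclidean (l_2^2) distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans_cost(points: Sequence[Point], centers: Sequence[Point]) -> float:
    """Sum over all points of the squared distance to the nearest center."""
    if not centers:
        raise ValueError("at least one center is required")
    return sum(min(squared_distance(x, c) for c in centers) for x in points)


def satisfies_guarantee(cost: float, opt_cost: float,
                        mult: float, additive: float) -> bool:
    """Check cost <= mult * opt_cost + additive (hypothetical helper).

    `mult` plays the role of the multiplicative approximation factor and
    `additive` the role of the additive error term.
    """
    return cost <= mult * opt_cost + additive


# Toy data (illustrative only): two well-separated pairs of points in the plane.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
good_centers = [(0.0, 1.0), (10.0, 1.0)]  # one center per pair
poor_centers = [(5.0, 1.0)]               # a single center in the middle

good_cost = kmeans_cost(points, good_centers)
poor_cost = kmeans_cost(points, poor_centers)
```

In this toy instance each point lies at squared distance 1 from its nearest center in `good_centers`, so `good_cost` is 4.0, while `poor_cost` is 104.0. A solution of cost 104.0 satisfies the inequality against an optimum of 4.0 only if the allowed slack is large enough, for example with `mult = 1` and `additive = 100`.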


