Almost 3-Approximate Correlation Clustering in Constant Rounds

05/07/2022
by   Soheil Behnezhad, et al.
0

We study parallel algorithms for correlation clustering. Each pair among n objects is labeled as either "similar" or "dissimilar". The goal is to partition the objects into arbitrarily many clusters while minimizing the number of disagreements with the labels. Our main result is an algorithm that for any ϵ > 0 obtains a (3+ϵ)-approximation in O(1/ϵ) rounds (of models such as massively parallel computation, local, and semi-streaming). This is a culminating point for the rich literature on parallel correlation clustering. On the one hand, the approximation (almost) matches a natural barrier of 3 for combinatorial algorithms. On the other hand, the algorithm's round-complexity is essentially constant. To achieve this result, we introduce a simple O(1/ϵ)-round parallel algorithm. Our main result is to provide an analysis of this algorithm, showing that it achieves a (3+ϵ)-approximation. Our analysis draws on new connections to sublinear-time algorithms. Specifically, it builds on the work of Yoshida, Yamamoto, and Ito [STOC'09] on bounding the "query complexity" of greedy maximal independent set. To our knowledge, this is the first application of this method in analyzing the approximation ratio of any algorithm.

READ FULL TEXT

page 2

page 6

research
07/13/2023

Breaking 3-Factor Approximation for Correlation Clustering in Polylogarithmic Rounds

In this paper, we study parallel algorithms for the correlation clusteri...
research
06/15/2021

Correlation Clustering in Constant Many Parallel Rounds

Correlation clustering is a central topic in unsupervised learning, with...
research
05/16/2022

A Parallel Algorithm for (3 + ε)-Approximate Correlation Clustering

Grouping together similar elements in datasets is a common task in data ...
research
02/19/2020

Parallel Algorithms for Small Subgraph Counting

Subgraph counting is a fundamental problem in analyzing massive graphs, ...
research
05/23/2023

Single-Pass Pivot Algorithm for Correlation Clustering. Keep it simple!

We show that a simple single-pass semi-streaming variant of the Pivot al...
research
04/16/2019

Parallel Balanced Allocations: The Heavily Loaded Case

We study parallel algorithms for the classical balls-into-bins problem, ...
research
02/26/2020

Query-Efficient Correlation Clustering

Correlation clustering is arguably the most natural formulation of clust...

Please sign up or login with your details

Forgot password? Click here to reset