Sketch-and-solve approaches to k-means clustering by semidefinite programming

11/28/2022
by   Charles Clum, et al.
0

We introduce a sketch-and-solve approach to speed up the Peng-Wei semidefinite relaxation of k-means clustering. When the data is appropriately separated we identify the k-means optimal clustering. Otherwise, our approach provides a high-confidence lower bound on the optimal k-means value. This lower bound is data-driven; it does not make any assumption on the data nor how it is generated. We provide code and an extensive set of numerical experiments where we use this approach to certify approximate optimality of clustering solutions obtained by k-means++.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/03/2017

Monte Carlo approximation certificates for k-means clustering

Efficient algorithms for k-means clustering frequently converge to subop...
research
02/22/2016

Clustering subgaussian mixtures by semidefinite programming

We introduce a model-free relax-and-round algorithm for k-means clusteri...
research
08/10/2020

Sketching semidefinite programs for faster clustering

Many clustering problems enjoy solutions by semidefinite programming. Th...
research
01/20/2022

Sketch-and-Lift: Scalable Subsampled Semidefinite Program for K-means Clustering

Semidefinite programming (SDP) is a powerful tool for tackling a wide ra...
research
09/19/2022

Global Optimization for Cardinality-constrained Minimum Sum-of-Squares Clustering via Semidefinite Programming

The minimum sum-of-squares clustering (MSSC), or k-means type clustering...
research
06/01/2018

Efficient, Certifiably Optimal High-Dimensional Clustering

We consider SDP relaxation methods for data and variable clustering prob...
research
03/25/2021

A Semidefinite Optimization-based Branch-and-Bound Algorithm for Several Reactive Optimal Power Flow Problems

The Reactive Optimal Power Flow (ROPF) problem consists in computing an ...

Please sign up or login with your details

Forgot password? Click here to reset