Learning to Cluster via Same-Cluster Queries

08/17/2021
by   Yi Li, et al.
0

We study the problem of learning to cluster data points using an oracle which can answer same-cluster queries. Different from previous approaches, we do not assume that the total number of clusters is known at the beginning and do not require that the true clusters are consistent with a predefined objective function such as the K-means. These relaxations are critical from the practical perspective and, meanwhile, make the problem more challenging. We propose two algorithms with provable theoretical guarantees and verify their effectiveness via an extensive set of experiments on both synthetic and real-world data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/28/2019

Same-Cluster Querying for Overlapping Clusters

Overlapping clusters are common in models of many practical data-segment...
research
06/08/2020

Exact Recovery of Mangled Clusters with Same-Cluster Queries

We study the problem of recovering distorted clusters in the semi-superv...
research
01/31/2021

Exact Recovery of Clusters in Finite Metric Spaces Using Oracle Queries

We investigate the problem of exact cluster recovery using oracle querie...
research
06/15/2018

Query K-means Clustering and the Double Dixie Cup Problem

We consider the problem of approximate K-means clustering with outliers ...
research
05/05/2020

Cluster-based dual evolution for multivariate systems

This paper proposes a cluster-based method to analyse multivariate syste...
research
02/06/2020

Efficient Algorithms for Generating Provably Near-Optimal Cluster Descriptors for Explainability

Improving the explainability of the results from machine learning method...
research
04/17/2019

SCE: A manifold regularized set-covering method for data partitioning

Cluster analysis plays a very important role in data analysis. In these ...

Please sign up or login with your details

Forgot password? Click here to reset