Generating Object Cluster Hierarchies for Benchmarking

06/17/2016
by   Michał Spytkowski, et al.
0

The field of Machine Learning and the topic of clustering within it is still widely researched. Recently, researchers became interested in a new variant of hierarchical clustering, where hierarchical (partial order) relationships exist not only between clusters but also objects. In this variant of clustering, objects can be assigned not only to leave, but other properties are also defined. Although examples of this approach already exist in literature, the authors have encountered a problem with the analysis and comparison of obtained results. The problem is twofold. Firstly, there is a lack of evaluation methods. Secondly, there is a lack of available benchmark data, at least the authors failed to find them. The aim of this work is to fill the second gap. The main contribution of this paper is a new method of generating hierarchical structures of data. Additionally, the paper includes a theoretical analysis of the generation parameters and their influence on the results. Comprehensive experiments are presented and discussed. The dataset generator and visualiser tools developed are publicly available for use (http://kio.pwr.edu.pl/?page_id=396).

READ FULL TEXT

page 18

page 19

page 21

research
03/28/2016

Hierarchy of Groups Evaluation Using Different F-score Variants

The paper presents a cursory examination of clustering, focusing on a ra...
research
11/02/2018

Foundations of Comparison-Based Hierarchical Clustering

We address the classical problem of hierarchical clustering, but in a fr...
research
06/07/2019

Benchmarking Minimax Linkage

Minimax linkage was first introduced by Ao et al. [3] in 2004, as an alt...
research
09/07/2023

Medoid Silhouette clustering with automatic cluster number selection

The evaluation of clustering results is difficult, highly dependent on t...
research
09/20/2019

Online Hierarchical Clustering Approximations

Hierarchical clustering is a widely used approach for clustering dataset...
research
02/01/2019

Clubmark: a Parallel Isolation Framework for Benchmarking and Profiling Clustering Algorithms on NUMA Architectures

There is a great diversity of clustering and community detection algorit...
research
12/06/2012

Clusters and water flows: a novel approach to modal clustering through Morse theory

The problem of finding groups in data (cluster analysis) has been extens...

Please sign up or login with your details

Forgot password? Click here to reset