Nonparametric Hierarchical Clustering of Functional Data

07/02/2014
by Marc Boullé, et al.

In this paper, we address the problem of curve clustering. We propose a nonparametric method that partitions the curves into clusters and discretizes the dimensions of the curve points into intervals. The cross-product of these partitions forms a data-grid, which is obtained using a Bayesian model selection approach while making no assumptions regarding the curves. Finally, we propose a post-processing technique that reduces the number of clusters in order to improve the interpretability of the clustering. It consists of optimally merging the clusters step by step, which corresponds to an agglomerative hierarchical classification whose dissimilarity measure is the variation of the criterion. Interestingly, this measure is none other than the sum of the Kullback-Leibler divergences between cluster distributions before and after the merges. The practical interest of the approach for exploratory analysis of functional data is presented and compared with an alternative approach on an artificial and a real-world data set.
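The merging step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each cluster is summarized by a vector of point counts over the data-grid cells, and it assumes the merge cost is the count-weighted sum of Kullback-Leibler divergences from each cluster's distribution to the distribution of the merged cluster. The function names `kl`, `merge_cost`, and `agglomerate` are hypothetical.

```python
import numpy as np

def kl(p, q):
    """Kullback-Leibler divergence KL(p || q), skipping cells where p is zero."""
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

def merge_cost(ca, cb):
    """Assumed dissimilarity: count-weighted KL divergences from each
    cluster's cell distribution to that of the merged cluster."""
    ca = np.asarray(ca, dtype=float)
    cb = np.asarray(cb, dtype=float)
    merged = ca + cb
    q = merged / merged.sum()
    return ca.sum() * kl(ca / ca.sum(), q) + cb.sum() * kl(cb / cb.sum(), q)

def agglomerate(counts, n_clusters):
    """Greedily merge the cheapest pair of clusters until n_clusters remain.

    counts: one row of non-negative cell counts per initial cluster
            (each row is assumed to have a positive total).
    Returns the list of original cluster indices grouped into each final cluster.
    """
    clusters = [np.asarray(c, dtype=float) for c in counts]
    members = [[i] for i in range(len(clusters))]
    while len(clusters) > n_clusters:
        # Exhaustive search over all pairs for the cheapest merge.
        _, i, j = min(
            (merge_cost(clusters[i], clusters[j]), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        clusters[i] = clusters[i] + clusters[j]
        members[i] = members[i] + members[j]
        del clusters[j], members[j]
    return members

# Toy example: four clusters described by counts over three grid cells.
toy_counts = [[10, 0, 0], [9, 1, 0], [0, 0, 10], [0, 1, 9]]
groups = agglomerate(toy_counts, n_clusters=2)
```

In this toy example the first two clusters put their mass on the first cell and the last two on the third cell, so the cheapest merges pair them accordingly. The exhaustive pair search is chosen for readability only; it recomputes every cost at each step.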

research
05/06/2015

Cats & Co: Categorical Time Series Coclustering

We suggest a novel method of clustering and exploratory analysis of temp...
research
05/22/2023

funLOCI: a local clustering algorithm for functional data

Nowadays, more and more problems are dealing with data with one infinite...
research
05/02/2019

Selection of the Number of Clusters in Functional Data Analysis

Identifying the number K of clusters in a dataset is one of the most dif...
research
09/15/2023

Choice of trimming proportion and number of clusters in robust clustering based on trimming

So-called "classification trimmed likelihood curves" have been proposed ...
research
02/15/2023

Mimetic Muscle Rehabilitation Analysis Using Clustering of Low Dimensional 3D Kinect Data

Facial nerve paresis is a severe complication that arises post-head and ...
research
01/01/2018

A clustering method for misaligned curves

We consider the problem of clustering misaligned curves. According to ou...
research
01/04/2010

Inference of global clusters from locally distributed data

We consider the problem of analyzing the heterogeneity of clustering dis...
