Analytical and statistical properties of local depth functions motivated by clustering applications

08/27/2020
by   Giacomo Francisci, et al.
0

Local depth functions (LDFs) are used for describing the local geometric features of multivariate distributions, especially in multimodal models. In this paper, we undertake a rigorous systematic study of the LDFs and use it to develop a theoretically validated algorithm for clustering. For this reason, we establish several analytical and statistical properties of LDFs. First, we show that, when the underlying probability distribution is absolutely continuous, under an appropriate scaling that converge to zero (referred to as extreme localization), LDFs converge uniformly to a power of the density and obtain a related rate of convergence result. Second, we establish that the centered and scaled sample LDFs converge in distribution to a centered Gaussian process, uniformly in the space of bounded functions on R p x [0,infinity], as the sample size diverges to infinity. Third, under an extreme localization that depends on the sample size, we determine the correct centering and scaling for the sample LDFs to possess a limiting normal distribution. Fourth, invoking the above results, we develop a new clustering algorithm that uses the LDFs and their differentiability properties. Fifth, for the last purpose, we establish several results concerning the gradient systems related to LDFs. Finally, we illustrate the finite sample performance of our results using simulations and apply them to two datasets.

READ FULL TEXT
research
07/31/2023

Guaranteed Optimal Generative Modeling with Maximum Deviation from the Empirical Distribution

Generative modeling is a widely-used machine learning method with variou...
research
07/24/2020

Optimal sample size for the Birnbaum-Saunders distribution under a decision-theoretic approach

The Birnbaum-Saunders distribution has been widely applied in several ar...
research
10/16/2018

Finite-sample Analysis of M-estimators using Self-concordance

We demonstrate how self-concordance of the loss can be exploited to obta...
research
01/08/2022

Optimal 1-Wasserstein Distance for WGANs

The mathematical forces at work behind Generative Adversarial Networks r...

Please sign up or login with your details

Forgot password? Click here to reset