Massive Data Clustering in Moderate Dimensions from the Dual Spaces of Observation and Attribute Data Clouds

04/06/2017
by   Fionn Murtagh, et al.
0

Cluster analysis of very high dimensional data can benefit from the properties of such high dimensionality. Informally expressed, in this work, our focus is on the analogous situation when the dimensionality is moderate to small, relative to a massively sized set of observations. Mathematically expressed, these are the dual spaces of observations and attributes. The point cloud of observations is in attribute space, and the point cloud of attributes is in observation space. In this paper, we begin by summarizing various perspectives related to methodologies that are used in multivariate analytics. We draw on these to establish an efficient clustering processing pipeline, both partitioning and hierarchical clustering.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/11/2020

Folding-based compression of point cloud attributes

Existing techniques to compress point cloud attributes leverage either g...
research
02/15/2012

The Future of Search and Discovery in Big Data Analytics: Ultrametric Information Spaces

Consider observation data, comprised of n observation vectors with value...
research
03/29/2023

Topological Point Cloud Clustering

We present Topological Point Cloud Clustering (TPCC), a new method to cl...
research
03/17/2022

3DAC: Learning Attribute Compression for Point Clouds

We study the problem of attribute compression for large-scale unstructur...
research
07/27/2023

Clustering based Point Cloud Representation Learning for 3D Analysis

Point cloud analysis (such as 3D segmentation and detection) is a challe...
research
09/21/2022

Surface area and volume of excursion sets observed on point cloud based polytopic tessellations

The excursion set of a C^2 smooth random field carries relevant informat...

Please sign up or login with your details

Forgot password? Click here to reset