Understanding High Dimensional Spaces through Visual Means Employing Multidimensional Projections

by   Haseeb Younis, et al.

Data visualisation helps understanding data represented by multiple variables, also called features, stored in a large matrix where individuals are stored in lines and variable values in columns. These data structures are frequently called multidimensional spaces.In this paper, we illustrate ways of employing the visual results of multidimensional projection algorithms to understand and fine-tune the parameters of their mathematical framework. Some of the common mathematical common to these approaches are Laplacian matrices, Euclidian distance, Cosine distance, and statistical methods such as Kullback-Leibler divergence, employed to fit probability distributions and reduce dimensions. Two of the relevant algorithms in the data visualisation field are t-distributed stochastic neighbourhood embedding (t-SNE) and Least-Square Projection (LSP). These algorithms can be used to understand several ranges of mathematical functions including their impact on datasets. In this article, mathematical parameters of underlying techniques such as Principal Component Analysis (PCA) behind t-SNE and mesh reconstruction methods behind LSP are adjusted to reflect the properties afforded by the mathematical formulation. The results, supported by illustrative methods of the processes of LSP and t-SNE, are meant to inspire students in understanding the mathematics behind such methods, in order to apply them in effective data analysis tasks in multiple applications.


A Multidimensional Artistic Approach to Enhance Understanding of Julia Sets through Computer Programming

This article proposes an artistic approach to increase and enrich the un...

Generalized Biplots for Multidimensional Scaled Projections

Dimension reduction and visualization is a staple of data analytics. Met...

Uncertainty-Aware Principal Component Analysis

We present a technique to perform dimensionality reduction on data that ...

Semi-Orthogonal Multilinear PCA with Relaxed Start

Principal component analysis (PCA) is an unsupervised method for learnin...

Manifold valued data analysis of samples of networks, with applications in corpus linguistics

Networks can be used in many applications, such as in the analysis of te...

Unsupervised Doppler Radar-Based Activity Recognition for e-healthcare

Passive radio frequency (RF) sensing and monitoring of human daily activ...

Please sign up or login with your details

Forgot password? Click here to reset