Supporting Analysis of Dimensionality Reduction Results with Contrastive Learning

05/10/2019
by   Takanori Fujiwara, et al.
0

Dimensionality reduction (DR) is frequently used for analyzing and visualizing high-dimensional data as it provides a good first glance of the data. However, to interpret the DR result for gaining useful insights from the data, it would take additional analysis effort such as identifying clusters and understanding their characteristics. While there are many automatic methods (e.g., density-based clustering methods) to identify clusters, effective methods for understanding a cluster's characteristics are still lacking. A cluster can be mostly characterized by its distribution of feature values. Reviewing the original feature values is not a straightforward task when the number of features is large. To address this challenge, we present a visual analytics method that effectively highlights the essential features of a cluster in a DR result. To extract the essential features, we introduce an enhanced usage of contrastive principal component analysis (cPCA). Our method can calculate each feature's relative contribution to the contrast between one cluster and other clusters. With our cPCA-based method, we have created an interactive system including a scalable visualization of clusters' feature contributions. We demonstrate the effectiveness of our method and system with case studies using several publicly available datasets.

READ FULL TEXT

page 3

page 6

page 7

page 8

page 9

research
03/09/2021

Explaining dimensionality reduction results using Shapley values

Dimensionality reduction (DR) techniques have been consistently supporti...
research
06/29/2021

Interactive Dimensionality Reduction for Comparative Analysis

Finding the similarities and differences between groups of datasets is a...
research
01/26/2021

Contrastive analysis for scatter plot-based representations of dimensionality reduction

Exploring multidimensional datasets is a ubiquitous part of the ones wor...
research
07/09/2020

Contrastive Multiple Correspondence Analysis (cMCA): Applying the Contrastive Learning Method to Identify Political Subgroups

Ideal point estimation and dimensionality reduction have long been utili...
research
08/01/2023

Classes are not Clusters: Improving Label-based Evaluation of Dimensionality Reduction

A common way to evaluate the reliability of dimensionality reduction (DR...
research
03/09/2023

Entropic Wasserstein Component Analysis

Dimension reduction (DR) methods provide systematic approaches for analy...
research
01/30/2020

NCVis: Noise Contrastive Approach for Scalable Visualization

Modern methods for data visualization via dimensionality reduction, such...

Please sign up or login with your details

Forgot password? Click here to reset