Self-Organizing Maps for Exploration of Partially Observed Data and Imputation of Missing Values

02/16/2022
by   Sara Rejeb, et al.
0

The self-organizing map is an unsupervised neural network which is widely used for data visualisation and clustering in the field of chemometrics. The classical Kohonen algorithm that computes self-organizing maps is suitable only for complete data without any missing values. However, in many applications, partially observed data are the norm. In this paper, we propose an extension of self-organizing maps to incomplete data via a new criterion that also defines estimators of the missing values. In addition, an adaptation of the Kohonen algorithm, named missSOM, is provided to compute these self-organizing maps and impute missing values. An efficient implementation is provided. Numerical experiments on simulated data and a chemical dataset illustrate the short computing time of missSOM and assess its performance regarding various criteria and in comparison to the state of the art.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/20/2020

RDIS: Random Drop Imputation with Self-Training for Incomplete Time Series Data

It is common that time-series data with missing values are encountered i...
research
02/20/2023

Transformed Distribution Matching for Missing Value Imputation

We study the problem of imputing missing values in a dataset, which has ...
research
06/06/2019

Estimation and imputation in Probabilistic Principal Component Analysis with Missing Not At Random data

Missing Not At Random values are considered to be non-ignorable and requ...
research
02/25/2023

Data Imputation for Sparse Radio Maps in Indoor Positioning (Extended Version)

Indoor location-based services rely on the availability of sufficiently ...
research
02/03/2020

Linear predictor on linearly-generated data with missing values: non consistency and solutions

We consider building predictors when the data have missing values. We st...
research
06/28/2019

missSBM: An R Package for Handling Missing Values in the Stochastic Block Model

The Stochastic Block Model (SBM) is a popular probabilistic model for ra...
research
02/25/2021

Variational Selective Autoencoder: Learning from Partially-Observed Heterogeneous Data

Learning from heterogeneous data poses challenges such as combining data...

Please sign up or login with your details

Forgot password? Click here to reset