Multidimensional Scaling for Gene Sequence Data with Autoencoders

04/19/2021
by Pulasthi Wickramasinghe, et al.

Multidimensional scaling has long played a vital role in analysing gene sequence data to identify clusters and patterns. However, the computational complexity and memory requirements of state-of-the-art dimensional scaling algorithms make them infeasible to scale to large datasets. In this paper we present an autoencoder-based dimensional reduction model that scales easily to datasets containing millions of gene sequences, while attaining results comparable to state-of-the-art MDS algorithms with minimal resource requirements. The model also supports out-of-sample data points with a 99.5 against DAMDS with a real-world fungi gene sequence dataset. The presented results showcase the effectiveness of the autoencoder-based dimension reduction model and its advantages.
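To make the idea in the abstract concrete, the following is a minimal sketch of autoencoder-based dimension reduction with out-of-sample embedding. It is not the architecture from the paper: it assumes a plain linear autoencoder written in NumPy, uses synthetic random points in place of gene sequences, and assumes each item is described by its distances to a fixed set of reference items. All names (`distances_to_references`, `LinearAutoencoder`, the sizes, and the learning rate) are hypothetical choices made for illustration.

```python
import numpy as np


def distances_to_references(points, references):
    """Euclidean distance from every point to every reference point."""
    diff = points[:, None, :] - references[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


class LinearAutoencoder:
    """Illustrative linear autoencoder (an assumption, not the paper's model)."""

    def __init__(self, input_dim, latent_dim, seed=0):
        rng = np.random.default_rng(seed)
        self.w_enc = rng.normal(0.0, 0.1, (input_dim, latent_dim))
        self.b_enc = np.zeros(latent_dim)
        self.w_dec = rng.normal(0.0, 0.1, (latent_dim, input_dim))
        self.b_dec = np.zeros(input_dim)

    def encode(self, x):
        # Map distance rows to low-dimensional coordinates.
        return x @ self.w_enc + self.b_enc

    def decode(self, z):
        # Reconstruct the distance rows from the coordinates.
        return z @ self.w_dec + self.b_dec

    def fit(self, x, epochs=1000, lr=0.5):
        """Full-batch gradient descent on the mean squared reconstruction error."""
        history = []
        for _ in range(epochs):
            z = self.encode(x)
            err = self.decode(z) - x
            history.append(float((err ** 2).mean()))
            grad_out = 2.0 * err / err.size      # dLoss / dReconstruction
            grad_z = grad_out @ self.w_dec.T     # back-propagate to the latent code
            self.w_dec -= lr * (z.T @ grad_out)
            self.b_dec -= lr * grad_out.sum(axis=0)
            self.w_enc -= lr * (x.T @ grad_z)
            self.b_enc -= lr * grad_z.sum(axis=0)
        return history


# Synthetic stand-in data: 60 training items, the first 20 act as references.
rng = np.random.default_rng(42)
train_points = rng.normal(size=(60, 8))
references = train_points[:20]

# Centre and scale the distance rows using training statistics only.
train_rows = distances_to_references(train_points, references)
feature_mean = train_rows.mean(axis=0)
scale = train_rows.max()
x_train = (train_rows - feature_mean) / scale

model = LinearAutoencoder(input_dim=20, latent_dim=3)
history = model.fit(x_train)
embedding = model.encode(x_train)        # 3-D coordinates for the training items

# Out-of-sample items only need their distances to the references.
new_points = rng.normal(size=(5, 8))
new_rows = distances_to_references(new_points, references)
new_embedding = model.encode((new_rows - feature_mean) / scale)
```

The point of the sketch is the last step: once the encoder is trained, a new item is embedded with a single forward pass over its distances to the references, without recomputing an embedding for the whole dataset. A practical model would likely use nonlinear layers and real sequence distances in place of the synthetic data assumed here.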

research
11/07/2021

High Performance Out-of-sample Embedding Techniques for Multidimensional Scaling

The recent rapid growth of the dimension of many datasets means that man...
research
07/23/2020

Multidimensional Scaling for Big Data

We present a set of algorithms for Multidimensional Scaling (MDS) to be ...
research
02/24/2022

SQuadMDS: a lean Stochastic Quartet MDS improving global structure preservation in neighbor embedding like t-SNE and UMAP

Multidimensional scaling is a statistical process that aims to embed hig...
research
10/26/2022

Bayesian Hyperbolic Multidimensional Scaling

Multidimensional scaling (MDS) is a widely used approach to representing...
research
06/28/2022

Statistical Depth based Normalization and Outlier Detection of Gene Expression Data

Normalization and outlier detection belong to the preprocessing of gene ...
research
01/09/2008

Toward the Graphics Turing Scale on a Blue Gene Supercomputer

We investigate raytracing performance that can be achieved on a class of...
research
10/24/2018

Modified Multidimensional Scaling and High Dimensional Clustering

Multidimensional scaling is an important dimension reduction tool in sta...
