Spectral Simplicial Theory for Feature Selection and Applications to Genomics

11/08/2018
by   Kiya W. Govek, et al.
0

The scale and complexity of modern data sets and the limitations associated with testing large numbers of hypotheses underline the need for feature selection methods. Spectral techniques rank features according to their degree of consistency with an underlying metric structure, but their current graph-based formulation restricts their applicability to point features. We extend spectral methods for feature selection to abstract simplicial complexes and present a general framework which can be applied to 2-point and higher-order features. Combinatorial Laplacian scores take into account the topology spanned by the data and reduce to the ordinary Laplacian score in the case of point features. We demonstrate the utility of spectral simplicial methods for feature selection with several examples of application to the analysis of gene expression and multi-modal genomic data. Our results provide a unifying perspective on topological data analysis and manifold learning approaches.

READ FULL TEXT
research
10/11/2021

Deep Unsupervised Feature Selection by Discarding Nuisance and Correlated Features

Modern datasets often contain large subsets of correlated features and n...
research
07/18/2022

ManiFeSt: Manifold-based Feature Selection for Small Data Sets

In this paper, we present a new method for few-sample supervised feature...
research
03/16/2023

Multi-modal Differentiable Unsupervised Feature Selection

Multi-modal high throughput biological data presents a great scientific ...
research
06/15/2022

Multiscale methods for signal selection in single-cell data

Analysis of single-cell transcriptomics often relies on clustering cells...
research
04/05/2019

A topological data analysis based classification method for multiple measurements

Machine learning models for repeated measurements are limited. Using top...
research
06/01/2017

Statistical Analysis and Parameter Selection for Mapper

In this article, we study the question of the statistical convergence of...
research
07/09/2020

Let the Data Choose its Features: Differentiable Unsupervised Feature Selection

Scientific observations often consist of a large number of variables (fe...

Please sign up or login with your details

Forgot password? Click here to reset