Feature Selection for Microarray Gene Expression Data using Simulated Annealing guided by the Multivariate Joint Entropy

02/07/2013
by   Fernando González, et al.
0

In this work a new way to calculate the multivariate joint entropy is presented. This measure is the basis for a fast information-theoretic based evaluation of gene relevance in a Microarray Gene Expression data context. Its low complexity is based on the reuse of previous computations to calculate current feature relevance. The mu-TAFS algorithm --named as such to differentiate it from previous TAFS algorithms-- implements a simulated annealing technique specially designed for feature subset selection. The algorithm is applied to the maximization of gene subset relevance in several public-domain microarray data sets. The experimental results show a notoriously high classification performance and low size subsets formed by biologically meaningful genes.

READ FULL TEXT
research
06/06/2013

Verdict Accuracy of Quick Reduct Algorithm using Clustering and Classification Techniques for Gene Expression Data

In most gene expression data, the number of training samples is very sma...
research
11/03/2021

Multivariate feature ranking of gene expression data

Gene expression datasets are usually of high dimensionality and therefor...
research
03/26/2020

A New Gene Selection Algorithm using Fuzzy-Rough Set Theory for Tumor Classification

In statistics and machine learning, feature selection is the process of ...
research
06/05/2015

Gene selection for cancer classification using a hybrid of univariate and multivariate feature selection methods

Various approaches to gene selection for cancer classification based on ...
research
01/12/2011

Review and Evaluation of Feature Selection Algorithms in Synthetic Problems

The main purpose of Feature Subset Selection is to find a reduced subset...
research
03/15/2017

MapReduce Algorithms for Inferring Gene Regulatory Networks from Time-Series Microarray Data Using an Information-Theoretic Approach

Gene regulation is a series of processes that control gene expression an...
research
10/19/2012

A Distance-Based Branch and Bound Feature Selection Algorithm

There is no known efficient method for selecting k Gaussian features fro...

Please sign up or login with your details

Forgot password? Click here to reset