Feature Selection as a Multiagent Coordination Problem

03/16/2016
by   Kleanthis Malialis, et al.

Datasets with hundreds to tens of thousands of features are the new norm. Feature selection is a central problem in machine learning, where the aim is to derive a representative set of features from which to construct a classification (or prediction) model for a specific task. Our experimental study involves microarray gene expression datasets: high-dimensional, noisy datasets containing genetic data that are typically used to distinguish between benign and malignant tissues or to classify different types of cancer. In this paper, we formulate feature selection as a multiagent coordination problem and propose a novel feature selection method using multiagent reinforcement learning. The central idea of the proposed approach is to "assign" a reinforcement learning agent to each feature, so that each agent learns to control a single feature; we refer to this approach as MARL. Applying this to microarray datasets creates an enormous multiagent coordination problem among thousands of learning agents. To address the scalability challenge, we apply a form of reward shaping called CLEAN rewards. We compare nine feature selection methods in total, including state-of-the-art methods, and show that the proposed method using CLEAN rewards scales up significantly and thereby outperforms the other learning-based methods. We further show that a hybrid variant of MARL achieves the best overall performance.
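To make the one-agent-per-feature idea concrete, the following is a minimal sketch, not the paper's implementation. Each agent is a stateless learner with two actions (exclude = 0, include = 1) and keeps one value estimate per action. The names `toy_score` and `select_features`, the learning parameters, and the toy scoring function (a stand-in for classifier accuracy) are all assumptions made for illustration. The shaped reward is a simple counterfactual difference, "what would the global score be if only this agent deviated from the greedy joint action"; it is chosen here for illustration and is not necessarily the CLEAN formulation used in the paper.

```python
import random


def toy_score(mask, informative):
    """Hypothetical stand-in for classifier accuracy on the selected features.

    Rewards each informative feature that is selected and applies a small
    penalty for every other selected feature.
    """
    hits = sum(1 for i in informative if mask[i])
    extras = sum(mask) - hits
    return hits / len(informative) - 0.01 * extras


def select_features(n_features, score, episodes=300, alpha=0.1,
                    epsilon=0.2, seed=0):
    """One stateless agent per feature; actions are 0 (exclude) or 1 (include)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_features)]  # q[i][a]: agent i's value of action a

    for _ in range(episodes):
        # All agents act greedily, so the executed joint action is free of
        # exploration noise (ties default to "exclude").
        joint = [1 if qi[1] > qi[0] else 0 for qi in q]
        base = score(joint)

        for i, qi in enumerate(q):
            # Each agent privately considers an epsilon-greedy action.
            a = rng.randrange(2) if rng.random() < epsilon else joint[i]
            if a == joint[i]:
                shaped = 0.0  # no deviation, so no counterfactual change
            else:
                trial = joint[:]  # copy of the joint action with agent i deviating
                trial[i] = a
                shaped = score(trial) - base
            # Move the estimate for the considered action toward the shaped reward.
            qi[a] += alpha * (shaped - qi[a])

    return [i for i, qi in enumerate(q) if qi[1] > qi[0]]


# Toy problem: 30 features, 5 of which are informative (indices are arbitrary).
informative = {2, 5, 11, 17, 23}
selected = select_features(30, lambda m: toy_score(m, informative))
print(selected)
```

In this sketch every agent evaluates its own deviation against the same greedy joint action, so one agent's exploration does not disturb another agent's learning signal. The cost is one extra score evaluation per deviating agent per episode, which would be expensive if the score were a real classifier trained on thousands of features.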


