Predictor Variable Prioritization in Nonlinear Models: A Genetic Association Case Study

01/22/2018
by   Lorin Crawford, et al.
0

The central aim in this paper is to address variable selection questions in nonlinear and nonparametric regression. Motivated within the context of statistical genetics, where nonlinear interactions are of particular interest, we introduce a novel and interpretable way to summarize the relative importance of predictor variables. Methodologically, we develop the "RelATive cEntrality" (RATE) measure to prioritize candidate predictors that are not just marginally important, but whose associations also stem from significant covarying relationships with other variables in the data. We focus on illustrating RATE through Bayesian Gaussian process regression; although, the methodological innovations apply to other and more general methods. It is known that nonlinear models often exhibit greater predictive accuracy than linear models, particularly for outcomes generated by complex architectures. With detailed simulations and a botanical QTL mapping study, we show that applying RATE enables an explanation for this improved performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/28/2021

Variable Selection Using Bayesian Additive Regression Trees

Variable selection is an important statistical problem. This problem bec...
research
03/13/2018

A machine learning-based approach for estimating and testing associations with multivariate outcomes

We propose a method for summarizing the strength of association between ...
research
09/08/2023

Generalized Variable Selection Algorithms for Gaussian Process Models by LASSO-like Penalty

With the rapid development of modern technology, massive amounts of data...
research
08/05/2015

Bayesian Approximate Kernel Regression with Variable Selection

Nonlinear kernel regression models are often used in statistics and mach...
research
09/15/2021

Bayesian testing of linear versus nonlinear effects using Gaussian process priors

A Bayes factor is proposed for testing whether the effect of a key predi...
research
02/03/2023

A Simple Approach for Local and Global Variable Importance in Nonlinear Regression Models

The ability to interpret machine learning models has become increasingly...
research
04/22/2016

Developing an ICU scoring system with interaction terms using a genetic algorithm

ICU mortality scoring systems attempt to predict patient mortality using...

Please sign up or login with your details

Forgot password? Click here to reset