An Empirical Bayes Regression for Multi-tissue eQTL Data Analysis

11/25/2022
by   Fei Xue, et al.
0

The Genotype-Tissue Expression (GTEx) project collects samples from multiple human tissues to study the relationship between genetic variation or single nucleotide polymorphisms (SNPs) and gene expression in each tissue. However, most existing eQTL analyses only focus on single tissue information. In this paper, we develop a multi-tissue eQTL analysis that improves the single tissue cis-SNP gene expression association analysis by borrowing information across tissues. Specifically, we propose an empirical Bayes regression model for SNP-expression association analysis using data across multiple tissues. To allow the effects of SNPs to vary greatly among tissues, we use a mixture distribution as the prior, which is a mixture of a multivariate Gaussian distribution and a Dirac mass at zero. The model allows us to assess the cis-SNP gene expression association in each tissue by calculating the Bayes factors. We show that the proposed estimator of the cis-SNP effects on gene expression achieves the minimum Bayes risk among all estimators. Analyses of the GTEx data show that our proposed method is superior to traditional simple regression methods in terms of predicting accuracy for gene expression levels using cis-SNPs in testing data sets. Moreover, we find that although genetic effects on expression are extensively shared among tissues, effect sizes still vary greatly across tissues.

READ FULL TEXT

page 12

page 13

page 14

page 16

research
01/23/2020

A covariance-enhanced approach to multi-tissue joint eQTL mapping with application to transcriptome-wide association studies

Transcriptome-wide association studies based on genetically predicted ge...
research
03/04/2015

Sparse multi-view matrix factorisation: a multivariate approach to multiple tissue comparisons

Gene expression levels in a population vary extensively across tissues. ...
research
12/19/2018

Covariance-based sample selection for heterogenous data: Applications to gene expression and autism risk gene detection

Risk for autism can be influenced by genetic mutations in hundreds of ge...
research
07/18/2018

Detecting strong signals in gene perturbation experiments: An adaptive approach with power guarantee and FDR control

The perturbation of a transcription factor should affect the expression ...
research
11/21/2008

Entropy inference and the James-Stein estimator, with application to nonlinear gene association networks

We present a procedure for effective estimation of entropy and mutual in...
research
05/06/2019

Machine Learning to Predict Developmental Neurotoxicity with High-throughput Data from 2D Bio-engineered Tissues

There is a growing need for fast and accurate methods for testing develo...
research
10/11/2017

An Empirical Bayes Approach to Regularization Using Previously Published Models

This manuscript proposes a novel empirical Bayes technique for regulariz...

Please sign up or login with your details

Forgot password? Click here to reset