Differential analysis in Transcriptomic: The strength of randomly picking 'reference' genes

03/17/2021
by   Dorota Desaulle, et al.
0

Transcriptomic analysis are characterized by being not directly quantitative and only providing relative measurements of expression levels up to an unknown individual scaling factor. This difficulty is enhanced for differential expression analysis. Several methods have been proposed to circumvent this lack of knowledge by estimating the unknown individual scaling factors however, even the most used one, are suffering from being built on hardly justifiable biological hypotheses or from having weak statistical background. Only two methods withstand this analysis: one based on largest connected graph component hardly usable for large amount of expressions like in NGS, the second based on log-linear fits which unfortunately require a first step which uses one of the methods described before. We introduce a new procedure for differential analysis in the context of transcriptomic data. It is the result of pooling together several differential analyses each based on randomly picked genes used as reference genes. It provides a differential analysis free from the estimation of the individual scaling factors or any other knowledge. Theoretical properties are investigated both in term of FWER and power. Moreover in the context of Poisson or negative binomial modelization of the transcriptomic expressions, we derived a test with non asymptotic control of its bounds. We complete our study by some empirical simulations and apply our procedure to a real data set of hepatic miRNA expressions from a mouse model of non-alcoholic steatohepatitis (NASH), the CDAHFD model. This study on real data provides new hits with good biological explanations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/03/2022

A flexible model for correlated count data, with application to analysis of gene expression differences in multi-condition experiments

Detecting differences in gene expression is an important part of RNA seq...
research
09/02/2017

Adaptive Scaling

Preprocessing data is an important step before any data analysis. In thi...
research
10/06/2019

Scalings for Tokamak Energy Confinement

On the basis of an analysis of the ITER L-mode energy confinement databa...
research
07/28/2017

Review of Machine Learning Algorithms in Differential Expression Analysis

In biological research machine learning algorithms are part of nearly ev...
research
04/29/2018

A Robust Wald-type Test for Testing the Equality of Two Means from Log-Normal Samples

The log-normal distribution is one of the most common distributions used...
research
07/18/2018

Detecting strong signals in gene perturbation experiments: An adaptive approach with power guarantee and FDR control

The perturbation of a transcription factor should affect the expression ...

Please sign up or login with your details

Forgot password? Click here to reset