Depth Normalization of Small RNA Sequencing: Using Data and Biology to Select a Suitable Method

01/13/2022
by Yannick Düren, et al.

Deep sequencing has become one of the most popular tools for transcriptome profiling in biomedical studies. While an abundance of computational methods exists for "normalizing" sequencing data to remove unwanted between-sample variation due to experimental handling, there is no consensus on which normalization method is most suitable for a given data set. To address this problem, we developed "DANA", an approach for assessing the performance of normalization methods for microRNA sequencing data based on biology-motivated and data-driven metrics. Our approach takes advantage of two well-known biological features of microRNAs, their expression patterns and their chromosomal clustering, to simultaneously assess (1) how effectively normalization removes handling artifacts, and (2) how well normalization preserves biological signals. With DANA, we confirm that the performance of eight commonly used normalization methods varies widely across different data sets, and we provide guidance for selecting a suitable method for the data at hand. Hence, DANA should be adopted as a routine preprocessing step (preceding normalization) for microRNA sequencing data analysis. DANA is implemented in R and publicly available at https://github.com/LXQin/DANA.
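To make the notion of depth normalization concrete, the following is a minimal Python sketch of one simple and widely used scheme, total-count scaling to counts per million. It is not DANA and not taken from the DANA code base. The function name `counts_per_million` and the toy count matrix are hypothetical and serve only as an illustration. This passage does not say whether total-count scaling is among the eight methods evaluated.

```python
def counts_per_million(counts):
    """Scale each sample (row) so that its counts sum to one million.

    `counts` is a list of samples; each sample is a list of raw read
    counts, one per microRNA. This is plain total-count scaling, shown
    only as an example of a depth normalization.
    """
    normalized = []
    for sample in counts:
        depth = sum(sample)  # sequencing depth of this sample
        if depth == 0:
            raise ValueError("sample has zero total reads")
        normalized.append([c * 1_000_000 / depth for c in sample])
    return normalized


# Hypothetical toy data: two samples (rows), three microRNAs (columns).
# The second sample was sequenced ten times deeper than the first.
toy_counts = [
    [10, 30, 60],
    [100, 300, 600],
]

cpm = counts_per_million(toy_counts)
```

In this toy case the two samples differ only in depth, so they become identical after scaling. Real data sets rarely behave this cleanly, which is why the choice of normalization method needs to be assessed per data set.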
