Supervised Quantile Normalisation

06/01/2017
by Marine Le Morvan, et al.

Quantile normalisation is a popular normalisation method for data subject to unwanted variations, such as images, speech, or genomic data. It applies a monotonic transformation to the feature values of each sample so that, after normalisation, the values of every sample follow the same target distribution. However, choosing a "good" target distribution remains largely empirical and heuristic, and is usually done independently of the subsequent analysis of the normalised data. We propose instead to couple the quantile normalisation step with the subsequent analysis, and to optimise the target distribution jointly with the other parameters of the analysis. We illustrate this principle on the problem of estimating a linear model over normalised data, and show that it leads to a particular low-rank matrix regression problem that can be solved efficiently. We illustrate the potential of our method, which we term SUQUAN, on simulated data, images and genomic data, where it outperforms standard quantile normalisation.
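To make the normalisation step concrete, the following is a minimal sketch of standard (unsupervised) quantile normalisation, assuming samples are stored as rows of a NumPy array. The function name `quantile_normalise` and its `target` parameter are hypothetical names chosen for illustration; they are not taken from the paper. The default target used here (the mean of the sorted rows) is one common heuristic choice, and the sketch does not implement SUQUAN itself, which would instead learn the target jointly with the downstream model.

```python
import numpy as np


def quantile_normalise(X, target=None):
    """Replace each row's values by the target quantiles, keeping within-row ranks.

    X      : array of shape (n_samples, n_features), one sample per row.
    target : optional array of n_features values defining the target
             distribution (hypothetical parameter for this sketch).
    """
    X = np.asarray(X, dtype=float)
    if target is None:
        # Heuristic default: average the sorted rows across samples.
        target = np.sort(X, axis=1).mean(axis=0)
    # The target must be non-decreasing so the mapping is monotonic.
    target = np.sort(np.asarray(target, dtype=float))
    # Double argsort gives the rank of each entry within its row;
    # a stable sort breaks ties by original position.
    order = np.argsort(X, axis=1, kind="stable")
    ranks = np.argsort(order, axis=1, kind="stable")
    # The entry with rank k receives the k-th smallest target value.
    return target[ranks]


X = np.array([[5.0, 2.0, 3.0],
              [4.0, 1.0, 6.0]])
X_norm = quantile_normalise(X)
```

After this transformation every row contains exactly the same set of values, only arranged according to that row's original ranking. Passing a different `target` vector changes the shared distribution without changing the ranks, which is the degree of freedom the abstract proposes to optimise.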

