Accurate p-Value Calculation for Generalized Fisher's Combination Tests Under Dependence

by Hong Zhang, et al.

Combining dependent tests of significance has broad applications, but the p-value calculation is challenging. Current moment-matching methods (e.g., Brown's approximation) for Fisher's combination test tend to substantially inflate the type I error rate at significance levels below 0.05. This inflation can lead to many false discoveries in big data analyses. This paper provides several more accurate and computationally efficient p-value calculation methods for a general family of Fisher-type statistics, referred to as the GFisher. The GFisher covers Fisher's combination, Good's statistic, Lancaster's statistic, the weighted Z-score combination, etc. It allows a flexible weighting scheme, as well as an omnibus procedure that automatically adapts the weights and degrees of freedom to the given data. The new p-value calculation methods are based on novel ideas of moment-ratio matching and joint-distribution surrogating. Systematic simulations show that they are accurate under the multivariate Gaussian distribution, and robust under the generalized linear model and the multivariate t-distribution, down to at least the 10^-6 level. We illustrate the usefulness of the GFisher and the new p-value calculation methods in analyzing both simulated and real data from gene-based SNP-set association studies in genetics. The relevant computations are implemented in the R package GFisher.
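To make the dependence problem concrete, the following is a minimal Python sketch, not the paper's method and not the GFisher package API. It computes Fisher's combination statistic T = -2 * sum(log p_i), evaluates its p-value with the chi-square reference distribution that is valid only for independent tests, and then estimates by simulation how often that calculation rejects a true null when the underlying Z-scores are equicorrelated. All function names, the equicorrelation model, the one-sided p-values, and the chosen values of k and rho are assumptions made for illustration.

```python
import math
import random


def fisher_statistic(pvalues):
    """Fisher's combination statistic: T = -2 * sum(log p_i)."""
    return -2.0 * sum(math.log(p) for p in pvalues)


def chi2_sf_even_df(x, k):
    """P(chi-square with 2k degrees of freedom > x), closed form for even df."""
    half = x / 2.0
    term = 1.0
    total = 1.0
    for j in range(1, k):
        term *= half / j          # (x/2)^j / j!
        total += term
    return math.exp(-half) * total


def fisher_pvalue_independent(pvalues):
    """Combined p-value, correct only when the input tests are independent."""
    return chi2_sf_even_df(fisher_statistic(pvalues), len(pvalues))


def one_sided_pvalue(z):
    """Upper-tail p-value of a standard normal Z-score."""
    return 0.5 * math.erfc(z / math.sqrt(2.0))


def empirical_type1(k, rho, alpha, n_sim, seed=0):
    """Rejection rate of the independence-based Fisher test under the null,
    with k equicorrelated Z-scores (assumed correlation rho >= 0)."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_sim):
        shared = rng.gauss(0.0, 1.0)  # common factor inducing correlation
        pvals = []
        for _ in range(k):
            z = math.sqrt(rho) * shared + math.sqrt(1.0 - rho) * rng.gauss(0.0, 1.0)
            pvals.append(max(one_sided_pvalue(z), 1e-300))  # guard against log(0)
        if fisher_pvalue_independent(pvals) < alpha:
            rejections += 1
    return rejections / n_sim


if __name__ == "__main__":
    for rho in (0.0, 0.5):
        rate = empirical_type1(k=10, rho=rho, alpha=0.05, n_sim=20000)
        print(f"rho={rho}: empirical type I error at alpha=0.05 is {rate:.4f}")
```

With rho = 0 the rejection rate should sit near the nominal level, while positive correlation pushes it well above, which is why the null distribution of the statistic has to account for the dependence structure.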




