Weak Signal Detection via Displacement Interpolation

05/24/2023
by   YoonHaeng Hur, et al.
0

Detecting weak, systematic signals hidden in a large collection of p-values published in academic journals is instrumental to identifying and understanding publication bias and p-value hacking in social and economic sciences. Given two probability distributions P (null) and Q (signal), we study the problem of detecting weak signals from the null P based on n independent samples: we model weak signals via displacement interpolation between P and Q, where the signal strength vanishes with n. We propose a hypothesis testing procedure based on the Wasserstein distance from optimal transport theory, derive sharp conditions under which detection is possible, and provide the exact characterization of the asymptotic Type I and Type II errors at the detection boundary using empirical processes. Applying our testing procedure to real data sets on published p-values across academic journals, we demonstrate that a rigorous testing procedure can detect weak signals that are otherwise indistinguishable.

READ FULL TEXT
research
02/15/2021

On the Inability of the Higher Criticism to Detect Rare/Weak Departures

Consider a multiple hypothesis testing setting involving rare/weak featu...
research
06/19/2020

Optimality of the max test for detecting sparse signals with Gaussian or heavier tail

A fundamental problem in high-dimensional testing is that of global null...
research
02/07/2023

Phase Transitions in the Detection of Correlated Databases

We study the problem of detecting the correlation between two Gaussian d...
research
03/06/2021

Log-Chisquared P-values under Rare and Weak Departures

Consider a multiple hypothesis testing setting in which only a small pro...
research
10/31/2022

SIMPLE-RC: Group Network Inference with Non-Sharp Nulls and Weak Signals

Large-scale network inference with uncertainty quantification has import...
research
04/05/2019

Spatial CUSUM for Signal Region Detection

Detecting weak clustered signal in spatial data is important but challen...
research
02/12/2018

Detecting weak signals by combining small P-values in observational studies with multiple testing

Human health is affected by multiple risk factors. Studies may focus on ...

Please sign up or login with your details

Forgot password? Click here to reset