Selective inference for the problem of regions via multiscale bootstrap

11/02/2017
by   Yoshikazu Terada, et al.
0

Selective inference procedures are considered for computing approximately unbiased p-values of hypothesis testing using nonparametric bootstrap resampling without direct access to the parameter space nor the null distribution. A typical example is to assess the uncertainty of hierarchical clustering, where we can easily compute a frequentist confidence level of each cluster by counting how many times it appears in bootstrap replicates. This is implemented in "pvclust" package of R, and we are going to extend it for selective inference. P-values are computed only for the obtained clusters for testing the null hypothesis that the cluster is not true. This is formulated as the "problem of regions" where hypotheses are represented as arbitrary shaped regions in a parameter space. Geometric quantities, namely, signed distance and mean curvature, determine the frequentist confidence level. Our idea is to estimate these geometric quantities by the multiscale bootstrap in which we change the sample size of bootstrap replicates. Our method is second-order accurate in the large sample theory of smooth boundary surfaces of regions, and it is also justified for nonsmooth surfaces. Our p-values are asymptotically equivalent to those of more computationally intensive iterated bootstrap.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/16/2009

Computing p-values of LiNGAM outputs via Multiscale Bootstrap

Structural equation models and Bayesian networks have been widely used t...
research
02/13/2019

Selective Inference for Testing Trees and Edges in Phylogenetics

Selective inference is considered for testing trees and edges in phyloge...
research
08/24/2022

Testing Many and Possibly Singular Polynomial Constraints

We consider the problem of testing a null hypothesis defined by polynomi...
research
06/13/2023

Nonparametric inference on non-negative dissimilarity measures at the boundary of the parameter space

It is often of interest to assess whether a function-valued statistical ...
research
06/24/2022

On the validity of bootstrap uncertainty estimates in the Mallows-Binomial model

The Mallows-Binomial distribution is the first joint statistical model f...
research
01/30/2023

Selective inference for clustering with unknown variance

In many modern statistical problems, the limited available data must be ...
research
03/14/2021

Quantifying uncertainty in spikes estimated from calcium imaging data

In recent years, a number of methods have been proposed to estimate the ...

Please sign up or login with your details

Forgot password? Click here to reset