B-tests: Low Variance Kernel Two-Sample Tests

07/08/2013
by Wojciech Zaremba, et al.

A family of maximum mean discrepancy (MMD) kernel two-sample tests is introduced. Members of the test family are called Block-tests or B-tests, since the test statistic is an average over MMDs computed on subsets of the samples. The choice of block size allows control over the tradeoff between test power and computation time. In this respect, the B-test family combines favorable properties of previously proposed MMD two-sample tests: B-tests are more powerful than a linear time test, where blocks are just pairs of samples, yet they are more computationally efficient than a quadratic time test, where a single large block incorporating all the samples is used to compute a U-statistic. A further important advantage of the B-tests is their asymptotically Normal null distribution. This contrasts with the U-statistic, which is degenerate under the null hypothesis and for which estimates of the null distribution are computationally demanding. Recent results on kernel selection for hypothesis testing transfer seamlessly to the B-tests, yielding a means to optimize test power via kernel choice.
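
The block-averaging idea described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the Gaussian kernel, the fixed `bandwidth`, the function names (`gaussian_kernel`, `block_mmd2`, `b_test`), and the use of the empirical variance of the per-block statistics to calibrate the Normal null are all choices made here for illustration. Equal sample sizes and contiguous, non-overlapping blocks are also assumed.

```python
import math
import numpy as np


def gaussian_kernel(a, b, bandwidth):
    """Gaussian kernel matrix between rows of a and rows of b (assumed kernel)."""
    sq_dist = (
        np.sum(a ** 2, axis=1)[:, None]
        + np.sum(b ** 2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    return np.exp(-np.maximum(sq_dist, 0.0) / (2.0 * bandwidth ** 2))


def block_mmd2(x, y, bandwidth):
    """Unbiased MMD^2 U-statistic on one block; x and y have the same length."""
    b = x.shape[0]
    off_diag = ~np.eye(b, dtype=bool)  # exclude terms with identical indices
    kxx = gaussian_kernel(x, x, bandwidth)
    kyy = gaussian_kernel(y, y, bandwidth)
    kxy = gaussian_kernel(x, y, bandwidth)
    total = kxx[off_diag].sum() + kyy[off_diag].sum() - 2.0 * kxy[off_diag].sum()
    return total / (b * (b - 1))


def b_test(x, y, block_size, bandwidth=1.0):
    """Average per-block MMD^2 values and compute a one-sided Normal p-value.

    Assumption: the null standard error is estimated from the sample
    variance of the block statistics.
    """
    n = min(len(x), len(y))
    num_blocks = n // block_size
    if block_size < 2 or num_blocks < 2:
        raise ValueError("need block_size >= 2 and at least 2 full blocks")
    stats = np.array([
        block_mmd2(
            x[i * block_size:(i + 1) * block_size],
            y[i * block_size:(i + 1) * block_size],
            bandwidth,
        )
        for i in range(num_blocks)
    ])
    statistic = stats.mean()
    std_err = stats.std(ddof=1) / math.sqrt(num_blocks)
    z = statistic / std_err
    p_value = 0.5 * math.erfc(z / math.sqrt(2.0))  # upper tail of N(0, 1)
    return statistic, p_value


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(0.0, 1.0, size=(400, 2))
    y = rng.normal(0.5, 1.0, size=(400, 2))
    print(b_test(x, y, block_size=20))
```

With `block_size=2` the sketch reduces to averaging over pairs of samples, and with a single block covering all samples it would reduce to the quadratic-time U-statistic. The Normal approximation used here needs several blocks, which is why the sketch rejects fewer than two.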

research
06/06/2021

Neural Tangent Kernel Maximum Mean Discrepancy

We present a novel neural network Maximum Mean Discrepancy (MMD) statist...
research
10/09/2018

A maximum-mean-discrepancy goodness-of-fit test for censored data

We introduce a kernel-based goodness-of-fit test for censored data, wher...
research
02/17/2020

Estimating the number and effect sizes of non-null hypotheses

We study the problem of estimating the distribution of effect sizes (the...
research
02/24/2020

Optimizing effective numbers of tests by vine copula modeling

In the multiple testing context, we utilize vine copulae for optimizing ...
research
06/18/2022

Efficient Aggregated Kernel Tests using Incomplete U-statistics

We propose a series of computationally efficient, nonparametric tests fo...
research
09/16/2022

Reweighted Anderson-Darling Tests of Goodness-of-Fit

Assessing goodness-of-fit is challenging because theoretically there is ...
research
10/08/2019

On the feasibility of parsimonious variable selection for Hotelling's T^2-test

Hotelling's T^2-test for the mean of a multivariate normal distribution ...
