The Exact Equivalence of Independence Testing and Two-Sample Testing

10/20/2019
by   Cencheng Shen, et al.
0

Testing independence and testing equality of distributions are two tightly related statistical hypotheses. Several distance and kernel-based statistics are recently proposed to achieve universally consistent testing for either hypothesis. On the distance side, the distance correlation is proposed for independence testing, and the energy statistic is proposed for two-sample testing. On the kernel side, the Hilbert-Schmidt independence criterion is proposed for independence testing and the maximum mean discrepancy is proposed for two-sample testing. In this paper, we show that two-sample testing are special cases of independence testing via an auxiliary label vector, and prove that distance correlation is exactly equivalent to the energy statistic in terms of the population statistic, the sample statistic, and the testing p-value via permutation test. The equivalence can be further generalized to K-sample testing and extended to the kernel regime. As a consequence, it suffices to always use an independence statistic to test equality of distributions, which enables better interpretability of the test statistic and more efficient testing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2018

The Exact Equivalence of Distance and Kernel Methods for Hypothesis Testing

Distance-based methods, also called "energy statistics", are leading met...
research
06/13/2018

Asymptotic hypothesis testing for the colour blind problem

In the classical two-sample problem, the conventional approach for testi...
research
02/11/2015

An Extreme-Value Approach for Testing the Equality of Large U-Statistic Based Correlation Matrices

There has been an increasing interest in testing the equality of large P...
research
09/14/2017

Two-sample Statistics Based on Anisotropic Kernels

The paper introduces a new kernel-based Maximum Mean Discrepancy (MMD) s...
research
04/26/2020

Efficient tests for bio-equivalence in functional data

We study the problem of testing the equivalence of functional parameters...
research
12/27/2019

The Chi-Square Test of Distance Correlation

Distance correlation has gained much recent attention in the statistics ...
research
07/03/2022

Testing Homogeneity: The Trouble with Sparse Functional Data

Testing the homogeneity between two samples of functional data is an imp...

Please sign up or login with your details

Forgot password? Click here to reset