The quantification of Simpson's paradox and other contributions to contingency table theory

by Friedrich Teuscher, et al.

The analysis of contingency tables is a powerful statistical tool used in experiments with categorical variables. This study improves parts of the theory underlying the use of contingency tables. Specifically, applying the linkage disequilibrium parameter, a measure of two-way interactions, to three-way tables makes it possible to quantify Simpson's paradox by a simple formula. Among tests for three-way interactions, the only one available determines whether the partial interactions of all variables agree or whether there is at least one variable whose partial interactions disagree. To date, no test has been available that determines whether the partial interactions of a particular variable agree or disagree; the presented work closes this gap. This work also reveals the relation between the multiplicative and the additive measures of a three-way interaction. Another contribution addresses the question of which cells in a contingency table are fixed when the first- and second-order marginal totals are given. The proposed procedure detects not only fixed zero counts but also fixed positive counts, which affects the determination of the degrees of freedom. Furthermore, limitations of methods that simulate contingency tables with given pairwise associations are addressed.
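To make the idea of quantifying Simpson's paradox concrete, the following is a minimal sketch, not the formula from the paper (which the abstract does not state). It assumes the usual definition of the linkage disequilibrium parameter for a 2 x 2 table, D = p00 - p0. * p.0, and uses the law of total covariance to split the marginal D of a 2 x 2 x 2 table into a weighted average of the partial D values plus a between-strata term. The function name ld, the variable names, and the counts (a widely used textbook illustration of the paradox with two treatments, two outcomes, and two strata) are assumptions chosen for illustration.

```python
# Sketch only: assumes D = p00 - p0. * p.0 as the two-way interaction measure.
# The decomposition is the law of total covariance, not necessarily the
# formula derived in the paper.

def ld(table):
    """Linkage disequilibrium D for a 2x2 table of counts [[n00, n01], [n10, n11]]."""
    n = sum(sum(row) for row in table)
    p00 = table[0][0] / n
    row0 = (table[0][0] + table[0][1]) / n  # share of row 0
    col0 = (table[0][0] + table[1][0]) / n  # share of column 0
    return p00 - row0 * col0

# Illustrative counts: rows = treatment (A, B), columns = (success, failure),
# one 2x2 partial table per stratum of the third variable.
strata = {
    "small": [[81, 6], [234, 36]],
    "large": [[192, 71], [55, 25]],
}

# Marginal table: sum the partial tables over the third variable.
marginal = [[sum(t[i][j] for t in strata.values()) for j in range(2)]
            for i in range(2)]
total = sum(sum(row) for row in marginal)

d_partial = {k: ld(t) for k, t in strata.items()}
d_marginal = ld(marginal)

# Stratum weights and stratum-specific row/column shares.
size = {k: sum(sum(row) for row in t) for k, t in strata.items()}
weight = {k: size[k] / total for k in strata}
row_share = {k: (t[0][0] + t[0][1]) / size[k] for k, t in strata.items()}
col_share = {k: (t[0][0] + t[1][0]) / size[k] for k, t in strata.items()}
mean_row = sum(weight[k] * row_share[k] for k in strata)
mean_col = sum(weight[k] * col_share[k] for k in strata)

# Within-strata part: weighted average of the partial D values.
within = sum(weight[k] * d_partial[k] for k in strata)
# Between-strata part: covariance of the row and column shares across strata.
between = sum(weight[k] * (row_share[k] - mean_row) * (col_share[k] - mean_col)
              for k in strata)

print("partial D:", d_partial)
print("marginal D:", d_marginal)
print("within + between:", within + between)
```

In this illustration both partial D values are positive while the marginal D is negative. The sign reversal is driven entirely by the between-strata term, which is one way to attach a number to the paradox under the stated assumptions.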




