DeepAI AI Chat
Log In Sign Up

Searching for significant patterns in stratified data

by   Felipe Llinares-Lopez, et al.
ETH Zurich

Significant pattern mining, the problem of finding itemsets that are significantly enriched in one class of objects, is statistically challenging, as the large space of candidate patterns leads to an enormous multiple testing problem. Recently, the concept of testability was proposed as one approach to correct for multiple testing in pattern mining while retaining statistical power. Still, these strategies based on testability do not allow one to condition the test of significance on the observed covariates, which severely limits its utility in biomedical applications. Here we propose a strategy and an efficient algorithm to perform significant pattern mining in the presence of categorical covariates with K states.


page 1

page 2

page 3

page 4


Significant Subgraph Mining with Multiple Testing Correction

The problem of finding itemsets that are statistically significantly enr...

Statistically Significant Pattern Mining with Ordinal Utility

Statistically significant patterns mining (SSPM) is an essential and cha...

Statistically Significant Discriminative Patterns Searching

Discriminative pattern mining is an essential task of data mining. This ...

Fast and Memory-Efficient Significant Pattern Mining via Permutation Testing

We present a novel algorithm, Westfall-Young light, for detecting patter...

Selective Inference Approach for Statistically Sound Predictive Pattern Mining

Discovering statistically significant patterns from databases is an impo...

Efficient Evaluation of the Number of False Alarm Criterion

This paper proposes a method for computing efficiently the significance ...

Scaling pattern mining through non-overlapping variable partitioning

Biclustering algorithms play a central role in the biotechnological and ...