Statistically Significant Discriminative Patterns Searching

by Hoang Son Pham, et al.

Discriminative pattern mining is an essential task in data mining. It aims to discover patterns that occur more frequently in one class than in the other classes of a class-labeled dataset. Such patterns are valuable in various domains, such as bioinformatics and data classification. In this paper, we propose a novel algorithm, named SSDPS, to discover patterns in two-class datasets. The SSDPS algorithm owes its efficiency to an original strategy for enumerating the patterns, which allows it to exploit some degree of anti-monotonicity in the measures of discriminance and statistical significance. Experimental results demonstrate that SSDPS performs better than other algorithms. In addition, it generates far fewer patterns than the other algorithms do. Experiments on real data also show that SSDPS efficiently detects combinations of multiple SNPs in genetic data.
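To make the task concrete, the sketch below scores a single candidate pattern on a toy two-class dataset. It is not the SSDPS algorithm, and the abstract does not say which measures SSDPS uses: the support difference and the one-sided Fisher exact test are assumptions chosen only for illustration, and the function names and the toy data are hypothetical.

```python
from math import comb


def support(pattern, transactions):
    """Count the transactions that contain every item of `pattern`."""
    items = set(pattern)
    return sum(1 for t in transactions if items <= set(t))


def support_difference(a, b, n1, n2):
    """Illustrative discriminance score: relative support in class 1 minus class 2."""
    return a / n1 - b / n2


def fisher_one_sided(a, b, n1, n2):
    """One-sided Fisher exact p-value (assumed significance measure).

    a, b   : supports of the pattern in class 1 and class 2
    n1, n2 : number of transactions in class 1 and class 2
    Returns the probability, under the hypergeometric null, of seeing
    at least `a` of the a + b occurrences fall in class 1.
    """
    k = a + b
    total = comb(n1 + n2, k)
    upper = min(k, n1)
    tail = sum(comb(n1, i) * comb(n2, k - i) for i in range(a, upper + 1))
    return tail / total


# Hypothetical toy data: each transaction is a set of items.
class1 = [{"a", "b"}, {"a", "b", "c"}, {"a", "b"}, {"b", "c"}]
class2 = [{"a"}, {"c"}, {"b", "c"}, {"a", "c"}]

pattern = {"a", "b"}
a = support(pattern, class1)
b = support(pattern, class2)
score = support_difference(a, b, len(class1), len(class2))
p_value = fisher_one_sided(a, b, len(class1), len(class2))
print(f"support: {a} vs {b}, difference: {score:.2f}, p-value: {p_value:.4f}")
```

A pattern miner would evaluate many such candidates, so in practice the p-values would also need a multiple-testing correction; that step is omitted here.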




Towards Efficient Discriminative Pattern Mining in Hybrid Domains

Discriminative pattern mining is a data mining task in which we find pat...

Statistically Significant Pattern Mining with Ordinal Utility

Statistically significant patterns mining (SSPM) is an essential and cha...

Towards Top-K Non-Overlapping Sequential Patterns

Sequential pattern mining (SPM) has excellent prospects and application ...

Searching for significant patterns in stratified data

Significant pattern mining, the problem of finding itemsets that are sig...

A genetic algorithm to discover flexible motifs with support

Finding repeated patterns or motifs in a time series is an important uns...

Redescription Model Mining

This paper introduces Redescription Model Mining, a novel approach to id...

Improving Scalability of Contrast Pattern Mining for Network Traffic Using Closed Patterns

Contrast pattern mining (CPM) aims to discover patterns whose support in...
