Contributions to the Formalization and Extraction of Generic Bases of Association Rules

11/01/2019
by Sadok Ben Yahia, et al.

This thesis presents a detailed study showing that closed itemsets and minimal generators play a key role in concisely representing both frequent itemsets and association rules. These itemsets structure the search space into equivalence classes, each of which gathers the itemsets that appear in the same subset of objects (also called transactions) of the given data. In this respect, we propose lossless reductions of the minimal generator set through a new substitution-based process. Our theoretical results are then extended to the association rule framework in order to reduce the number of retained rules as much as possible without information loss. We also give a thorough formal study of the associated inference mechanism, which makes it possible to derive all redundant association rules from the retained ones. We further explore the disjunctive search space, where itemsets are characterized by their disjunctive supports instead of their conjunctive ones. This exploration is motivated by the fact that, in some applications, such information brings richer knowledge to end users. To obtain a redundancy-free representation of the disjunctive search space, one solution is to select a unique element to represent all itemsets covering the same set of data: two itemsets are equivalent if their respective items cover the same set of data. To this end, we introduce a new operator dedicated to this task. In each induced equivalence class, the minimal elements are called essential itemsets, while the largest one is called the disjunctive closed itemset. The introduced operator is at the root of new concise representations of frequent itemsets. We also exploit the disjunctive search space to derive generalized association rules, which generalize classic rules by offering disjunction and negation connectors between items in addition to conjunction.
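The equivalence classes described in the abstract can be illustrated with a small brute-force sketch. It is not the thesis's algorithm or operator. The toy transaction database and the helper names (`conjunctive_cover`, `disjunctive_cover`, `equivalence_classes`, `summarize`) are all assumptions made for illustration. The sketch takes the conjunctive cover of an itemset to be the transactions containing all of its items, and the disjunctive cover to be the transactions containing at least one of them. It groups itemsets with the same cover, then reports the minimal elements and the largest element of a class.

```python
from itertools import combinations

# Toy transaction database (an assumption for illustration only).
TRANSACTIONS = [
    {"a", "b", "c"},
    {"a", "b"},
    {"a", "c"},
    {"b", "c", "d"},
]
ITEMS = sorted(set().union(*TRANSACTIONS))


def conjunctive_cover(itemset):
    """Indices of transactions containing every item of `itemset`."""
    return frozenset(i for i, t in enumerate(TRANSACTIONS) if set(itemset) <= t)


def disjunctive_cover(itemset):
    """Indices of transactions containing at least one item of `itemset`."""
    return frozenset(i for i, t in enumerate(TRANSACTIONS) if set(itemset) & t)


def equivalence_classes(cover):
    """Group all non-empty itemsets by the cover computed by `cover`."""
    classes = {}
    for size in range(1, len(ITEMS) + 1):
        for combo in combinations(ITEMS, size):
            classes.setdefault(cover(combo), []).append(frozenset(combo))
    return classes


def summarize(members):
    """Return (minimal elements, largest element) of one equivalence class.

    The union of the members has the same cover as each member for both
    cover definitions above, so it is the largest element of the class.
    """
    minimal = [m for m in members if not any(other < m for other in members)]
    largest = frozenset().union(*members)
    return minimal, largest


if __name__ == "__main__":
    for name, cover in [("conjunctive", conjunctive_cover),
                        ("disjunctive", disjunctive_cover)]:
        print(name)
        for cov, members in equivalence_classes(cover).items():
            minimal, largest = summarize(members)
            print("  cover", sorted(cov),
                  "minimal", [sorted(m) for m in minimal],
                  "largest", sorted(largest))
```

On this toy data, the conjunctive class covered only by the last transaction has the single minimal generator {d} and the closed itemset {b, c, d}. The disjunctive class covering transactions 0, 1, and 3 has the essential itemset {b} and the disjunctive closed itemset {b, d}. Enumerating every itemset is exponential in the number of items, so this approach is only suitable for tiny examples.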


