Discovering Descriptive Tile Trees by Mining Optimal Geometric Subtiles

02/07/2019
by   Nikolaj Tatti, et al.
0

When analysing binary data, the ease at which one can interpret results is very important. Many existing methods, however, discover either models that are difficult to read, or return so many results interpretation becomes impossible. Here, we study a fully automated approach for mining easily interpretable models for binary data. We model data hierarchically with noisy tiles-rectangles with significantly different density than their parent tile. To identify good trees, we employ the Minimum Description Length principle. We propose STIJL, a greedy any-time algorithm for mining good tile trees from binary data. Iteratively, it finds the locally optimal addition to the current tree, allowing overlap with tiles of the same parent. A major result of this paper is that we find the optimal tile in only Θ(NM(N, M)) time. STIJL can either be employed as a top-k miner, or by MDL we can identify the tree that describes the data best. Experiments show we find succinct models that accurately summarise the data, and, by their hierarchical property are easily interpretable.

READ FULL TEXT
research
03/15/2012

Bayesian Rose Trees

Hierarchical structure is ubiquitous in data across many domains. There ...
research
11/21/2019

Vouw: Geometric Pattern Mining using the MDL Principle

We introduce geometric pattern mining, the problem of finding recurring ...
research
10/09/2020

Learning Binary Trees via Sparse Relaxation

One of the most classical problems in machine learning is how to learn b...
research
03/11/2021

Arc-Completion of 2-Colored Best Match Graphs to Binary-Explainable Best Match Graphs

Best match graphs (BMGs) are vertex-colored digraphs that naturally aris...
research
08/30/2022

Unbalancing Binary Trees

Assuming Zipf's Law to be accurate, we show that existing techniques for...
research
07/30/2019

Overlap-free Drawing of Generalized Pythagoras Trees for Hierarchy Visualization

Generalized Pythagoras trees were developed for visualizing hierarchical...
research
03/27/2013

Minimum Error Tree Decomposition

This paper describes a generalization of previous methods for constructi...

Please sign up or login with your details

Forgot password? Click here to reset