Graph Compression with Application to Model Selection

10/02/2021
by Mojtaba Abolfazli, et al.

Many multivariate data sets, such as social and biological data, exhibit complex dependencies that are best characterized by graphs. Unlike sequential data, graphs are, in general, unordered structures, so classic compression methods designed for sequences cannot be applied to them directly. New methods for graph compression are therefore needed. In this paper, we present universal source coding methods for the lossless compression of unweighted, undirected, unlabelled graphs. We encode in two steps: 1) transforming the graph into a rooted binary tree, and 2) encoding the rooted binary tree using graph statistics. Our coders showed better compression performance than other source coding methods on both synthetic and real-world graphs. We then applied our graph coding methods to model selection of Gaussian graphical models using the minimum description length (MDL) principle, by finding the description length of the conditional independence graph. Experiments on synthetic data show that our approach performs better than common model selection methods. We also applied our approach to electrocardiogram (ECG) data in order to explore the differences between the graph models of two groups of subjects.
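To make the MDL idea in the abstract concrete, here is a minimal sketch of how a graph description length can enter model selection. It is not the tree-based coder described above: it assumes a much simpler enumerative code for a labelled graph (first send the edge count, then the index of the edge set among all sets of that size). The names `graph_bits` and `select_by_mdl` are hypothetical, and the data code lengths are assumed to be supplied by the caller, for example as a negative log-likelihood in bits.

```python
from math import comb, log2


def graph_bits(n_nodes, n_edges):
    """Bits to describe a labelled, undirected simple graph (assumed enumerative code)."""
    max_edges = n_nodes * (n_nodes - 1) // 2
    if not 0 <= n_edges <= max_edges:
        raise ValueError("n_edges must lie between 0 and n_nodes*(n_nodes-1)/2")
    # Part 1: the edge count, coded uniformly over 0..max_edges.
    bits_for_count = log2(max_edges + 1)
    # Part 2: which of the C(max_edges, n_edges) possible edge sets is present.
    bits_for_edge_set = log2(comb(max_edges, n_edges))
    return bits_for_count + bits_for_edge_set


def select_by_mdl(n_nodes, candidates):
    """Return the index of the candidate with the smallest total description length.

    candidates: list of (n_edges, data_bits) pairs, where data_bits is the
    caller-supplied code length of the data given that candidate graph.
    """
    totals = [graph_bits(n_nodes, k) + data_bits for k, data_bits in candidates]
    return min(range(len(totals)), key=totals.__getitem__)
```

A candidate graph is preferred only if the bits it saves on the data exceed the extra bits needed to describe the graph itself. A better graph coder, such as the one the paper proposes, would change `graph_bits` and therefore where that trade-off falls.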

research
02/04/2021

Graph Coding for Model Selection and Anomaly Detection in Gaussian Graphical Models

A classic application of description length is for model selection with ...
research
10/17/2021

Graph Wedgelets: Adaptive Data Compression on Graphs based on Binary Wedge Partitioning Trees and Geometric Wavelets

We introduce graph wedgelets - a tool for data compression on graphs bas...
research
05/29/2021

Graph Similarity Description: How Are These Graphs Similar?

How do social networks differ across platforms? How do information netwo...
research
04/17/2013

A Junction Tree Framework for Undirected Graphical Model Selection

An undirected graphical model is a joint probability distribution define...
research
01/13/2023

A Universal Low Complexity Compression Algorithm for Sparse Marked Graphs

Many modern applications involve accessing and processing graphical data...
research
06/25/2021

Graph model selection by edge probability sequential inference

Graphs are widely used for describing systems made up of many interactin...
research
08/21/2019

Minimum Description Length Revisited

This is an up-to-date introduction to and overview of the Minimum Descri...
