Co-Clustering via Information-Theoretic Markov Aggregation

01/02/2018
by   Clemens Bloechl, et al.
0

We present an information-theoretic cost function for co-clustering, i.e., for simultaneous clustering of two sets based on similarities between their elements. By constructing a simple random walk on the corresponding bipartite graph, our cost function is derived from a recently proposed generalized framework for information-theoretic Markov chain aggregation. The goal of our cost function is to minimize relevant information loss, hence it connects to the information bottleneck formalism. Moreover, via the connection to Markov aggregation, our cost function is not ad hoc, but inherits its justification from the operational qualities associated with the corresponding Markov aggregation problem. We furthermore show that, for appropriate parameter settings, our cost function is identical to well-known approaches from the literature, such as Information-Theoretic Co-Clustering of Dhillon et al. Hence, understanding the influence of this parameter admits a deeper understanding of the relationship between previously proposed information-theoretic cost functions. We illustrate the performance of our cost function, optimized with a simple sequential heuristic, on several synthetic and real-world data sets, including the Newsgroup20 and the MovieLens100k data sets.

READ FULL TEXT

page 8

page 12

research
12/17/2021

Semi-Supervised Clustering via Markov Chain Aggregation

We connect the problem of semi-supervised clustering to constrained Mark...
research
04/19/2015

Compression and the origins of Zipf's law of abbreviation

Languages across the world exhibit Zipf's law of abbreviation, namely mo...
research
08/13/2021

An Information-theoretic Perspective of Hierarchical Clustering

A combinatorial cost function for hierarchical clustering was introduced...
research
04/01/2016

The deterministic information bottleneck

Lossy compression and clustering fundamentally involve a decision about ...
research
05/11/2020

A Game Theoretic Approach for Parking Spot Search with Limited Parking Lot Information

We propose a game theoretic approach to address the problem of searching...
research
05/10/2020

A Robust Matching Pursuit Algorithm Using Information Theoretic Learning

Current orthogonal matching pursuit (OMP) algorithms calculate the corre...
research
05/28/2019

Information-Theoretic Registration with Explicit Reorientation of Diffusion-Weighted Images

We present an information-theoretic approach to registration of DWI with...

Please sign up or login with your details

Forgot password? Click here to reset