Information Bottleneck Revisited: Posterior Probability Perspective with Optimal Transport

08/22/2023
by   Lingyi Chen, et al.
0

Information bottleneck (IB) is a paradigm to extract information in one target random variable from another relevant random variable, which has aroused great interest due to its potential to explain deep neural networks in terms of information compression and prediction. Despite its great importance, finding the optimal bottleneck variable involves a difficult nonconvex optimization problem due to the nonconvexity of mutual information constraint. The Blahut-Arimoto algorithm and its variants provide an approach by considering its Lagrangian with fixed Lagrange multiplier. However, only the strictly concave IB curve can be fully obtained by the BA algorithm, which strongly limits its application in machine learning and related fields, as strict concavity cannot be guaranteed in those problems. To overcome the above difficulty, we derive an entropy regularized optimal transport (OT) model for IB problem from a posterior probability perspective. Correspondingly, we use the alternating optimization procedure and generalize the Sinkhorn algorithm to solve the above OT model. The effectiveness and efficiency of our approach are demonstrated via numerical experiments.

READ FULL TEXT
research
08/23/2018

Pathologies in information bottleneck for deterministic supervised learning

Information bottleneck (IB) is a method for extracting information from ...
research
05/11/2023

The Cardinality Bound on the Information Bottleneck Representations is Tight

The information bottleneck (IB) method aims to find compressed represent...
research
08/16/2022

An Optimal Transport Approach to the Computation of the LM Rate

Mismatch capacity characterizes the highest information rate for a chann...
research
12/14/2020

Disentangled Information Bottleneck

The information bottleneck (IB) method is a technique for extracting inf...
research
11/25/2019

The Convex Information Bottleneck Lagrangian

The information bottleneck (IB) problem tackles the issue of obtaining r...
research
11/03/2022

Optimal Compression for Minimizing Classification Error Probability: an Information-Theoretic Approach

We formulate the problem of performing optimal data compression under th...
research
06/05/2021

Principal Bit Analysis: Autoencoding with Schur-Concave Loss

We consider a linear autoencoder in which the latent variables are quant...

Please sign up or login with your details

Forgot password? Click here to reset