k-Sliced Mutual Information: A Quantitative Study of Scalability with Dimension

06/17/2022
by   Ziv Goldfeld, et al.
0

Sliced mutual information (SMI) is defined as an average of mutual information (MI) terms between one-dimensional random projections of the random variables. It serves as a surrogate measure of dependence to classic MI that preserves many of its properties but is more scalable to high dimensions. However, a quantitative characterization of how SMI itself and estimation rates thereof depend on the ambient dimension, which is crucial to the understanding of scalability, remain obscure. This works extends the original SMI definition to k-SMI, which considers projections to k-dimensional subspaces, and provides a multifaceted account on its dependence on dimension. Using a new result on the continuity of differential entropy in the 2-Wasserstein metric, we derive sharp bounds on the error of Monte Carlo (MC)-based estimates of k-SMI, with explicit dependence on k and the ambient dimension, revealing their interplay with the number of samples. We then combine the MC integrator with the neural estimation framework to provide an end-to-end k-SMI estimator, for which optimal convergence rates are established. We also explore asymptotics of the population k-SMI as dimension grows, providing Gaussian approximation results with a residual that decays under appropriate moment bounds. Our theory is validated with numerical experiments and is applied to sliced InfoGAN, which altogether provide a comprehensive quantitative account of the scalability question of k-SMI, including SMI as a special case when k=1.

READ FULL TEXT
research
10/11/2021

Sliced Mutual Information: A Scalable Measure of Statistical Dependence

Mutual information (MI) is a fundamental measure of statistical dependen...
research
10/17/2022

Statistical, Robustness, and Computational Guarantees for Sliced Wasserstein Distances

Sliced Wasserstein distances preserve properties of classic Wasserstein ...
research
09/29/2021

Finite-State Mutual Dimension

In 2004, Dai, Lathrop, Lutz, and Mayordomo defined and investigated the ...
research
07/28/2021

Limit Distribution Theory for the Smooth 1-Wasserstein Distance with Applications

The smooth 1-Wasserstein distance (SWD) W_1^σ was recently proposed as a...
research
06/29/2021

Fast Approximation of the Sliced-Wasserstein Distance Using Concentration of Random Projections

The Sliced-Wasserstein distance (SW) is being increasingly used in machi...
research
12/13/2020

Pseudo-likelihood-based M-estimation of random graphs with dependent edges and parameter vectors of increasing dimension

An important question in statistical network analysis is how to estimate...
research
06/27/2012

Ranking by Dependence - A Fair Criteria

Estimating the dependences between random variables, and ranking them ac...

Please sign up or login with your details

Forgot password? Click here to reset