Sum-of-Squares Relaxations for Information Theory and Variational Inference

06/27/2022
by   Francis Bach, et al.
0

We consider extensions of the Shannon relative entropy, referred to as f-divergences. Three classical related computational problems are typically associated with these divergences: (a) estimation from moments, (b) computing normalizing integrals, and (c) variational inference in probabilistic models. These problems are related to one another through convex duality, and for all them, there are many applications throughout data science, and we aim for computationally tractable approximation algorithms that preserve properties of the original problem such as potential convexity or monotonicity. In order to achieve this, we derive a sequence of convex relaxations for computing these divergences from non-centered covariance matrices associated with a given feature vector: starting from the typically non-tractable optimal lower-bound, we consider an additional relaxation based on ”sums-of-squares”, which is is now computable in polynomial time as a semidefinite program, as well as further computationally more efficient relaxations based on spectral information divergences from quantum information theory. For all of the tasks above, beyond proposing new relaxations, we derive tractable algorithms based on augmented Lagrangians and first-order methods, and we present illustrations on multivariate trigonometric polynomials and functions on the Boolean hypercube.

READ FULL TEXT
research
02/17/2022

Information Theory with Kernel Methods

We consider the analysis of probability distributions through their asso...
research
02/13/2019

On the Convergence of Extended Variational Inference for Non-Gaussian Statistical Models

Variational inference (VI) is a widely used framework in Bayesian estima...
research
06/07/2021

NISQ Algorithm for Semidefinite Programming

Semidefinite Programming (SDP) is a class of convex optimization program...
research
03/12/2018

Variational Inference for Gaussian Process with Panel Count Data

We present the first framework for Gaussian-process-modulated Poisson pr...
research
11/30/2017

Improved Linear Embeddings via Lagrange Duality

Near isometric orthogonal embeddings to lower dimensions are a fundament...
research
09/21/2023

Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling

Connectionist temporal classification (CTC) is commonly adopted for sequ...
research
10/09/2017

Response to "Counterexample to global convergence of DSOS and SDSOS hierarchies"

In a recent note [8], the author provides a counterexample to the global...

Please sign up or login with your details

Forgot password? Click here to reset