Linear Optimal Transport Embedding: Provable fast Wasserstein distance computation and classification for nonlinear problems

08/20/2020
by   Caroline Moosmüller, et al.
0

Discriminating between distributions is an important problem in a number of scientific fields. This motivated the introduction of Linear Optimal Transportation (LOT), which embeds the space of distributions into an L^2-space. The transform is defined by computing the optimal transport of each distribution to a fixed reference distribution, and has a number of benefits when it comes to speed of computation and to determining classification boundaries. In this paper, we characterize a number of settings in which LOT embeds families of distributions into a space in which they are linearly separable. This is true in arbitrary dimension, and for families of distributions generated through perturbations of shifts and scalings of a fixed distribution. We also prove conditions under which the L^2 distance of the LOT embedding between two distributions in arbitrary dimension is nearly isometric to Wasserstein-2 distance between those distributions. This is of significant computational benefit, as one must only compute N optimal transport maps to define the N^2 pairwise distances between N distributions. We demonstrate the benefits of LOT on a number of distribution classification problems.

READ FULL TEXT

page 12

page 13

page 16

page 17

research
12/22/2021

On a linear Gromov-Wasserstein distance

Gromov-Wasserstein distances are generalization of Wasserstein distances...
research
01/25/2022

Supervised learning of sheared distributions using linearized optimal transport

In this paper we study supervised learning tasks on the space of probabi...
research
08/19/2020

Linearized Optimal Transport for Collider Events

We introduce an efficient framework for computing the distance between c...
research
02/14/2023

Linearized Wasserstein dimensionality reduction with approximation guarantees

We introduce LOT Wassmap, a computationally feasible algorithm to uncove...
research
05/18/2018

Computing Kantorovich-Wasserstein Distances on d-dimensional histograms using (d+1)-partite graphs

This paper presents a novel method to compute the exact Kantorovich-Wass...
research
12/10/2020

Hardness results for Multimarginal Optimal Transport problems

Multimarginal Optimal Transport (MOT) is the problem of linear programmi...
research
04/13/2023

A Natural Copula

Copulas are widely used in financial economics as well as in other areas...

Please sign up or login with your details

Forgot password? Click here to reset