Robust Estimation under the Wasserstein Distance

02/02/2023
by   Sloan Nietert, et al.
0

We study the problem of robust distribution estimation under the Wasserstein metric, a popular discrepancy measure between probability distributions rooted in optimal transport (OT) theory. We introduce a new outlier-robust Wasserstein distance 𝖶_p^ε which allows for ε outlier mass to be removed from its input distributions, and show that minimum distance estimation under 𝖶_p^ε achieves minimax optimal robust estimation risk. Our analysis is rooted in several new results for partial OT, including an approximate triangle inequality, which may be of independent interest. To address computational tractability, we derive a dual formulation for 𝖶_p^ε that adds a simple penalty term to the classic Kantorovich dual objective. As such, 𝖶_p^ε can be implemented via an elementary modification to standard, duality-based OT solvers. Our results are extended to sliced OT, where distributions are projected onto low-dimensional subspaces, and applications to homogeneity and independence testing are explored. We illustrate the virtues of our framework via applications to generative modeling with contaminated datasets.

READ FULL TEXT

page 11

page 22

page 23

research
11/02/2021

Outlier-Robust Optimal Transport: Duality, Structure, and Statistical Analysis

The Wasserstein distance, rooted in optimal transport (OT) theory, is a ...
research
06/18/2020

When OT meets MoM: Robust estimation of Wasserstein Distance

Issued from Optimal Transport, the Wasserstein distance has gained impor...
research
09/16/2019

Estimation of Wasserstein distances in the Spiked Transport Model

We propose a new statistical model, the spiked transport model, which fo...
research
04/30/2022

A Simple Duality Proof for Wasserstein Distributionally Robust Optimization

We present a short and elementary proof of the duality for Wasserstein d...
research
07/03/2023

Quantifying Distributional Model Risk in Marginal Problems via Optimal Transport

This paper studies distributional model risk in marginal problems, where...
research
02/09/2023

Outlier-Robust Gromov Wasserstein for Graph Data

Gromov Wasserstein (GW) distance is a powerful tool for comparing and al...
research
09/28/2022

GeONet: a neural operator for learning the Wasserstein geodesic

Optimal transport (OT) offers a versatile framework to compare complex d...

Please sign up or login with your details

Forgot password? Click here to reset