Wasserstein GANs with Gradient Penalty Compute Congested Transport

09/01/2021
by Tristan Milne, et al.

Wasserstein GANs with Gradient Penalty (WGAN-GP) are a widely used method for training generative models to produce high-quality synthetic data. While WGAN-GP were initially developed to calculate the Wasserstein-1 distance between generated and real data, recent works (e.g., Stanczuk et al. (2021)) have provided empirical evidence that this does not occur, and have argued that WGAN-GP perform well not in spite of this issue, but because of it. In this paper we show for the first time that WGAN-GP compute the minimum of a different optimal transport problem, the so-called congested transport (Carlier et al. (2008)). Congested transport determines the cost of moving one distribution to another under a transport model that penalizes congestion. For WGAN-GP, we find that the congestion penalty has a spatially varying component, determined by the sampling strategy used in Gulrajani et al. (2017), which acts like a local speed limit and makes congestion cost less in some regions than in others. This aspect of the congested transport problem is new: the congestion penalty turns out to be unbounded and to depend on the distributions being transported, so we provide the necessary mathematical proofs for this setting. We use our discovery to show that the gradients of solutions to the optimization problem in WGAN-GP determine the time-averaged momentum of the optimal mass flow. This is in contrast to the gradients of Kantorovich potentials for the Wasserstein-1 distance, which only determine the normalized direction of flow. Since the training of the generator is based on these gradients, this may explain the success of WGAN-GP, in support of Stanczuk et al. (2021).
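
To make the role of the sampling strategy concrete, the following is a minimal sketch of the gradient penalty term in the style of Gulrajani et al. (2017), not code from the paper. It assumes the usual setup: penalty points are drawn uniformly along straight segments between paired real and generated samples, and the critic is penalized when its gradient norm at those points deviates from 1. The function names, the finite-difference gradient (standing in for automatic differentiation), the toy linear critic, and the weight `lam=10.0` are all illustrative assumptions.

```python
import math
import random


def interpolate(x_real, x_fake, t):
    """Point on the segment between a real and a generated sample."""
    return [t * r + (1.0 - t) * g for r, g in zip(x_real, x_fake)]


def numerical_gradient(critic, x, eps=1e-5):
    """Central finite-difference gradient of a scalar critic at x.

    Assumption: stands in for the autodiff gradient used in practice.
    """
    grad = []
    for i in range(len(x)):
        hi = list(x)
        lo = list(x)
        hi[i] += eps
        lo[i] -= eps
        grad.append((critic(hi) - critic(lo)) / (2.0 * eps))
    return grad


def gradient_penalty(critic, real_batch, fake_batch, lam=10.0, rng=random):
    """Monte Carlo estimate of lam * E[(||grad critic(x_hat)|| - 1)^2].

    x_hat is sampled uniformly on segments joining paired real and
    generated points, so the penalty is only enforced where those
    segments fall.
    """
    total = 0.0
    for x_real, x_fake in zip(real_batch, fake_batch):
        t = rng.random()  # uniform in [0, 1)
        x_hat = interpolate(x_real, x_fake, t)
        grad = numerical_gradient(critic, x_hat)
        grad_norm = math.sqrt(sum(g * g for g in grad))
        total += (grad_norm - 1.0) ** 2
    return lam * total / len(real_batch)


def linear_critic(x):
    """Hypothetical toy critic with constant gradient (3, 4), norm 5."""
    return 3.0 * x[0] + 4.0 * x[1]


real = [[0.0, 0.0], [1.0, 2.0]]
fake = [[1.0, 1.0], [-1.0, 0.5]]
penalty = gradient_penalty(linear_critic, real, fake, rng=random.Random(0))
```

Because the penalty is evaluated only at the sampled points, regions rarely visited by the interpolation segments constrain the critic's gradient weakly. This is one informal way to read the abstract's statement that the sampling strategy induces a spatially varying congestion penalty.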

research
06/08/2019

A gradual, semi-discrete approach to generative network training via explicit wasserstein minimization

This paper provides a simple procedure to fit generative networks to tar...
research
06/15/2022

Kantorovich Strikes Back! Wasserstein GANs are not Optimal Transport?

Wasserstein Generative Adversarial Networks (WGANs) are the popular gene...
research
05/22/2017

From optimal transport to generative modeling: the VEGAN cookbook

We study unsupervised generative modeling in terms of the optimal transp...
research
10/21/2022

On amortizing convex conjugates for optimal transport

This paper focuses on computing the convex conjugate operation that aris...
research
05/27/2022

Momentum Stiefel Optimizer, with Applications to Suitably-Orthogonal Attention, and Optimal Transport

The problem of optimization on Stiefel manifold, i.e., minimizing functi...
research
05/27/2021

Efficient and Accurate Gradients for Neural SDEs

Neural SDEs combine many of the best qualities of both RNNs and SDEs, an...
research
10/05/2018

Local Stability and Performance of Simple Gradient Penalty mu-Wasserstein GAN

Wasserstein GAN(WGAN) is a model that minimizes the Wasserstein distance...
