Diagnosing and Fixing Manifold Overfitting in Deep Generative Models

04/14/2022
by   Gabriel Loaiza-Ganem, et al.
23

Likelihood-based, or explicit, deep generative models use neural networks to construct flexible high-dimensional densities. This formulation directly contradicts the manifold hypothesis, which states that observed data lies on a low-dimensional manifold embedded in high-dimensional ambient space. In this paper we investigate the pathologies of maximum-likelihood training in the presence of this dimensionality mismatch. We formally prove that degenerate optima are achieved wherein the manifold itself is learned but not the distribution on it, a phenomenon we call manifold overfitting. We propose a class of two-step procedures consisting of a dimensionality reduction step followed by maximum-likelihood density estimation, and prove that they recover the data-generating distribution in the nonparametric regime, thus avoiding manifold overfitting. We also show that these procedures enable density estimation on the manifolds learned by implicit models, such as generative adversarial networks, hence addressing a major shortcoming of these models. Several recently proposed methods are instances of our two-step procedures; we thus unify, extend, and theoretically justify a large class of models.

READ FULL TEXT

page 9

page 23

page 24

page 25

page 26

page 27

page 28

page 29

research
05/09/2021

A likelihood approach to nonparametric estimation of a singular distribution using deep generative models

We investigate statistical properties of a likelihood approach to nonpar...
research
11/30/2022

Denoising Deep Generative Models

Likelihood-based deep generative models have recently been shown to exhi...
research
06/02/2021

Rectangular Flows for Manifold Learning

Normalizing flows are invertible neural networks with tractable change-o...
research
08/26/2023

Out-of-distribution detection using normalizing flows on the data manifold

A common approach for out-of-distribution detection involves estimating ...
research
06/08/2021

Manifold Topology Divergence: a Framework for Comparing Data Manifolds

We develop a framework for comparing data manifolds, aimed, in particula...
research
02/18/2022

Minimax Rate of Distribution Estimation on Unknown Submanifold under Adversarial Losses

Statistical inference from high-dimensional data with low-dimensional st...
research
05/30/2023

One-Line-of-Code Data Mollification Improves Optimization of Likelihood-based Generative Models

Generative Models (GMs) have attracted considerable attention due to the...

Please sign up or login with your details

Forgot password? Click here to reset