Flow Matching in Latent Space

07/17/2023
by Quan Dao, et al.

Flow matching is a recent framework for training generative models that exhibits impressive empirical performance while being easier to train than diffusion-based models. Despite these advantages, prior methods still face expensive computation and a large number of function evaluations with off-the-shelf solvers in pixel space. Furthermore, although latent-based generative methods have shown great success in recent years, this model type remains underexplored for flow matching. In this work, we propose to apply flow matching in the latent spaces of pretrained autoencoders, which offers improved computational efficiency and scalability for high-resolution image synthesis. This enables flow-matching training on constrained computational resources while maintaining quality and flexibility. Additionally, our work is a pioneering contribution in integrating various conditions into flow matching for conditional generation tasks, including label-conditioned image generation, image inpainting, and semantic-to-image generation. Through extensive experiments, our approach demonstrates its effectiveness in both quantitative and qualitative results on various datasets, such as CelebA-HQ, FFHQ, LSUN Church & Bedroom, and ImageNet. We also provide a theoretical control of the Wasserstein-2 distance between the reconstructed latent flow distribution and the true data distribution, showing it is upper-bounded by the latent flow matching objective. Our code will be available at https://github.com/VinAIResearch/LFM.git.
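
To make the idea of training flow matching on latent codes concrete, the following is a minimal sketch and not the authors' implementation. It assumes a straight-line path between a Gaussian noise sample (t = 0) and a latent code (t = 1), which is one common choice that the abstract does not itself specify. The names `flow_matching_loss`, `euler_sample`, and `velocity_fn` are hypothetical, and plain NumPy arrays stand in for the latents that a pretrained autoencoder's encoder would produce.

```python
import numpy as np


def flow_matching_loss(velocity_fn, z1, rng):
    """Monte Carlo estimate of a flow matching loss on latent codes.

    z1: array of shape (batch, dim), standing in for encoder outputs.
    velocity_fn(z, t): hypothetical model returning a velocity per sample.
    Assumes a straight-line path from noise (t=0) to data (t=1).
    """
    z0 = rng.standard_normal(z1.shape)        # noise endpoint of each path
    t = rng.uniform(size=(z1.shape[0], 1))    # one time in [0, 1) per sample
    zt = (1.0 - t) * z0 + t * z1              # point on the straight path
    target = z1 - z0                          # constant velocity of that path
    pred = velocity_fn(zt, t)
    return float(np.mean(np.sum((pred - target) ** 2, axis=1)))


def euler_sample(velocity_fn, z0, steps=50):
    """Integrate dz/dt = velocity_fn(z, t) from t=0 to t=1 with Euler steps."""
    z = z0.copy()
    dt = 1.0 / steps
    for i in range(steps):
        t = np.full((z.shape[0], 1), i * dt)
        z = z + dt * velocity_fn(z, t)
    return z
```

In a latent setup of the kind the abstract describes, `z1` would come from the encoder of a pretrained autoencoder and the output of `euler_sample` would be passed through its decoder to obtain an image. Both steps are omitted here, and the fixed-step Euler loop is only a stand-in for whichever ODE solver is actually used.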

research
12/20/2021

High-Resolution Image Synthesis with Latent Diffusion Models

By decomposing the image formation process into a sequential application...
research
03/24/2023

Conditional Image-to-Video Generation with Latent Flow Diffusion Models

Conditional image-to-video (cI2V) generation aims to synthesize a new pl...
research
05/26/2023

Functional Flow Matching

In this work, we propose Functional Flow Matching (FFM), a function-spac...
research
05/13/2021

PassFlow: Guessing Passwords with Generative Flows

Recent advances in generative machine learning models rekindled research...
research
06/25/2021

NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation

In this paper, we present a non-parametric structured latent variable mo...
research
11/26/2022

Randomized Conditional Flow Matching for Video Prediction

We introduce a novel generative model for video prediction based on late...
research
12/13/2022

Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance

Score-based generative models are shown to achieve remarkable empirical ...
