CDVAE: Co-embedding Deep Variational Auto Encoder for Conditional Variational Generation

12/01/2016
by Jiajun Lu, et al.

Problems such as predicting a new shading field (Y) for an image (X) are ambiguous: many very distinct solutions are good. Representing this ambiguity requires building a conditional model P(Y|X) of the prediction, conditioned on the image. Such a model is difficult to train, because we do not usually have training data containing many different shadings for the same image. As a result, different training examples must share data to produce good models. This presents a danger we call "code space collapse": the training procedure produces a model that has a very good loss score but represents the conditional distribution poorly. We demonstrate an improved method for building conditional models that exploits a metric constraint on training data to prevent code space collapse. We demonstrate our model on two example tasks using real data: image saturation adjustment and image relighting. We describe quantitative metrics for evaluating ambiguous generation results. Our results outperform several strong baselines both quantitatively and qualitatively.
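The abstract does not spell out what a collapsed model looks like in practice. The following is a minimal sketch of one way such a failure could show up, assuming (this is an assumption, not a statement from the paper) that collapse means the decoder ignores the latent code, so every sampled code yields the same Y for a given X. The decoders, the scalar stand-ins for images, and the diversity measure are all hypothetical illustrations and are not the authors' model or metrics.

```python
import random
import statistics


def sample_outputs(decoder, x, n_samples=200, seed=0):
    """Draw codes z ~ N(0, 1) and decode each one for the same input x."""
    rng = random.Random(seed)
    return [decoder(x, rng.gauss(0.0, 1.0)) for _ in range(n_samples)]


def diversity(samples):
    """Population standard deviation of the outputs.

    A value near zero means every code produced the same Y.
    """
    return statistics.pstdev(samples)


# Hypothetical toy decoders; scalars stand in for images.
def healthy_decoder(x, z):
    # The code z selects among distinct plausible outputs for x.
    return x + 2.0 * z


def collapsed_decoder(x, z):
    # Assumed failure mode: z is ignored, so the modelled P(Y|X)
    # degenerates to a single point for each x.
    return x + 0.0 * z


x = 1.5
healthy_div = diversity(sample_outputs(healthy_decoder, x))
collapsed_div = diversity(sample_outputs(collapsed_decoder, x))
print(f"healthy diversity:   {healthy_div:.3f}")
print(f"collapsed diversity: {collapsed_div:.3f}")
```

In this toy setting both decoders could fit a training set that has only one Y per X, which is why a per-example reconstruction loss alone may not reveal the difference; sampling many codes for a fixed X does.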


