MaskSketch: Unpaired Structure-guided Masked Image Generation

02/10/2023
by   Dina Bashkirova, et al.
0

Recent conditional image generation methods produce images of remarkable diversity, fidelity and realism. However, the majority of these methods allow conditioning only on labels or text prompts, which limits their level of control over the generation result. In this paper, we introduce MaskSketch, an image generation method that allows spatial conditioning of the generation result using a guiding sketch as an extra conditioning signal during sampling. MaskSketch utilizes a pre-trained masked generative transformer, requiring no model training or paired supervision, and works with input sketches of different levels of abstraction. We show that intermediate self-attention maps of a masked generative transformer encode important structural information of the input image, such as scene layout and object shape, and we propose a novel sampling method based on this observation to enable structure-guided generation. Our results show that MaskSketch achieves high image realism and fidelity to the guiding structure. Evaluated on standard benchmark datasets, MaskSketch outperforms state-of-the-art methods for sketch-to-image translation, as well as unpaired image-to-image translation approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 7

page 8

page 14

research
12/12/2019

Unified Generative Adversarial Networks for Controllable Image-to-Image Translation

Controllable image-to-image translation, i.e., transferring an image fro...
research
10/24/2019

Guided Image-to-Image Translation with Bi-Directional Feature Transformation

We address the problem of guided image-to-image translation where we tra...
research
05/08/2022

On Conditioning the Input Noise for Controlled Image Generation with Diffusion Models

Conditional image generation has paved the way for several breakthroughs...
research
12/09/2019

Learning Structure-Appearance Joint Embedding for Indoor Scene Image Synthesis

Advanced image synthesis methods can generate photo-realistic images for...
research
03/25/2022

Spatially Multi-conditional Image Generation

In most scenarios, conditional image generation can be thought of as an ...
research
04/11/2021

CoPE: Conditional image generation using Polynomial Expansions

Generative modeling has evolved to a notable field of machine learning. ...
research
03/26/2023

Relational Inductive Biases for Object-Centric Image Generation

Conditioning image generation on specific features of the desired output...

Please sign up or login with your details

Forgot password? Click here to reset