Continuous Layout Editing of Single Images with Diffusion Models

06/22/2023
by   Zhiyuan Zhang, et al.
0

Recent advancements in large-scale text-to-image diffusion models have enabled many applications in image editing. However, none of these methods have been able to edit the layout of single existing images. To address this gap, we propose the first framework for layout editing of a single image while preserving its visual properties, thus allowing for continuous editing on a single image. Our approach is achieved through two key modules. First, to preserve the characteristics of multiple objects within an image, we disentangle the concepts of different objects and embed them into separate textual tokens using a novel method called masked textual inversion. Next, we propose a training-free optimization method to perform layout control for a pre-trained diffusion model, which allows us to regenerate images with learned concepts and align them with user-specified layouts. As the first framework to edit the layout of existing images, we demonstrate that our method is effective and outperforms other baselines that were modified to support this task. Our code will be freely available for public use upon acceptance.

READ FULL TEXT

page 1

page 3

page 6

page 7

page 8

page 9

page 10

research
12/08/2022

SINE: SINgle Image Editing with Text-to-Image Diffusion Models

Recent works on diffusion models have demonstrated a strong capability f...
research
03/14/2023

Editing Implicit Assumptions in Text-to-Image Diffusion Models

Text-to-image diffusion models often make implicit assumptions about the...
research
02/08/2020

Correction of Chromatic Aberration from a Single Image Using Keypoints

In this paper, we propose a method to correct for chromatic aberration i...
research
05/02/2023

Key-Locked Rank One Editing for Text-to-Image Personalization

Text-to-image models (T2I) offer a new level of flexibility by allowing ...
research
04/06/2023

Training-Free Layout Control with Cross-Attention Guidance

Recent diffusion-based generators can produce high-quality images based ...
research
08/25/2017

Chisio: A Compound Graph Editing and Layout Framework

We introduce a new free, open-source compound graph editing and layout f...
research
05/25/2023

Break-A-Scene: Extracting Multiple Concepts from a Single Image

Text-to-image model personalization aims to introduce a user-provided co...

Please sign up or login with your details

Forgot password? Click here to reset