Zero-shot Inversion Process for Image Attribute Editing with Diffusion Models

08/30/2023
by   Zhanbo Feng, et al.
0

Denoising diffusion models have shown outstanding performance in image editing. Existing works tend to use either image-guided methods, which provide a visual reference but lack control over semantic coherence, or text-guided methods, which ensure faithfulness to text guidance but lack visual quality. To address the problem, we propose the Zero-shot Inversion Process (ZIP), a framework that injects a fusion of generated visual reference and text guidance into the semantic latent space of a frozen pre-trained diffusion model. Only using a tiny neural network, the proposed ZIP produces diverse content and attributes under the intuitive control of the text prompt. Moreover, ZIP shows remarkable robustness for both in-domain and out-of-domain attribute manipulation on real images. We perform detailed experiments on various benchmark datasets. Compared to state-of-the-art methods, ZIP produces images of equivalent quality while providing a realistic editing effect.

READ FULL TEXT

page 7

page 8

page 10

page 16

page 17

page 18

page 19

page 20

research
05/24/2023

ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation

Editing real facial images is a crucial task in computer vision with sig...
research
02/08/2023

Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models

Recent advancements in large scale text-to-image models have opened new ...
research
08/11/2023

Zero-shot Text-driven Physically Interpretable Face Editing

This paper proposes a novel and physically interpretable method for face...
research
06/21/2023

Local 3D Editing via 3D Distillation of CLIP Knowledge

3D content manipulation is an important computer vision task with many r...
research
11/18/2022

A Structure-Guided Diffusion Model for Large-Hole Diverse Image Completion

Diverse image completion, a problem of generating various ways of fillin...
research
07/24/2023

Interpolating between Images with Diffusion Models

One little-explored frontier of image generation and editing is the task...
research
03/19/2023

SKED: Sketch-guided Text-based 3D Editing

Text-to-image diffusion models are gradually introduced into computer gr...

Please sign up or login with your details

Forgot password? Click here to reset