A Reproducible Extraction of Training Images from Diffusion Models

05/15/2023
by Ryan Webster, et al.

Recently, Carlini et al. demonstrated that the widely used model Stable Diffusion can regurgitate real training samples, which is troublesome from a copyright perspective. In this work, we provide an efficient extraction attack on par with the recent attack, with several orders of magnitude fewer network evaluations. In the process, we expose a new phenomenon, which we dub template verbatims, wherein a diffusion model regurgitates a training sample largely intact. Template verbatims are harder to detect because they require retrieval and masking to label correctly. Furthermore, they are still generated by newer systems, even those that de-duplicate their training set, and we give insight into why they still appear during generation. We extract training images from several state-of-the-art systems, including Stable Diffusion 2.0, Deep Image Floyd, and Midjourney v4. We release code to verify our extraction attack and to perform the attack, as well as all extracted prompts, at <https://github.com/ryanwebster90/onestep-extraction>.
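
To make the "retrieval and masking" step concrete, the following is a minimal sketch of how a generated image might be labeled once a candidate training image has already been retrieved. It is not the procedure from the paper or the released repository: the function names (`masked_match`, `label_pair`), the per-pixel tolerance, and both thresholds are hypothetical and chosen only for illustration. Images are assumed to be same-sized grayscale grids with values in [0, 1].

```python
def masked_match(generated, candidate, tol=0.1):
    """Compare two same-sized grayscale images (lists of rows of floats).

    Returns (mask, fraction), where mask[i][j] is True when the pixels
    agree within `tol` and fraction is the share of agreeing pixels.
    The tolerance is an illustrative assumption, not a value from the paper.
    """
    if len(generated) != len(candidate) or any(
        len(g_row) != len(c_row) for g_row, c_row in zip(generated, candidate)
    ):
        raise ValueError("images must have identical dimensions")

    mask = [
        [abs(g - c) <= tol for g, c in zip(g_row, c_row)]
        for g_row, c_row in zip(generated, candidate)
    ]
    total = sum(len(row) for row in mask)
    matched = sum(sum(row) for row in mask)
    return mask, matched / total


def label_pair(generated, candidate, tol=0.1,
               verbatim_thresh=0.95, template_thresh=0.5):
    """Assign a coarse label to a (generated, retrieved) image pair.

    Both thresholds are hypothetical: nearly all pixels matching is called
    "verbatim", a large matching region with a differing remainder is
    called "template verbatim", and anything else is "no match".
    """
    _, fraction = masked_match(generated, candidate, tol)
    if fraction >= verbatim_thresh:
        return "verbatim"
    if fraction >= template_thresh:
        return "template verbatim"
    return "no match"


if __name__ == "__main__":
    train = [[0.0] * 4 for _ in range(4)]
    # Copy the training image, then alter a 2x2 corner to mimic a
    # template whose variable region differs from the training sample.
    variant = [row[:] for row in train]
    for i in range(2):
        for j in range(2):
            variant[i][j] = 1.0
    print(label_pair(train, train))
    print(label_pair(variant, train))
```

In this toy setup the mask marks the region where the two images agree, so a pair that would fail a whole-image comparison can still be flagged when most of the image is copied. A realistic pipeline would first have to retrieve the candidate from the training set and would likely compare images in a more robust feature space than raw pixels.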


