Mitigating Inappropriateness in Image Generation: Can there be Value in Reflecting the World's Ugliness?

05/28/2023
by   Manuel Brack, et al.
1

Text-conditioned image generation models have recently achieved astonishing results in image quality and text alignment and are consequently employed in a fast-growing number of applications. Since they are highly data-driven, relying on billion-sized datasets randomly scraped from the web, they also reproduce inappropriate human behavior. Specifically, we demonstrate inappropriate degeneration on a large-scale for various generative text-to-image models, thus motivating the need for monitoring and moderating them at deployment. To this end, we evaluate mitigation strategies at inference to suppress the generation of inappropriate content. Our findings show that we can use models' representations of the world's ugliness to align them with human preferences.

READ FULL TEXT

page 2

page 3

page 7

research
11/09/2022

Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models

Text-conditioned image generation models have recently achieved astonish...
research
09/20/2023

Distilling Adversarial Prompts from Safety Benchmarks: Report for the Adversarial Nibbler Challenge

Text-conditioned image generation models have recently achieved astonish...
research
02/07/2023

Fair Diffusion: Instructing Text-to-Image Generation Models on Fairness

Generative AI models have recently achieved astonishing results in quali...
research
02/16/2023

T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models

The incredible generative ability of large-scale text-to-image (T2I) mod...
research
09/07/2023

T2IW: Joint Text to Image Watermark Generation

Recent developments in text-conditioned image generative models have rev...
research
05/26/2023

Stereotypes and Smut: The (Mis)representation of Non-cisgender Identities by Text-to-Image Models

Cutting-edge image generation has been praised for producing high-qualit...
research
04/04/2023

Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation

Human evaluation is critical for validating the performance of text-to-i...

Please sign up or login with your details

Forgot password? Click here to reset