PromptMix: Text-to-image diffusion models enhance the performance of lightweight networks

01/30/2023
by Arian Bakhtiarnia, et al.

Many deep learning tasks require annotations that are too time-consuming for human operators to produce, resulting in small dataset sizes. This is especially true for dense regression problems such as crowd counting, which requires the location of every person in the image to be annotated. Techniques such as data augmentation and synthetic data generation based on simulations can help in such cases. In this paper, we introduce PromptMix, a method for artificially boosting the size of existing datasets that can be used to improve the performance of lightweight networks. First, synthetic images are generated in an end-to-end data-driven manner: text prompts are extracted from existing datasets via an image captioning deep network and subsequently fed to text-to-image diffusion models. The generated images are then annotated using one or more high-performing deep networks and mixed with the real dataset for training the lightweight network. Through extensive experiments on five datasets and two tasks, we show that PromptMix can significantly increase the performance of lightweight networks by up to 26%.
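The data flow described in the abstract (caption, generate, annotate, mix) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function name `promptmix_dataset` and the parameters `caption_fn`, `generate_fn`, `annotators`, and `combine_fn` are hypothetical placeholders for the captioning network, the text-to-image diffusion model, the high-performing annotating networks, and a rule for merging their predictions. Generating exactly one synthetic image per real image and shuffling the mixed set are also assumptions that the abstract does not specify.

```python
import random
from typing import Callable, List, Sequence, Tuple, TypeVar

Image = TypeVar("Image")
Label = TypeVar("Label")


def promptmix_dataset(
    real_data: Sequence[Tuple[Image, Label]],
    caption_fn: Callable[[Image], str],               # hypothetical: image captioning network
    generate_fn: Callable[[str], Image],              # hypothetical: text-to-image diffusion model
    annotators: Sequence[Callable[[Image], Label]],   # hypothetical: high-performing networks
    combine_fn: Callable[[List[Label]], Label],       # hypothetical: merges annotator outputs
    seed: int = 0,
) -> List[Tuple[Image, Label]]:
    """Return the real samples mixed with pseudo-labelled synthetic samples."""
    if not annotators:
        raise ValueError("at least one annotator network is required")

    synthetic: List[Tuple[Image, Label]] = []
    for image, _ in real_data:
        # 1. Extract a text prompt from an existing image.
        prompt = caption_fn(image)
        # 2. Generate a synthetic image from that prompt.
        fake_image = generate_fn(prompt)
        # 3. Annotate the synthetic image with one or more strong networks.
        predictions = [annotate(fake_image) for annotate in annotators]
        synthetic.append((fake_image, combine_fn(predictions)))

    # 4. Mix real and synthetic samples for training the lightweight network.
    mixed = list(real_data) + synthetic
    random.Random(seed).shuffle(mixed)  # assumption: shuffled, reproducible order
    return mixed


if __name__ == "__main__":
    # Stand-in callables so the sketch runs without any model weights.
    real = [("img_a", 3.0), ("img_b", 5.0)]
    mixed = promptmix_dataset(
        real,
        caption_fn=lambda img: f"a crowd photo like {img}",
        generate_fn=lambda prompt: f"synthetic<{prompt}>",
        annotators=[lambda img: 4.0, lambda img: 6.0],
        combine_fn=lambda preds: sum(preds) / len(preds),
    )
    print(len(mixed))
```

Passing the models in as plain callables keeps the sketch independent of any particular captioning or diffusion library. In practice each callable would wrap a real model, and `combine_fn` would depend on the task (for example, averaging predicted counts for crowd counting).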


research
05/03/2023

Multimodal Data Augmentation for Image Captioning using Diffusion Models

Image captioning, an important vision-language task, often requires a tr...
research
09/10/2023

Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning

While impressive performance has been achieved in image captioning, the ...
research
08/08/2023

The Five-Dollar Model: Generating Game Maps and Sprites from Sentence Embeddings

The five-dollar model is a lightweight text-to-image generative architec...
research
10/27/2021

Training Lightweight CNNs for Human-Nanodrone Proximity Interaction from Small Datasets using Background Randomization

We consider the task of visually estimating the pose of a human from ima...
research
09/18/2022

Siamese Network-based Lightweight Framework for Tomato Leaf Disease Recognition

Automatic tomato disease recognition from leaf images is vital to avoid ...
research
03/30/2018

Guide Me: Interacting with Deep Networks

Interaction and collaboration between humans and intelligent machines ha...
research
06/09/2023

Boosting GUI Prototyping with Diffusion Models

GUI (graphical user interface) prototyping is a widely-used technique in...
