Interactive Machine Learning for Image Captioning

02/28/2022
by   Mareike Hartmann, et al.
0

We propose an approach for interactive learning for an image captioning model. As human feedback is expensive and modern neural network based approaches often require large amounts of supervised data to be trained, we envision a system that exploits human feedback as good as possible by multiplying the feedback using data augmentation methods, and integrating the resulting training examples into the model in a smart way. This approach has three key components, for which we need to find suitable practical implementations: feedback collection, data augmentation, and model update. We outline our idea and review different possibilities to address these tasks.

READ FULL TEXT
research
06/06/2023

Towards Adaptable and Interactive Image Captioning with Data Augmentation and Episodic Memory

Interactive machine learning (IML) is a beneficial learning paradigm in ...
research
05/03/2023

Multimodal Data Augmentation for Image Captioning using Diffusion Models

Image captioning, an important vision-language task, often requires a tr...
research
01/31/2020

iCap: Interative Image Captioning with Predictive Text

In this paper we study a brand new topic of interactive image captioning...
research
02/22/2021

Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation

Image Captioning, or the automatic generation of descriptions for images...
research
06/06/2023

Putting Humans in the Image Captioning Loop

Image Captioning (IC) models can highly benefit from human feedback in t...
research
06/10/2021

Data augmentation to improve robustness of image captioning solutions

In this paper, we study the impact of motion blur, a common quality flaw...
research
05/03/2016

Improving Image Captioning by Concept-based Sentence Reranking

This paper describes our winning entry in the ImageCLEF 2015 image sente...

Please sign up or login with your details

Forgot password? Click here to reset