Michele Cafagna


Featured Co-authors

Anette Frank
39 publications
Malvina Nissim
35 publications
Albert Gatt
33 publications
Marco Guerini
19 publications
Kees van Deemter
18 publications
Iacer Calixto
16 publications
Lina M. Rojas Barahona
16 publications
Huiyuan Lai
10 publications
Letitia Parcalabescu
6 publications
Felice Dell'Orletta
4 publications
Lorenzo De Mattei
2 publications

research

∙ 04/28/2023

Interpreting Vision and Language Generative Models with Semantic Visual Priors

When applied to Image-to-text models, interpretability methods often pro...

Michele Cafagna, et al.

research

∙ 02/23/2023

HL Dataset: Grounding High-Level Linguistic Concepts in Vision

Current captioning datasets focus on object-centric captions, describin...

Michele Cafagna, et al.

research

∙ 11/09/2022

Understanding Cross-modal Interactions in V&L Models that Generate Scene Descriptions

Image captioning models tend to describe images in an object-centric way...

Michele Cafagna, et al.

research

∙ 12/14/2021

VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena

We propose VALSE (Vision And Language Structured Evaluation), a novel be...

Letitia Parcalabescu, et al.

research

∙ 09/15/2021

What Vision-Language Models 'See' when they See Scenes

Images can be described in terms of the objects they contain, or in term...

Michele Cafagna, et al.

research

∙ 01/05/2021

On the interaction of automatic evaluation and task framing in headline style transfer

An ongoing debate in the NLG community concerns the best way to evaluate...

Lorenzo De Mattei, et al.

research

∙ 04/29/2020

GePpeTto Carves Italian into a Language Model

In the last few years, pre-trained neural architectures have provided im...

Lorenzo De Mattei, et al.
