Latent Visual Cues for Neural Machine Translation

11/01/2018
by   Iacer Calixto, et al.

In this work, we propose to model the interaction between visual and textual features for multi-modal neural machine translation (MMT) through a latent variable model. The latent variable can be seen as a stochastic embedding; it is used in the target-language decoder and also to predict image features. Importantly, although our model formulation captures correlations between visual and textual features, it does not require images to be available at test time. We show that our latent variable MMT formulation improves considerably over strong baselines, including the multi-task learning approach of Elliott and Kadar (2017) and the conditional variational auto-encoder approach of Toyama et al. (2016). Finally, in an ablation study we show that (i) predicting image features in addition to conditioning on them and (ii) imposing a constraint on the minimum amount of information encoded in the latent variable each slightly improve translations.
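
The ingredients named in the abstract (a stochastic embedding, an image-feature prediction term, and a minimum-information constraint on the latent variable) can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes a diagonal Gaussian latent variable, a standard normal prior, a linear image-feature predictor with squared-error loss, and a floor on the KL term (often called "free bits") as the minimum-information constraint. All function and parameter names (`sample_latent`, `kl_with_floor`, `min_nats`, `W`, `b`) are hypothetical.

```python
import numpy as np


def sample_latent(mu, log_var, rng):
    """Draw z = mu + sigma * eps with eps ~ N(0, I) (reparameterisation)."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, diag(sigma^2)) || N(0, I)), summed over latent dimensions.

    Assumption: a standard normal prior; the paper's prior may differ.
    """
    return 0.5 * np.sum(np.exp(log_var) + mu ** 2 - 1.0 - log_var, axis=-1)


def kl_with_floor(kl, min_nats):
    """Assumed form of the minimum-information constraint: the KL term
    is not penalised below `min_nats`, so the latent keeps some content."""
    return np.maximum(kl, min_nats)


def image_prediction_loss(z, W, b, image_features):
    """Squared error of a linear map from z to image features (assumed)."""
    predicted = z @ W + b
    return 0.5 * np.sum((predicted - image_features) ** 2, axis=-1)


def training_loss(translation_nll, mu, log_var, z, W, b, image_features,
                  min_nats=2.0):
    """Per-example loss: translation term + image term + floored KL."""
    kl = kl_with_floor(kl_to_standard_normal(mu, log_var), min_nats)
    return translation_nll + image_prediction_loss(z, W, b, image_features) + kl
```

In this sketch the image features appear only as a prediction target in the loss, which is one way a model can exploit images during training without needing them at test time. How the paper's own inference and prior networks are parameterised is not stated in the abstract.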
