Naver Labs Europe's Systems for the WMT19 Machine Translation Robustness Task

07/15/2019
by   Alexandre Berard, et al.
0

This paper describes the systems that we submitted to the WMT19 Machine Translation robustness task. This task aims to improve MT's robustness to noise found on social media, like informal language, spelling mistakes and other orthographic variations. The organizers provide parallel data extracted from a social media website in two language pairs: French-English and Japanese-English (in both translation directions). The goal is to obtain the best scores on unseen test sets from the same source, according to automatic metrics (BLEU) and human evaluation. We proposed one single and one ensemble system for each translation direction. Our ensemble models ranked first in all language pairs, according to BLEU evaluation. We discuss the pre-processing choices that we made, and present our solutions for robustness to noise and domain adaptation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/06/2019

UFRGS Participation on the WMT Biomedical Translation Shared Task

This paper describes the machine translation systems developed by the Un...
research
06/19/2019

Robust Machine Translation with Domain Sensitive Pseudo-Sources: Baidu-OSU WMT19 MT Robustness Shared Task System Report

This paper describes the machine translation system developed jointly by...
research
06/27/2019

Findings of the First Shared Task on Machine Translation Robustness

We share the findings of the first shared task on improving robustness o...
research
10/31/2019

Machine Translation of Restaurant Reviews: New Corpus for Domain Adaptation and Robustness

We share a French-English parallel corpus of Foursquare restaurant revie...
research
02/25/2019

Improving Robustness of Machine Translation with Synthetic Noise

Modern Machine Translation (MT) systems perform consistently well on cle...
research
07/09/2019

NTT's Machine Translation Systems for WMT19 Robustness Task

This paper describes NTT's submission to the WMT19 robustness task. This...
research
02/19/2022

CALCS 2021 Shared Task: Machine Translation for Code-Switched Data

To date, efforts in the code-switching literature have focused for the m...

Please sign up or login with your details

Forgot password? Click here to reset