Learning to Evaluate Translation Beyond English: BLEURT Submissions to the WMT Metrics 2020 Shared Task

10/08/2020
by   Thibault Sellam, et al.
0

The quality of machine translation systems has dramatically improved over the last decade, and as a result, evaluation has become an increasingly challenging problem. This paper describes our contribution to the WMT 2020 Metrics Shared Task, the main benchmark for automatic evaluation of translation. We make several submissions based on BLEURT, a previously published metric based on transfer learning. We extend the metric beyond English and evaluate it on 14 language pairs for which fine-tuning data is available, as well as 4 "zero-shot" language pairs, for which we have no labelled examples. Additionally, we focus on English to German and demonstrate how to combine BLEURT's predictions with those of YiSi and use alternative reference translations to enhance the performance. Empirical results show that the models achieve competitive results on the WMT Metrics 2019 Shared Task, indicating their promise for the 2020 edition.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/29/2019

Machine Translation Evaluation with BERT Regressor

We introduce the metric using BERT (Bidirectional Encoder Representation...
research
02/28/2023

Large Language Models Are State-of-the-Art Evaluators of Translation Quality

We describe GEMBA, a GPT-based metric for assessment of translation qual...
research
09/01/2018

LIUM-CVC Submissions for WMT18 Multimodal Translation Task

This paper describes the multimodal Neural Machine Translation systems d...
research
09/08/2021

Ensemble Fine-tuned mBERT for Translation Quality Estimation

Quality Estimation (QE) is an important component of the machine transla...
research
10/11/2020

Addressing Exposure Bias With Document Minimum Risk Training: Cambridge at the WMT20 Biomedical Translation Task

The 2020 WMT Biomedical translation task evaluated Medline abstract tran...
research
04/28/2022

RoBLEURT Submission for the WMT2021 Metrics Task

In this paper, we present our submission to Shared Metrics Task: RoBLEUR...
research
09/20/2021

CUNI systems for WMT21: Terminology translation Shared Task

This paper describes Charles University submission for Terminology trans...

Please sign up or login with your details

Forgot password? Click here to reset