Adam Mickiewicz University at WMT 2022: NER-Assisted and Quality-Aware Neural Machine Translation

09/07/2022
by Artur Nowakowski, et al.

This paper presents the Adam Mickiewicz University (AMU) submissions to the constrained track of the WMT 2022 General MT Task. We participated in the Ukrainian ↔ Czech translation directions. Each system is a weighted ensemble of four models based on the Transformer (big) architecture. The models use source factors to exploit information about named entities present in the input. Every model in the ensemble was trained using only the data provided by the shared task organizers, and a noisy back-translation technique was used to augment the training corpora. One of the models in the ensemble is a document-level model, trained on longer parallel and synthetic sequences. During sentence-level decoding, the ensemble generated an n-best list, which was merged with the n-best list generated by a single document-level model that translated multiple sentences at a time. Finally, existing quality estimation models and minimum Bayes risk decoding were used to rerank the merged n-best list so that the best hypothesis was chosen according to the COMET evaluation metric. According to the automatic evaluation results, our systems rank first in both translation directions.
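The merging and minimum Bayes risk (MBR) reranking steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `merge_nbest`, `mbr_select`, and `unigram_f1` are hypothetical, and a toy unigram F1 overlap stands in for COMET as the utility function, purely so the example runs without external models. The quality estimation step is omitted.

```python
from collections import Counter
from typing import Callable, List


def unigram_f1(hypothesis: str, reference: str) -> float:
    """Toy utility: unigram F1 overlap. An assumed stand-in for COMET."""
    hyp, ref = Counter(hypothesis.split()), Counter(reference.split())
    overlap = sum((hyp & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(hyp.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def merge_nbest(*nbest_lists: List[str]) -> List[str]:
    """Concatenate n-best lists, dropping exact duplicates and keeping order."""
    seen, merged = set(), []
    for nbest in nbest_lists:
        for hyp in nbest:
            if hyp not in seen:
                seen.add(hyp)
                merged.append(hyp)
    return merged


def mbr_select(
    candidates: List[str],
    utility: Callable[[str, str], float] = unigram_f1,
) -> str:
    """Return the candidate with the highest mean utility against all others.

    Each other candidate is treated as a pseudo-reference, which is the usual
    sampling-free approximation of expected utility over an n-best list.
    """
    if not candidates:
        raise ValueError("candidates must not be empty")
    if len(candidates) == 1:
        return candidates[0]

    def expected_utility(i: int) -> float:
        others = [c for j, c in enumerate(candidates) if j != i]
        return sum(utility(candidates[i], o) for o in others) / len(others)

    best_index = max(range(len(candidates)), key=expected_utility)
    return candidates[best_index]
```

To use it, pass the sentence-level and document-level n-best lists to `merge_nbest` and hand the result to `mbr_select`. Swapping `unigram_f1` for a neural metric only requires a callable with the same `(hypothesis, reference) -> float` shape, although a real COMET-based utility would also take the source sentence as input.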


research
07/14/2019

Microsoft Translator at WMT 2019: Towards Large-Scale Document-Level Neural Machine Translation

This paper describes the Microsoft Translator submissions to the WMT19 n...
research
05/02/2022

Quality-Aware Decoding for Neural Machine Translation

Despite the progress in machine translation quality estimation and evalu...
research
06/08/2023

On Search Strategies for Document-Level Neural Machine Translation

Compared to sentence-level systems, document-level neural machine transl...
research
09/12/2017

SYSTRAN Purely Neural MT Engines for WMT2017

This paper describes SYSTRAN's systems submitted to the WMT 2017 shared ...
research
08/31/2018

Ensemble Sequence Level Training for Multimodal MT: OSU-Baidu WMT18 Multimodal Machine Translation System Report

This paper describes multimodal machine translation systems developed jo...
research
12/01/2022

CUNI Systems for the WMT22 Czech-Ukrainian Translation Task

We present Charles University submissions to the WMT22 General Translati...
research
02/10/2022

Identifying Weaknesses in Machine Translation Metrics Through Minimum Bayes Risk Decoding: A Case Study for COMET

Neural metrics have achieved impressive correlation with human judgement...
