Massively Parallel Cross-Lingual Learning in Low-Resource Target Language Translation

04/21/2018
by   Zhong Zhou, et al.
0

We work on translation from rich-resource languages to low-resource languages. The main challenges we identify are the lack of low-resource language data, effective methods for cross-lingual transfer, and the variable-binding problem that is common in neural systems. We build a translation system that addresses these challenges using eight European language families as our test ground. Firstly, we add source and target family labels and study intra- family and inter-family influences for effective cross-lingual transfer. We achieve improvement of +8.4 BLEU score compared to single-family multi-source multi- target NMT baseline. We find that training two neighboring families closest to the low-resource language is often enough. Secondly, we construct an ablation study and find that reasonably good results can be achieved even with considerably less target data. Thirdly, we address the variable-binding problem by building an order-preserving named entity translation model. We obtain 60.6 translations are akin to human translations in a preliminary study.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/13/2019

Low-Resource Syntactic Transfer with Unsupervised Source Reordering

We describe a cross-lingual transfer method for dependency parsing that ...
research
11/02/2021

Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity

Speech processing systems currently do not support the vast majority of ...
research
05/18/2021

Exploiting Adapters for Cross-lingual Low-resource Speech Recognition

Cross-lingual speech adaptation aims to solve the problem of leveraging ...
research
09/05/2018

BPE and CharCNNs for Translation of Morphology: A Cross-Lingual Comparison and Analysis

Neural Machine Translation (NMT) in low-resource settings and of morphol...
research
04/12/2021

Family of Origin and Family of Choice: Massively Parallel Lexiconized Iterative Pretraining for Severely Low Resource Machine Translation

We translate a closed text that is known in advance into a severely low ...
research
04/13/2019

End-to-end Text-to-speech for Low-resource Languages by Cross-Lingual Transfer Learning

End-to-end text-to-speech (TTS) has shown great success on large quantit...
research
11/27/2018

Cross-Lingual Approaches to Reference Resolution in Dialogue Systems

In the slot-filling paradigm, where a user can refer back to slots in th...

Please sign up or login with your details

Forgot password? Click here to reset