This paper considers continual learning of large-scale pretrained neural...
The many-to-many multilingual neural machine translation can translate
b...
Multilingual neural machine translation with a single model has drawn mu...
Although teacher forcing has become the main training paradigm for neura...
Domain Adaptation is widely used in practical applications of neural mac...
Neural machine translation (NMT) models usually suffer from catastrophic...
There exists a token imbalance phenomenon in natural language as differe...
Neural machine translation models usually adopt the teacher forcing stra...
Context modeling is essential to generate coherent and consistent transl...
Multi-head attention advances neural machine translation by working out
...
In domain adaptation for neural machine translation, translation perform...