Word Alignment in the Era of Deep Learning: A Tutorial

11/30/2022
by Bryan Li, et al.

The word alignment task, despite its prominence in the era of statistical machine translation (SMT), is niche and under-explored today. In this two-part tutorial, we argue for the continued relevance of word alignment. The first part provides historical background on word alignment as a core component of the traditional SMT pipeline. We zero in on GIZA++, an unsupervised, statistical word aligner with surprising longevity. Jumping forward to the era of neural machine translation (NMT), we show how insights from word alignment inspired the attention mechanism fundamental to present-day NMT. The second part shifts to a survey approach. We cover neural word aligners, showing the slow but steady progress towards surpassing GIZA++ performance. Finally, we cover the present-day applications of word alignment, from cross-lingual annotation projection to improving translation.
