String Transduction with Target Language Models and Insertion Handling

09/19/2018
by   Garrett Nicolai, et al.
0

Many character-level tasks can be framed as sequence-to-sequence transduction, where the target is a word from a natural language. We show that leveraging target language models derived from unannotated target corpora, combined with a precise alignment of the training data, yields state-of-the art results on cognate projection, inflection generation, and phoneme-to-grapheme conversion.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset