Connecting Phrase based Statistical Machine Translation Adaptation

07/29/2016
by   Rui Wang, et al.
0

Although more additional corpora are now available for Statistical Machine Translation (SMT), only the ones which belong to the same or similar domains with the original corpus can indeed enhance SMT performance directly. Most of the existing adaptation methods focus on sentence selection. In comparison, phrase is a smaller and more fine grained unit for data selection, therefore we propose a straightforward and efficient connecting phrase based adaptation method, which is applied to both bilingual phrase pair and monolingual n-gram adaptation. The proposed method is evaluated on IWSLT/NIST data sets, and the results show that phrase based SMT performance are significantly improved (up to +1.6 in comparison with phrase based SMT baseline system and +0.9 in comparison with existing methods).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/13/2018

Demo of Sanskrit-Hindi SMT System

The demo proposal presents a Phrase-based Sanskrit-Hindi (SaHiT) Statist...
research
09/04/2018

Unsupervised Statistical Machine Translation

While modern machine translation has relied on large parallel corpora, a...
research
10/24/2016

Reordering rules for English-Hindi SMT

Reordering is a preprocessing stage for Statistical Machine Translation ...
research
02/06/2017

A Hybrid Approach For Hindi-English Machine Translation

In this paper, an extended combined approach of phrase based statistical...
research
10/13/2016

Fast, Scalable Phrase-Based SMT Decoding

The utilization of statistical machine translation (SMT) has grown enorm...
research
07/18/2017

Story Generation from Sequence of Independent Short Descriptions

Existing Natural Language Generation (NLG) systems are weak AI systems a...
research
03/25/2017

Simplifying the Bible and Wikipedia Using Statistical Machine Translation

I started this work with the hope of generating a text synthesizer (like...

Please sign up or login with your details

Forgot password? Click here to reset