An Interactive Machine Translation Framework for Modernizing Historical Documents

10/08/2019
by   Miguel Domingo, et al.
0

Due to the nature of human language, historical documents are hard to comprehend by contemporary people. This limits their accessibility to scholars specialized in the time period in which the documents were written. Modernization aims at breaking this language barrier by generating a new version of a historical document, written in the modern version of the document's original language. However, while it is able to increase the document's comprehension, modernization is still far from producing an error-free version. In this work, we propose a collaborative framework in which a scholar can work together with the machine to generate the new version. We tested our approach on a simulated environment, achieving significant reductions of the human effort needed to produce the modernized version of the document.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2019

Modernizing Historical Documents: a User Study

Accessibility to historical documents is mostly limited to scholars. Thi...
research
02/02/2021

Two Demonstrations of the Machine Translation Applications to Historical Documents

We present our demonstration of two machine translation applications to ...
research
05/20/2022

Translating Hanja historical documents to understandable Korean and English

The Annals of Joseon Dynasty (AJD) contain the daily records of the King...
research
05/03/2023

DocLangID: Improving Few-Shot Training to Identify the Language of Historical Documents

Language identification describes the task of recognizing the language o...
research
05/24/2019

Compiler Design for Legal Document Translation in Digital Government

One of the main purposes of a computer is automation. In fact, automatio...
research
06/26/2023

Transfer Learning across Several Centuries: Machine and Historian Integrated Method to Decipher Royal Secretary's Diary

A named entity recognition and classification plays the first and foremo...
research
03/06/2010

Local Space-Time Smoothing for Version Controlled Documents

Unlike static documents, version controlled documents are continuously e...

Please sign up or login with your details

Forgot password? Click here to reset