Towards a General-Purpose Linguistic Annotation Backend

12/13/2018
by   Graham Neubig, et al.
0

Language documentation is inherently a time-intensive process; transcription, glossing, and corpus management consume a significant portion of documentary linguists' work. Advances in natural language processing can help to accelerate this work, using the linguists' past decisions as training material, but questions remain about how to prioritize human involvement. In this extended abstract, we describe the beginnings of a new project that will attempt to ease this language documentation process through the use of natural language processing (NLP) technology. It is based on (1) methods to adapt NLP tools to new languages, based on recent advances in massively multilingual neural networks, and (2) backend APIs and interfaces that allow linguists to upload their data. We then describe our current progress on two fronts: automatic phoneme transcription, and glossing. Finally, we briefly describe our future directions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2021

Natural Language Processing 4 All (NLP4All): A New Online Platform for Teaching and Learning NLP Concepts

Natural Language Processing offers new insights into language data acros...
research
02/01/2023

User Study for Improving Tools for Bible Translation

Technology has increasingly become an integral part of the Bible transla...
research
08/08/2023

CLASSLA-Stanza: The Next Step for Linguistic Processing of South Slavic Languages

We present CLASSLA-Stanza, a pipeline for automatic linguistic annotatio...
research
04/25/2023

Lessons Learned from a Citizen Science Project for Natural Language Processing

Many Natural Language Processing (NLP) systems use annotated corpora for...
research
06/09/2021

What Would a Teacher Do? Predicting Future Talk Moves

Recent advances in natural language processing (NLP) have the ability to...
research
08/11/2021

Ensuring the Inclusive Use of Natural Language Processing in the Global Response to COVID-19

Natural language processing (NLP) plays a significant role in tools for ...
research
03/04/2022

Deep Lexical Hypothesis: Identifying personality structure in natural language

Recent advances in natural language processing (NLP) have produced gener...

Please sign up or login with your details

Forgot password? Click here to reset