A Three Step Training Approach with Data Augmentation for Morphological Inflection

09/14/2021
by   Gabor Szolnok, et al.
0

We present the BME submission for the SIGMORPHON 2021 Task 0 Part 1, Generalization Across Typologically Diverse Languages shared task. We use an LSTM encoder-decoder model with three step training that is first trained on all languages, then fine-tuned on each language families and finally finetuned on individual languages. We use a different type of data augmentation technique in the first two steps. Our system outperformed the only other submission. Although it remains worse than the Transformer baseline released by the organizers, our model is simpler and our data augmentation techniques are easily applicable to new languages. We perform ablation studies and show that the augmentation techniques and the three training steps often help but sometimes have a negative effect.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/10/2018

Low Resource Multi-modal Data Augmentation for End-to-end ASR

We explore training attention-based encoder-decoder ASR for low-resource...
research
09/03/2018

Data Augmentation for Neural Online Chat Response Selection

Data augmentation seeks to manipulate the available data for training to...
research
10/25/2022

On Robust Incremental Learning over Many Multilingual Steps

Recent work in incremental learning has introduced diverse approaches to...
research
11/03/2022

Exploring the State-of-the-Art Language Modeling Methods and Data Augmentation Techniques for Multilingual Clause-Level Morphology

This paper describes the KUIS-AI NLP team's submission for the 1^st Shar...
research
04/13/2021

Can a Transformer Pass the Wug Test? Tuning Copying Bias in Neural Morphological Inflection Models

Deep learning sequence models have been successfully applied to the task...
research
01/28/2021

Enhancing Sequence-to-Sequence Neural Lemmatization with External Resources

We propose a novel hybrid approach to lemmatization that enhances the se...
research
06/25/2022

ConcreteGraph: A Data Augmentation Method Leveraging the Properties of Concept Relatedness Estimation

The concept relatedness estimation (CRE) task is to determine whether tw...

Please sign up or login with your details

Forgot password? Click here to reset