FonMTL: Towards Multitask Learning for the Fon Language

08/28/2023
by Bonaventure F. P. Dossou, et al.

The Fon language, spoken by roughly 2 million people, is a truly low-resource African language with a limited online presence and few existing datasets, to name but a few of its challenges. Multitask learning is a learning paradigm that aims to improve the generalization capacity of a model by sharing knowledge across different but related tasks, which could be particularly relevant in very data-scarce scenarios. In this paper, we present the first exploratory approach to multitask learning for enhancing model capabilities in Natural Language Processing for the Fon language. Specifically, we explore the tasks of Named Entity Recognition (NER) and Part-of-Speech Tagging (POS) for Fon. We leverage two language model heads as encoders to build shared representations of the inputs, and we use blocks of linear layers for the classification specific to each task. Our results on the NER and POS tasks for Fon show competitive (or better) performance compared to several multilingual pretrained language models finetuned on single tasks. Additionally, we perform a few ablation studies to assess the efficiency of two different loss combination strategies and find that the equal loss weighting approach works best in our case. Our code is open-sourced at https://github.com/bonaventuredossou/multitask_fon.
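
To make the idea of equal loss weighting concrete, the following is a minimal sketch in plain Python, not the authors' implementation. The function names `cross_entropy` and `combine_losses`, the toy probabilities, and the choice to scale each task by 1/n (rather than summing unscaled losses) are all assumptions made for illustration.

```python
import math


def cross_entropy(probs, gold):
    """Mean negative log-likelihood of the gold label index over all tokens."""
    return -sum(math.log(p[g]) for p, g in zip(probs, gold)) / len(gold)


def combine_losses(task_losses, weights=None):
    """Weighted sum of per-task losses; equal weights when none are given.

    Assumption: "equal weighting" is modeled here as weight 1/n per task.
    """
    if weights is None:
        weights = {task: 1.0 / len(task_losses) for task in task_losses}
    return sum(weights[task] * loss for task, loss in task_losses.items())


# Toy predicted label distributions for a two-token sentence (made-up numbers).
ner_probs = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]
pos_probs = [[0.6, 0.4], [0.3, 0.7]]

losses = {
    "ner": cross_entropy(ner_probs, [0, 1]),
    "pos": cross_entropy(pos_probs, [0, 1]),
}

# Joint training objective under equal weighting of the NER and POS losses.
total = combine_losses(losses)
print(f"NER={losses['ner']:.4f} POS={losses['pos']:.4f} total={total:.4f}")
```

Passing an explicit `weights` dictionary to `combine_losses` gives a non-uniform combination, which is one way an alternative weighting strategy could be compared against the equal-weight baseline.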


