O-Dang! The Ontology of Dangerous Speech Messages

07/13/2022
by   Marco A. Stranisci, et al.
0

Inside the NLP community there is a considerable amount of language resources created, annotated and released every day with the aim of studying specific linguistic phenomena. Despite a variety of attempts in order to organize such resources has been carried on, a lack of systematic methods and of possible interoperability between resources are still present. Furthermore, when storing linguistic information, still nowadays, the most common practice is the concept of "gold standard", which is in contrast with recent trends in NLP that aim at stressing the importance of different subjectivities and points of view when training machine learning and deep learning methods. In this paper we present O-Dang!: The Ontology of Dangerous Speech Messages, a systematic and interoperable Knowledge Graph (KG) for the collection of linguistic annotated data. O-Dang! is designed to gather and organize Italian datasets into a structured KG, according to the principles shared within the Linguistic Linked Open Data community. The ontology has also been designed to account for a perspectivist approach, since it provides a model for encoding both gold standard and single-annotator labels in the KG. The paper is structured as follows. In Section 1 the motivations of our work are outlined. Section 2 describes the O-Dang! Ontology, that provides a common semantic model for the integration of datasets in the KG. The Ontology Population stage with information about corpora, users, and annotations is presented in Section 3. Finally, in Section 4 an analysis of offensiveness across corpora is provided as a first case study for the resource.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/26/2018

Ontology Matching Techniques: A Gold Standard Model

Typically an ontology matching technique is a combination of much differ...
research
03/17/2022

Dim Wihl Gat Tun: The Case for Linguistic Expertise in NLP for Underdocumented Languages

Recent progress in NLP is driven by pretrained models leveraging massive...
research
04/19/2023

A Survey of Corpora for Germanic Low-Resource Languages and Dialects

Despite much progress in recent years, the vast majority of work in natu...
research
03/17/2023

OntoMath^𝐏𝐑𝐎 2.0 Ontology: Updates of the Formal Model

This paper is devoted to the problems of ontology-based mathematical kno...
research
07/15/2023

Implementation of a Service-Oriented Architecture for a e-WALLET System for Cashless Transactions in the Democratic Republic of Congo

The Democratic Republic of Congo is a sleeping giant at the heart of Afr...
research
01/09/2022

Indian Language Wordnets and their Linkages with Princeton WordNet

Wordnets are rich lexico-semantic resources. Linked wordnets are extensi...
research
07/12/2021

EduCOR: An Educational and Career-Oriented Recommendation Ontology

With the increased dependence on online learning platforms and education...

Please sign up or login with your details

Forgot password? Click here to reset