Ontological Multidimensional Data Models and Contextual Data Qality

04/01/2017
by   Leopoldo Bertossi, et al.
0

Data quality assessment and data cleaning are context-dependent activities. Motivated by this observation, we propose the Ontological Multidimensional Data Model (OMD model), which can be used to model and represent contexts as logic-based ontologies. The data under assessment is mapped into the context, for additional analysis, processing, and quality data extraction. The resulting contexts allow for the representation of dimensions, and multidimensional data quality assessment becomes possible. At the core of a multidimensional context we include a generalized multidimensional data model and a Datalog+/- ontology with provably good properties in terms of query answering. These main components are used to represent dimension hierarchies, dimensional constraints, dimensional rules, and define predicates for quality data specification. Query answering relies upon and triggers navigation through dimension hierarchies, and becomes the basic tool for the extraction of quality data. The OMD model is interesting per se, beyond applications to data quality. It allows for a logic-based, and computationally tractable representation of multidimensional data, extending previous multidimensional data models with additional expressive power and functionalities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/22/2016

NdFluents: A Multi-dimensional Contexts Ontology

Annotating semantic data with metadata is becoming more and more importa...
research
09/03/2019

Online Analytical Processsing on Graph Data

Online Analytical Processing (OLAP) comprises tools and algorithms that ...
research
10/26/2019

Contextualization of Big Data Quality: A framework for comparison

With the advent of big data applications and the increasing amount of da...
research
10/04/2021

Internal Data Imputation in Data Warehouse Dimensions

Missing values occur commonly in the multidimensional data warehouses. T...
research
01/30/2022

ClassSPLOM – A Scatterplot Matrix to Visualize Separation of Multiclass Multidimensional Data

In multiclass classification of multidimensional data, the user wants to...
research
12/03/2020

A Novel index-based multidimensional data organization model that enhances the predictability of the machine learning algorithms

Learning from the multidimensional data has been an interesting concept ...

Please sign up or login with your details

Forgot password? Click here to reset