On the Use of Interpretable Machine Learning for the Management of Data Quality

07/29/2020
by   Anna Karanika, et al.
0

Data quality is a significant issue for any application that requests for analytics to support decision making. It becomes very important when we focus on Internet of Things (IoT) where numerous devices can interact to exchange and process data. IoT devices are connected to Edge Computing (EC) nodes to report the collected data, thus, we have to secure data quality not only at the IoT but also at the edge of the network. In this paper, we focus on the specific problem and propose the use of interpretable machine learning to deliver the features that are important to be based for any data processing activity. Our aim is to secure data quality, at least, for those features that are detected as significant in the collected datasets. We have to notice that the selected features depict the highest correlation with the remaining in every dataset, thus, they can be adopted for dimensionality reduction. We focus on multiple methodologies for having interpretability in our learning models and adopt an ensemble scheme for the final decision. Our scheme is capable of timely retrieving the final result and efficiently select the appropriate features. We evaluate our model through extensive simulations and present numerical results. Our aim is to reveal its performance under various experimental scenarios that we create varying a set of parameters adopted in our mechanism.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/25/2020

A Data Imputation Model based on an Ensemble Scheme

Edge Computing (EC) offers an infrastructure that acts as the mediator b...
research
08/12/2020

An Intelligent Edge-Centric Queries Allocation Scheme based on Ensemble Models

The combination of Internet of Things (IoT) and Edge Computing (EC) can ...
research
08/01/2020

Data Synopses Management based on a Deep Learning Model

Pervasive computing involves the placement of processing services close ...
research
07/24/2020

An Intelligent Scheme for Uncertainty Management of Data Synopses Management in Pervasive Computing Applications

Pervasive computing applications deal with the incorporation of intellig...
research
07/28/2020

An Ensemble Scheme for Proactive Data Allocation in Distributed Datasets

The advent of the Internet of Things (IoT) gives the opportunity to nume...
research
07/25/2020

Proactive Tasks Management based on a Deep Learning Model

Pervasive computing applications deal with intelligence surrounding user...
research
08/13/2020

Learnability and Robustness of Shallow Neural Networks Learned With a Performance-Driven BP and a Variant PSO For Edge Decision-Making

In many cases, the computing resources are limited without the benefit f...

Please sign up or login with your details

Forgot password? Click here to reset