Dataset Generation Patterns for Evaluating Knowledge Graph Construction

04/28/2021
by Markus Schröder, et al.

Confidentiality hinders the publication of authentic, labeled datasets of personal and enterprise data, although such datasets could be useful for evaluating knowledge graph construction approaches in industrial scenarios. We therefore plan to generate such data synthetically so that it appears as authentic as possible. Assuming that knowledge workers follow certain habits when they produce or manage data, generation patterns can be discovered and then used by data generators to imitate real datasets. In this paper, we derive an initial set of 11 distinct patterns found in real spreadsheets from industry and demonstrate a suitable generator, called Data Sprout, that is able to reproduce them. We describe how the generator produces spreadsheets in general and what altering effects the implemented patterns have.
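The general idea of generating clean, labeled data and then altering it with patterns can be sketched as follows. This is a minimal illustration under stated assumptions, not Data Sprout itself: the two patterns shown (abbreviated first names, two attributes merged into one cell) are hypothetical examples and are not claimed to be among the 11 patterns of the paper, and all function names, column names, and sample values are invented for the sketch.

```python
import random
from typing import Callable, Dict, List, Tuple

Row = Dict[str, str]
# A pattern takes rows plus a random source and returns altered rows.
Pattern = Callable[[List[Row], random.Random], List[Row]]


def generate_clean_rows(n: int, rng: random.Random) -> List[Row]:
    """Produce the clean, labeled data (hypothetical sample values)."""
    first = ["Anna", "Ben", "Carla", "David"]
    last = ["Meyer", "Schmidt", "Weber", "Fischer"]
    departments = ["Sales", "Research", "IT"]
    return [
        {
            "first_name": rng.choice(first),
            "last_name": rng.choice(last),
            "department": rng.choice(departments),
        }
        for _ in range(n)
    ]


def abbreviate_first_names(rows: List[Row], rng: random.Random) -> List[Row]:
    """Hypothetical pattern: some first names are shortened to an initial."""
    altered = []
    for row in rows:
        row = dict(row)  # copy, so the clean data stays untouched
        if rng.random() < 0.5:
            row["first_name"] = row["first_name"][0] + "."
        altered.append(row)
    return altered


def merge_name_columns(rows: List[Row], rng: random.Random) -> List[Row]:
    """Hypothetical pattern: two attributes are squeezed into one cell."""
    return [
        {
            "name": f'{row["last_name"]}, {row["first_name"]}',
            "department": row["department"],
        }
        for row in rows
    ]


def generate_sheet(
    n: int, patterns: List[Pattern], seed: int = 0
) -> Tuple[List[Row], List[Row]]:
    """Return (clean rows, messy rows); the clean rows act as labels."""
    rng = random.Random(seed)
    clean = generate_clean_rows(n, rng)
    messy = clean
    for pattern in patterns:
        messy = pattern(messy, rng)
    return clean, messy


if __name__ == "__main__":
    clean, messy = generate_sheet(
        5, [abbreviate_first_names, merge_name_columns], seed=42
    )
    for c, m in zip(clean, messy):
        print(c, "->", m)
```

Because the clean rows are kept alongside the altered ones, an evaluation can compare what a knowledge graph construction approach recovers from the messy sheet against the known clean values. Fixing the seed makes the generated sheet reproducible.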


