PyGraft: Configurable Generation of Schemas and Knowledge Graphs at Your Fingertips

09/07/2023
by Nicolas Hubert, et al.

Knowledge graphs (KGs) have emerged as a prominent data representation and management paradigm. Because they are usually underpinned by a schema (e.g., an ontology), KGs capture not only factual information but also contextual knowledge. In some tasks, a few KGs have established themselves as standard benchmarks. However, recent work points out that relying on a limited collection of datasets is not sufficient to assess the generalization capability of an approach. In data-sensitive fields such as education or medicine, access to public datasets is even more limited. To remedy these issues, we release PyGraft, a Python-based tool that generates highly customized, domain-agnostic schemas and knowledge graphs. The synthesized schemas encompass various RDFS and OWL constructs, while the synthesized KGs emulate the characteristics and scale of real-world KGs. Logical consistency of the generated resources is ultimately ensured by running a description logic (DL) reasoner. By generating both a schema and a KG in a single pipeline, PyGraft aims to enable the creation of a more diverse array of KGs for benchmarking novel approaches in areas such as graph-based machine learning (ML) or, more generally, KG processing. In graph-based ML in particular, this should foster a more holistic evaluation of model performance and generalization capability, going beyond the limited collection of available benchmarks. PyGraft is available at: https://github.com/nicolas-hbt/pygraft.
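
To make the schema-then-KG-then-consistency-check pipeline concrete, here is a minimal standard-library sketch. It is not PyGraft's API: the functions `ancestors`, `generate_schema`, `generate_kg`, and `is_consistent` are hypothetical names introduced for illustration. The toy schema has only a class tree, relations with a domain and range, and disjoint sibling classes, and the check is a hand-written stand-in for a DL reasoner that covers just these three constructs.

```python
import random

def ancestors(cls, parent):
    """Return cls together with all of its superclasses."""
    out = {cls}
    while cls in parent:
        cls = parent[cls]
        out.add(cls)
    return out

def generate_schema(n_classes, n_relations, rng):
    """Toy schema: a class tree, typed relations, disjoint sibling classes."""
    classes = [f"C{i}" for i in range(n_classes)]
    # Every class except the root C0 gets a parent with a smaller index.
    parent = {classes[i]: classes[rng.randrange(i)] for i in range(1, n_classes)}
    # Each relation maps to a (domain, range) pair of classes.
    relations = {f"r{j}": (rng.choice(classes), rng.choice(classes))
                 for j in range(n_relations)}
    # Siblings share a parent, so neither is a superclass of the other.
    disjoint = [(a, b) for a in classes for b in classes
                if a < b and a in parent and parent[a] == parent.get(b)]
    return {"classes": classes, "parent": parent,
            "relations": relations, "disjoint": disjoint}

def generate_kg(schema, n_entities, n_triples, rng):
    """Sample typed entities and triples that respect domain and range."""
    types = {f"e{i}": rng.choice(schema["classes"]) for i in range(n_entities)}
    closure = {e: ancestors(c, schema["parent"]) for e, c in types.items()}
    triples = set()
    for _ in range(n_triples * 20):  # bounded number of sampling attempts
        if len(triples) >= n_triples:
            break
        rel, (dom, ran) = rng.choice(list(schema["relations"].items()))
        heads = [e for e in types if dom in closure[e]]
        tails = [e for e in types if ran in closure[e]]
        if heads and tails:
            triples.add((rng.choice(heads), rel, rng.choice(tails)))
    return types, sorted(triples)

def is_consistent(schema, types, triples):
    """Infer types from domain/range, then reject any disjointness clash."""
    inferred = {e: ancestors(c, schema["parent"]) for e, c in types.items()}
    for head, rel, tail in triples:
        dom, ran = schema["relations"][rel]
        inferred.setdefault(head, set()).update(ancestors(dom, schema["parent"]))
        inferred.setdefault(tail, set()).update(ancestors(ran, schema["parent"]))
    return not any(a in ts and b in ts
                   for ts in inferred.values() for a, b in schema["disjoint"])

rng = random.Random(0)  # fixed seed for reproducibility
schema = generate_schema(n_classes=6, n_relations=4, rng=rng)
types, triples = generate_kg(schema, n_entities=20, n_triples=30, rng=rng)
print(is_consistent(schema, types, triples))
```

Because this sketch samples heads and tails only from entities that already satisfy the domain and range, the generated triples should pass the check by construction. A real pipeline with richer OWL constructs would still need an actual reasoner, as the abstract describes.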
