TUDataset: A collection of benchmark datasets for learning with graphs

07/16/2020
by   Christopher Morris, et al.
25

Recently, there has been an increasing interest in (supervised) learning with graph data, especially using graph neural networks. However, the development of meaningful benchmark datasets and standardized evaluation procedures is lagging, consequently hindering advancements in this area. To address this, we introduce the TUDataset for graph classification and regression. The collection consists of over 120 datasets of varying sizes from a wide range of applications. We provide Python-based data loaders, kernel and graph neural network baseline implementations, and evaluation tools. Here, we give an overview of the datasets, standardized evaluation procedures, and provide baseline experiments. All datasets are available at www.graphlearning.io. The experiments are fully reproducible from the code available at www.github.com/chrsmrrs/tudataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/08/2018

A simple yet effective baseline for non-attribute graph classification

Graphs are complex objects that do not lend themselves easily to typical...
research
06/22/2020

Graph Neural Networks in TensorFlow and Keras with Spektral

In this paper we present Spektral, an open-source Python library for bui...
research
07/01/2021

Shared Data and Algorithms for Deep Learning in Fundamental Physics

We introduce a collection of datasets from fundamental physics research ...
research
03/15/2022

PDNS-Net: A Large Heterogeneous Graph Benchmark Dataset of Network Resolutions for Graph Learning

In order to advance the state of the art in graph learning algorithms, i...
research
01/09/2023

PatentsView-Evaluation: Evaluation Datasets and Tools to Advance Research on Inventor Name Disambiguation

We present PatentsView-Evaluation, a Python package that enables researc...
research
06/09/2023

NeuroGraph: Benchmarks for Graph Machine Learning in Brain Connectomics

Machine learning provides a valuable tool for analyzing high-dimensional...
research
01/17/2023

Simplistic Collection and Labeling Practices Limit the Utility of Benchmark Datasets for Twitter Bot Detection

Accurate bot detection is necessary for the safety and integrity of onli...

Please sign up or login with your details

Forgot password? Click here to reset