TinyGenius: Intertwining Natural Language Processing with Microtask Crowdsourcing for Scholarly Knowledge Graph Creation

05/09/2022
by Allard Oelen, et al.

As the number of published scholarly articles grows steadily each year, new methods are needed to organize scholarly knowledge so that it can be discovered and used more efficiently. Natural Language Processing (NLP) techniques can autonomously process scholarly articles at scale and create machine-readable representations of the article content. However, autonomous NLP methods are not nearly accurate enough to create a high-quality knowledge graph, and quality is crucial for the graph to be useful in practice. We present TinyGenius, a methodology for validating NLP-extracted scholarly knowledge statements using crowdsourced microtasks. The scholarly context in which crowd workers operate poses multiple challenges. Explainability of the employed NLP methods is crucial because it gives crowd workers the context they need to support their decisions. We employed TinyGenius to populate a paper-centric knowledge graph, using five distinct NLP methods. The resulting knowledge graph serves as a digital library for scholarly articles.
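
To make the validation idea concrete, the following is a minimal sketch of how crowd votes on a single NLP-extracted statement might be aggregated into a decision. It is not the TinyGenius implementation: the `Statement` schema, the vote labels, and the `min_votes` and `threshold` parameters are all assumptions chosen for illustration, and simple thresholded voting stands in for whatever aggregation the authors actually use.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Statement:
    """An NLP-extracted statement awaiting crowd validation (hypothetical schema)."""
    subject: str
    predicate: str
    obj: str
    # Each vote is the string "correct" or "incorrect" (assumed labels).
    votes: list = field(default_factory=list)


def aggregate(statement, min_votes=3, threshold=0.6):
    """Return 'accepted', 'rejected', or 'undecided' from the collected votes.

    min_votes and threshold are illustrative defaults, not values from the paper.
    """
    total = len(statement.votes)
    if total < min_votes:
        # Too few microtask answers to decide either way.
        return "undecided"
    counts = Counter(statement.votes)
    share_correct = counts["correct"] / total
    if share_correct >= threshold:
        return "accepted"
    if (1 - share_correct) >= threshold:
        return "rejected"
    return "undecided"


if __name__ == "__main__":
    s = Statement("paper:1", "mentions", "knowledge graph",
                  votes=["correct", "correct", "incorrect"])
    print(aggregate(s))
```

In this sketch a statement only enters the graph as validated once enough workers have answered and a clear majority agrees; statements without a clear majority stay undecided and could be shown to further workers.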


