Domain-independent Extraction of Scientific Concepts from Research Articles

01/09/2020
by   Arthur Brack, et al.
0

We examine the novel task of domain-independent scientific concept extraction from abstracts of scholarly articles and present two contributions. First, we suggest a set of generic scientific concepts that have been identified in a systematic annotation process. This set of concepts is utilised to annotate a corpus of scientific abstracts from 10 domains of Science, Technology and Medicine at the phrasal level in a joint effort with domain experts. The resulting dataset is used in a set of benchmark experiments to (a) provide baseline performance for this task, (b) examine the transferability of concepts between domains. Second, we present two deep learning systems as baselines. In particular, we propose active learning to deal with different domains in our task. The experimental results show that (1) a substantial agreement is achievable by non-experts after consultation with domain experts, (2) the baseline system achieves a fairly high F1 score, (3) active learning enables us to nearly halve the amount of required training data.

READ FULL TEXT

page 8

page 10

research
10/06/2017

Unsupervised Extraction of Representative Concepts from Scientific Literature

This paper studies the automated categorization and extraction of scient...
research
04/01/2022

Efficient Argument Structure Extraction with Transfer Learning and Active Learning

The automation of extracting argument structures faces a pair of challen...
research
04/08/2022

CrudeOilNews: An Annotated Crude Oil News Corpus for Event Extraction

In this paper, we present CrudeOilNews, a corpus of English Crude Oil ne...
research
06/04/2020

The SOFC-Exp Corpus and Neural Approaches to Information Extraction in the Materials Science Domain

This paper presents a new challenging information extraction task in the...
research
01/29/2018

Improving Active Learning in Systematic Reviews

Systematic reviews are essential to summarizing the results of different...
research
01/04/2021

Coreference Resolution in Research Papers from Multiple Domains

Coreference resolution is essential for automatic text understanding to ...

Please sign up or login with your details

Forgot password? Click here to reset