Hierarchical Classification of Research Fields in the "Web of Science" Using Deep Learning

02/01/2023
by   Susie Xi Rao, et al.
0

This paper presents a hierarchical classification system that automatically categorizes a scholarly publication using its abstract into a three-tier hierarchical label set (discipline, field, subfield) in a multi-class setting. This system enables a holistic categorization of research activities in the mentioned hierarchy in terms of knowledge production through articles and impact through citations, permitting those activities to fall into multiple categories. The classification system distinguishes 44 disciplines, 718 fields and 1,485 subfields among 160 million abstract snippets in Microsoft Academic Graph (version 2018-05-17). We used batch training in a modularized and distributed fashion to address and allow for interdisciplinary and interfield classifications in single-label and multi-label settings. In total, we have conducted 3,140 experiments in all considered models (Convolutional Neural Networks, Recurrent Neural Networks, Transformers). The classification accuracy is > 90 classifications, respectively. We examine the advantages of our classification by its ability to better align research texts and output with disciplines, to adequately classify them in an automated way, and to capture the degree of interdisciplinarity. The proposed system (a set of pre-trained models) can serve as a backbone to an interactive system for indexing scientific publications in the future.

READ FULL TEXT
research
04/02/2022

SciNoBo : A Hierarchical Multi-Label Classifier of Scientific Publications

Classifying scientific publications according to Field-of-Science (FoS) ...
research
10/15/2021

Improving overlay maps of science: combining overview and detail

Overlay maps of science are global base maps over which subsets of publi...
research
03/21/2022

Academic Resource Text Level Multi-label Classification based on Attention

Hierarchical multi-label academic text classification (HMTC) is to assig...
research
04/26/2021

Semantic Analysis for Automated Evaluation of the Potential Impact of Research Articles

Can the analysis of the semantics of words used in the text of a scienti...
research
10/10/2019

Multi-label Categorization of Accounts of Sexism using a Neural Framework

Sexism, an injustice that subjects women and girls to enormous suffering...
research
12/04/2021

Label Hierarchy Transition: Modeling Class Hierarchies to Enhance Deep Classifiers

Hierarchical classification aims to sort the object into a hierarchy of ...
research
07/08/2018

Automated labeling of bugs and tickets using attention-based mechanisms in recurrent neural networks

We explore solutions for automated labeling of content in bug trackers a...

Please sign up or login with your details

Forgot password? Click here to reset