Academic Resource Text Level Multi-label Classification based on Attention

by Yue Wang, et al.

Hierarchical multi-label academic text classification (HMTC) assigns academic texts to categories in a hierarchically structured label system. We propose an attention-based hierarchical multi-label classification algorithm for academic texts (AHMCA) that integrates text, keyword, and hierarchical-structure features to classify academic documents into their most relevant categories. We use word2vec and BiLSTM to obtain embeddings and latent vector representations of the text, keywords, and hierarchies. A hierarchical attention mechanism then captures the associations between keywords, label hierarchies, and text word vectors, generating hierarchy-specific document embedding vectors that replace the original text embeddings in HMCN-F. Experimental results on an academic text dataset demonstrate the effectiveness of the AHMCA algorithm.
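To make the idea of hierarchy-specific document embeddings more concrete, the following is a minimal sketch and not the AHMCA implementation. It assumes scaled dot-product attention, which the abstract does not specify, and it stands in toy 2-dimensional vectors for the word2vec/BiLSTM outputs. The names `level_specific_embedding`, `word_vecs`, and `level_queries` are hypothetical and chosen only for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def level_specific_embedding(word_vecs, query_vec):
    """Attention-pool word vectors with one query vector per hierarchy level.

    Assumption: scaled dot-product scoring. The query vector is a stand-in
    for whatever keyword/label representation a real model would learn.
    """
    scale = math.sqrt(len(query_vec))
    weights = softmax([dot(w, query_vec) / scale for w in word_vecs])
    dim = len(word_vecs[0])
    pooled = [
        sum(wt * w[d] for wt, w in zip(weights, word_vecs))
        for d in range(dim)
    ]
    return pooled, weights


# Toy stand-ins for per-word latent vectors (e.g. BiLSTM outputs).
word_vecs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# One hypothetical query vector per level of the label hierarchy.
level_queries = {"level_1": [2.0, 0.0], "level_2": [0.0, 2.0]}

# One document embedding per level, each emphasizing different words.
doc_embeddings = {
    name: level_specific_embedding(word_vecs, query)[0]
    for name, query in level_queries.items()
}
```

In this sketch each level attends to the same words with different weights, so the same document yields a different embedding per level. In a model following the abstract, those per-level vectors would take the place of the single text embedding fed to HMCN-F.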



HAXMLNet: Hierarchical Attention Network for Extreme Multi-Label Text Classification

Extreme multi-label text classification (XMTC) addresses the problem of ...

Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification

We propose a novel model for multi-label text classification, which is b...

Hierarchical Classification of Research Fields in the "Web of Science" Using Deep Learning

This paper presents a hierarchical classification system that automatica...

Hawk: An Industrial-strength Multi-label Document Classifier

There are a plethora of methods and algorithms that solve the classical ...

Decentralized creation of academic documents using a Network Attached Storage (NAS) server

Scholarly document creation continues to face various obstacles. Scholar...
