Joint Keyphrase Chunking and Salience Ranking with BERT

04/28/2020
by   Si Sun, et al.
0

An effective keyphrase extraction system requires to produce self-contained high quality phrases that are also key to the document topic. This paper presents BERT-JointKPE, a multi-task BERT-based model for keyphrase extraction. JointKPE employs a chunking network to identify high-quality phrases and a ranking network to learn their salience in the document. The model is trained jointly on the chunking task and the ranking task, balancing the estimation of keyphrase quality and salience. Experiments on two benchmarks demonstrate JointKPE's robust effectiveness with different BERT variants. Our analyses show that JointKPE has advantages in predicting long keyphrases and extracting phrases that are not entities but also meaningful. The source code of this paper can be obtained from https://github.com/thunlp/BERT-KPE

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/17/2022

Topic Aware Contextualized Embeddings for High Quality Phrase Extraction

Keyphrase extraction from a given document is the task of automatically ...
research
04/17/2020

Learning-to-Rank with BERT in TF-Ranking

This paper describes a machine learning algorithm for document (re)ranki...
research
04/25/2022

Groupwise Query Performance Prediction with BERT

While large-scale pre-trained language models like BERT have advanced th...
research
05/12/2021

UIUC_BioNLP at SemEval-2021 Task 11: A Cascade of Neural Models for Structuring Scholarly NLP Contributions

We propose a cascade of neural models that performs sentence classificat...
research
08/20/2020

PARADE: Passage Representation Aggregation for Document Reranking

We present PARADE, an end-to-end Transformer-based model that considers ...
research
05/31/2019

Improving Open Information Extraction via Iterative Rank-Aware Learning

Open information extraction (IE) is the task of extracting open-domain a...
research
11/07/2016

Keyphrase Annotation with Graph Co-Ranking

Keyphrase annotation is the task of identifying textual units that repre...

Please sign up or login with your details

Forgot password? Click here to reset