Improving Neural Named Entity Recognition with Gazetteers

03/06/2020
by Chan Hee Song, et al.

The goal of this work is to improve the performance of a neural named entity recognition (NER) system by adding input features that indicate whether a word is part of a name included in a gazetteer. This article describes how to generate gazetteers from the Wikidata knowledge graph and how to integrate that information into a neural NER system. Experiments show that the approach yields performance gains in two distinct languages: English, a high-resource, word-based language, and Chinese, a high-resource, character-based language. Experiments were also performed in a low-resource language, Russian, on a newly annotated Russian NER corpus from Reddit tagged with four core types and twelve extended types; this article reports a baseline score for that corpus. It is a longer version of a paper presented at the 33rd FLAIRS conference (Song et al. 2020).
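The idea of gazetteer input features can be illustrated with a minimal sketch. This is not the authors' implementation: the toy `GAZETTEER` dictionary, the `gazetteer_features` function, the lowercase exact-match lookup, and the `max_len` cutoff are all assumptions made for illustration. The sketch marks each token with one binary flag per entity type, set when the token falls inside a span that matches a gazetteer name; in a neural tagger such a vector would typically be concatenated with the token's embedding.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy gazetteer: entity type -> set of lowercased token tuples.
# A real gazetteer (e.g. one derived from Wikidata) would be far larger.
GAZETTEER: Dict[str, Set[Tuple[str, ...]]] = {
    "PER": {("barack", "obama")},
    "LOC": {("new", "york"), ("paris",)},
}


def gazetteer_features(
    tokens: List[str],
    gazetteer: Dict[str, Set[Tuple[str, ...]]],
    max_len: int = 4,
) -> List[List[int]]:
    """Return one binary vector per token, with one column per entity type.

    Columns follow the alphabetical order of the entity types. A column is 1
    when the token lies inside any span of up to `max_len` tokens that exactly
    matches a gazetteer name of that type (case-insensitive).
    """
    types = sorted(gazetteer)
    feats = [[0] * len(types) for _ in tokens]
    lowered = [t.lower() for t in tokens]

    for col, etype in enumerate(types):
        names = gazetteer[etype]
        for start in range(len(lowered)):
            last_end = min(start + max_len, len(lowered))
            for end in range(start + 1, last_end + 1):
                if tuple(lowered[start:end]) in names:
                    # Flag every token covered by the matching span.
                    for i in range(start, end):
                        feats[i][col] = 1
    return feats


tokens = ["Barack", "Obama", "visited", "New", "York"]
features = gazetteer_features(tokens, GAZETTEER)
for tok, vec in zip(tokens, features):
    print(tok, vec)
```

Exact matching is the simplest choice here; it says nothing about how names are filtered, normalized, or encoded in the system the abstract describes.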

research
06/04/2019

Back Attention Knowledge Transfer for Low-resource Named Entity Recognition

In recent years, great success has been achieved in the field of natural...
research
05/04/2020

Soft Gazetteers for Low-Resource Named Entity Recognition

Traditional named entity recognition models use gazetteers (lists of ent...
research
09/13/2018

On the Strength of Character Language Models for Multilingual Named Entity Recognition

Character-level patterns have been widely used as features in English Na...
research
09/15/2021

Low-Resource Named Entity Recognition Based on Multi-hop Dependency Trigger

This paper presents a simple and effective approach in low-resource name...
research
12/09/2022

AUC Maximization for Low-Resource Named Entity Recognition

Current work in named entity recognition (NER) uses either cross entropy...
research
07/16/2020

SLK-NER: Exploiting Second-order Lexicon Knowledge for Chinese NER

Although character-based models using lexicon have achieved promising re...
research
08/22/2018

Neural Named Entity Recognition from Subword Units

Named entity recognition (NER) is a vital task in language technology. E...
