Rethinking the Value of Gazetteer in Chinese Named Entity Recognition

07/06/2022
by   Qianglong Chen, et al.
0

Gazetteer is widely used in Chinese named entity recognition (NER) to enhance span boundary detection and type classification. However, to further understand the generalizability and effectiveness of gazetteers, the NLP community still lacks a systematic analysis of the gazetteer-enhanced NER model. In this paper, we first re-examine the effectiveness several common practices of the gazetteer-enhanced NER models and carry out a series of detailed analysis to evaluate the relationship between the model performance and the gazetteer characteristics, which can guide us to build a more suitable gazetteer. The findings of this paper are as follows: (1) the gazetteer improves most of the situations that the traditional NER model datasets are difficult to learn. (2) the performance of model greatly benefits from the high-quality pre-trained lexeme embeddings. (3) a good gazetteer should cover more entities that can be matched in both the training set and testing set.

READ FULL TEXT
research
09/16/2021

MFE-NER: Multi-feature Fusion Embedding for Chinese Named Entity Recognition

Pre-trained language models lead Named Entity Recognition (NER) into a n...
research
05/01/2020

Partially-Typed NER Datasets Integration: Connecting Practice to Theory

While typical named entity recognition (NER) models require the training...
research
04/29/2022

What do we Really Know about State of the Art NER?

Named Entity Recognition (NER) is a well researched NLP task and is wide...
research
12/19/2022

Statistical Dataset Evaluation: Reliability, Difficulty, and Validity

Datasets serve as crucial training resources and model performance track...
research
07/25/2023

Embedding Models for Supervised Automatic Extraction and Classification of Named Entities in Scientific Acknowledgements

Acknowledgments in scientific papers may give an insight into aspects of...
research
11/04/2022

Unintended Memorization and Timing Attacks in Named Entity Recognition Models

Named entity recognition models (NER), are widely used for identifying n...
research
04/26/2022

Boundary Smoothing for Named Entity Recognition

Neural named entity recognition (NER) models may easily encounter the ov...

Please sign up or login with your details

Forgot password? Click here to reset