Same but Different: Distant Supervision for Predicting and Understanding Entity Linking Difficulty

12/13/2018
by   Renato Stoffalette João, et al.
0

Entity Linking (EL) is the task of automatically identifying entity mentions in a piece of text and resolving them to a corresponding entity in a reference knowledge base like Wikipedia. There is a large number of EL tools available for different types of documents and domains, yet EL remains a challenging task where the lack of precision on particularly ambiguous mentions often spoils the usefulness of automated disambiguation results in real applications. A priori approximations of the difficulty to link a particular entity mention can facilitate flagging of critical cases as part of semi-automated EL systems, while detecting latent factors that affect the EL performance, like corpus-specific features, can provide insights on how to improve a system based on the special characteristics of the underlying corpus. In this paper, we first introduce a consensus-based method to generate difficulty labels for entity mentions on arbitrary corpora. The difficulty labels are then exploited as training data for a supervised classification task able to predict the EL difficulty of entity mentions using a variety of features. Experiments over a corpus of news articles show that EL difficulty can be estimated with high accuracy, revealing also latent features that affect EL performance. Finally, evaluation results demonstrate the effectiveness of the proposed method to inform semi-automated EL pipelines.

READ FULL TEXT
research
01/14/2021

Better Together – An Ensemble Learner for Combining the Results of Ready-made Entity Linking Systems

Entity linking (EL) is the task of automatically identifying entity ment...
research
05/25/2023

Learn to Not Link: Exploring NIL Prediction in Entity Linking

Entity linking models have achieved significant success via utilizing pr...
research
06/25/2017

Automatic Synonym Discovery with Knowledge Bases

Recognizing entity synonyms from text has become a crucial task in many ...
research
05/09/2022

BLINK with Elasticsearch for Efficient Entity Linking in Business Conversations

An Entity Linking system aligns the textual mentions of entities in a te...
research
07/12/2022

Effective Few-Shot Named Entity Linking by Meta-Learning

Entity linking aims to link ambiguous mentions to their corresponding en...
research
08/07/2017

Corpus-level Fine-grained Entity Typing

This paper addresses the problem of corpus-level entity typing, i.e., in...
research
08/08/2022

Learning Entity Linking Features for Emerging Entities

Entity linking (EL) is the process of linking entity mentions appearing ...

Please sign up or login with your details

Forgot password? Click here to reset