Biomedical Entity Linking with Triple-aware Pre-Training

08/28/2023
by   Xi Yan, et al.
0

Linking biomedical entities is an essential aspect in biomedical natural language processing tasks, such as text mining and question answering. However, a difficulty of linking the biomedical entities using current large language models (LLM) trained on a general corpus is that biomedical entities are scarcely distributed in texts and therefore have been rarely seen during training by the LLM. At the same time, those LLMs are not aware of high level semantic connection between different biomedical entities, which are useful in identifying similar concepts in different textual contexts. To cope with aforementioned problems, some recent works focused on injecting knowledge graph information into LLMs. However, former methods either ignore the relational knowledge of the entities or lead to catastrophic forgetting. Therefore, we propose a novel framework to pre-train the powerful generative LLM by a corpus synthesized from a KG. In the evaluations we are unable to confirm the benefit of including synonym, description or relational information.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/08/2022

RuBioRoBERTa: a pre-trained biomedical language model for Russian language biomedical text mining

This paper presents several BERT-based models for Russian language biome...
research
07/03/2023

Exploring the In-context Learning Ability of Large Language Model for Biomedical Concept Linking

The biomedical field relies heavily on concept linking in various areas ...
research
10/22/2020

Self-alignment Pre-training for Biomedical Entity Representations

Despite the widespread success of self-supervised learning via masked la...
research
04/11/2022

Generative Biomedical Entity Linking via Knowledge Base-Guided Pre-training and Synonyms-Aware Fine-tuning

Entities lie in the heart of biomedical natural language understanding, ...
research
05/24/2023

Injecting Knowledge into Biomedical Pre-trained Models via Polymorphism and Synonymous Substitution

Pre-trained language models (PLMs) were considered to be able to store r...
research
11/20/2022

Embracing Ambiguity: Improving Similarity-oriented Tasks with Contextual Synonym Knowledge

Contextual synonym knowledge is crucial for those similarity-oriented ta...
research
10/07/2020

COMETA: A Corpus for Medical Entity Linking in the Social Media

Whilst there has been growing progress in Entity Linking (EL) for genera...

Please sign up or login with your details

Forgot password? Click here to reset