Chinese Word Sense Embedding with SememeWSD and Synonym Set

06/29/2022
by Yangxi Zhou, et al.

Word embedding is a fundamental natural language processing task that learns feature representations of words. However, most word embedding methods assign only one vector to each word, even though polysemous words have multiple senses. To address this limitation, we propose the SememeWSD Synonym (SWSDS) model, which assigns a different vector to every sense of a polysemous word with the help of word sense disambiguation (WSD) and the synonym sets in OpenHowNet. We use SememeWSD, an unsupervised word sense disambiguation model based on OpenHowNet, to disambiguate each polysemous word and annotate it with a sense id. We then obtain the top 10 synonyms of the word sense from OpenHowNet and take the average of the synonym vectors as the vector of the word sense. In experiments, we evaluate the SWSDS model on semantic similarity calculation with Gensim's wmdistance method, where it improves accuracy. We also run the SememeWSD model on different BERT models to find the more effective model.
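
The synonym-averaging step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `sense_vector`, the toy two-dimensional embeddings, the sense ids, and the rule of skipping synonyms that have no embedding are all assumptions made for illustration. In the paper, the synonym lists would come from OpenHowNet and the sense id from SememeWSD; here both are hard-coded.

```python
from typing import Dict, List, Sequence


def sense_vector(
    synonyms: Sequence[str],
    embeddings: Dict[str, List[float]],
    top_k: int = 10,
) -> List[float]:
    """Average the embeddings of the top_k synonyms of one word sense.

    Assumption: synonyms without an embedding are skipped; the abstract
    does not say how out-of-vocabulary synonyms are handled.
    """
    vectors = [embeddings[w] for w in synonyms[:top_k] if w in embeddings]
    if not vectors:
        raise ValueError("none of the synonyms has an embedding")
    dim = len(vectors[0])
    # Component-wise mean over the collected synonym vectors.
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


# Toy word vectors (hypothetical values, 2 dimensions for readability).
toy_embeddings = {
    "水果": [1.0, 0.0],
    "梨": [0.8, 0.2],
    "手机": [0.0, 1.0],
    "电脑": [0.2, 0.8],
}

# Hypothetical synonym lists for two senses of the polysemous word "苹果".
# The sense ids are made up; "未登录词" has no embedding and is skipped.
toy_synonyms = {
    "苹果#fruit": ["水果", "梨", "未登录词"],
    "苹果#brand": ["手机", "电脑"],
}

# One vector per sense instead of one vector per word.
sense_vectors = {
    sense_id: sense_vector(words, toy_embeddings)
    for sense_id, words in toy_synonyms.items()
}
```

With the sense-annotated tokens and their vectors in place, sentence similarity could then be computed with a word mover's distance routine such as the Gensim method named in the abstract.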

research
06/01/2020

Attention Word Embedding

Word embedding models learn semantically rich vector representations of ...
research
03/29/2022

An Evaluation Dataset for Legal Word Embedding: A Case Study On Chinese Codex

Word embedding is a modern distributed word representations approach wid...
research
04/02/2019

Using Multi-Sense Vector Embeddings for Reverse Dictionaries

Popular word embedding methods such as word2vec and GloVe assign a singl...
research
11/11/2019

Word Sense Disambiguation using Knowledge-based Word Similarity

In natural language processing, word-sense disambiguation (WSD) is an op...
research
07/29/2016

A Novel Bilingual Word Embedding Method for Lexical Translation Using Bilingual Sense Clique

Most of the existing methods for bilingual word embedding only consider ...
research
02/22/2017

One Representation per Word - Does it make Sense for Composition?

In this paper, we investigate whether an a priori disambiguation of word...
research
06/09/2023

Word sense extension

Humans often make creative use of words to express novel senses. A long-...
