A Joint Model for Word Embedding and Word Morphology

06/08/2016
by   Kris Cao, et al.
0

This paper presents a joint model for performing unsupervised morphological analysis on words, and learning a character-level composition function from morphemes to word embeddings. Our model splits individual words into segments, and weights each segment according to its ability to predict context words. Our morphological analysis is comparable to dedicated morphological analyzers at the task of morpheme boundary recovery, and also performs better than word-based embedding models at the task of syntactic analogy answering. Finally, we show that incorporating morphology explicitly into character-level models help them produce embeddings for unseen words which correlate better with human judgments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2019

Chinese Embedding via Stroke and Glyph Information: A Dual-channel View

Recent studies have consistently given positive hints that morphology is...
research
01/04/2017

Joint Semantic Synthesis and Morphological Analysis of the Derived Word

Much like sentences are composed of words, words themselves are composed...
research
08/22/2018

A Characterwise Windowed Approach to Hebrew Morphological Segmentation

This paper presents a novel approach to the segmentation of orthographic...
research
03/11/2021

Evaluation of Morphological Embeddings for the Russian Language

A number of morphology-based word embedding models were introduced in re...
research
04/26/2017

From Characters to Words to in Between: Do We Capture Morphology?

Words can be represented by composing the representations of subword uni...
research
03/08/2015

An Unsupervised Method for Uncovering Morphological Chains

Most state-of-the-art systems today produce morphological analysis based...
research
11/12/2019

Morphological Segmentation Inside-Out

Morphological segmentation has traditionally been modeled with non-hiera...

Please sign up or login with your details

Forgot password? Click here to reset