Indicatements that character language models learn English morpho-syntactic units and regularities

08/31/2018
by Yova Kementchedjhieva, et al.

Character language models have access to surface morphological patterns, but it is not clear whether or how they learn abstract morphological regularities. We instrument a character language model with several probes, finding that it can develop a specific unit to identify word boundaries and, by extension, morpheme boundaries, which allows it to capture linguistic properties and regularities of these units. Our language model proves surprisingly good at identifying the selectional restrictions of English derivational morphemes, a task that requires both morphological and syntactic awareness. Thus we conclude that, when morphemes overlap extensively with the words of a language, a character language model can perform morphological abstraction.
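As a rough illustration of what probing for a boundary-tracking unit can look like, the sketch below scores each hidden unit by how strongly its activation correlates with word boundaries (spaces) and reports the best one. It is not the authors' procedure: the correlation-based scoring, the function names, and the synthetic activations produced by `toy_activations` are all assumptions made for this example. In practice the activations would be read out of a trained character language model.

```python
import math
import random


def boundary_labels(text):
    """Return 1.0 at positions where the character is a space, else 0.0."""
    return [1.0 if ch == " " else 0.0 for ch in text]


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def most_boundary_like_unit(activations, labels):
    """Score every unit against the boundary labels.

    activations: one list of unit activations per character position.
    Returns (index of the unit with the largest absolute correlation, all scores).
    """
    n_units = len(activations[0])
    scores = [
        pearson([step[u] for step in activations], labels)
        for u in range(n_units)
    ]
    best = max(range(n_units), key=lambda u: abs(scores[u]))
    return best, scores


def toy_activations(text, n_units=8, boundary_unit=3, seed=0):
    """Hypothetical stand-in for real hidden states.

    All units are random noise except `boundary_unit`, which is made to
    fire high on spaces and low elsewhere. This is synthetic data, not
    output from any trained model.
    """
    rng = random.Random(seed)
    steps = []
    for ch in text:
        step = [rng.uniform(-1.0, 1.0) for _ in range(n_units)]
        step[boundary_unit] = (0.9 if ch == " " else -0.9) + rng.uniform(-0.1, 0.1)
        steps.append(step)
    return steps


text = "the teacher was teaching the readers to read"
labels = boundary_labels(text)
acts = toy_activations(text)
best_unit, scores = most_boundary_like_unit(acts, labels)
print(f"unit {best_unit} tracks boundaries best (r = {scores[best_unit]:.2f})")
```

Using the absolute correlation means a unit that switches off at boundaries would be found as readily as one that switches on. A real analysis would also need held-out text to check that the selected unit generalizes.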
