Long-Range Correlation Underlying Childhood Language and Generative Models

12/11/2017
by Kumiko Tanaka-Ishii, et al.

Long-range correlation, a property of time series exhibiting long-term memory, is mainly studied in the statistical physics domain and has been reported to exist in natural language. Using a state-of-the-art method for such analysis, long-range correlation is first shown to occur in long CHILDES data sets. To understand why, Bayesian generative models of language, originally proposed in the cognitive scientific domain, are investigated. Among representative models, the Simon model was found to exhibit surprisingly good long-range correlation, whereas the Pitman-Yor model did not. Since the Simon model is known not to correctly reflect the vocabulary growth of natural language, a simple new model is devised as a combination of the Simon and Pitman-Yor models, such that long-range correlation holds with a correct vocabulary growth rate. Overall, the investigation suggests that uniform sampling is one cause of long-range correlation and could thus have a relation with actual linguistic processes.
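
To make the notion of uniform sampling concrete, the following is a minimal sketch of a Simon-style generative process as it is commonly described: at each step a new word is introduced with a fixed probability, and otherwise a word is copied from a position chosen uniformly at random in the sequence generated so far. The function name `simon_sequence`, the parameter names, and the value 0.1 are assumptions chosen for illustration and are not taken from the paper.

```python
import random


def simon_sequence(length, new_word_prob=0.1, seed=0):
    """Generate word ids with a Simon-style process (illustrative sketch).

    With probability `new_word_prob` a previously unseen word id is emitted;
    otherwise an element is copied from a uniformly chosen past position.
    """
    if length <= 0:
        return []
    rng = random.Random(seed)   # local generator so runs are reproducible
    sequence = [0]              # the first element is always a new word
    next_id = 1
    while len(sequence) < length:
        if rng.random() < new_word_prob:
            # Introduce a new word at a constant rate.
            sequence.append(next_id)
            next_id += 1
        else:
            # Uniform sampling over past positions: frequent words are
            # copied more often simply because they occupy more positions.
            sequence.append(rng.choice(sequence))
    return sequence


seq = simon_sequence(10_000, new_word_prob=0.1, seed=42)
vocab_size = len(set(seq))
```

Because new words arrive at a constant rate in this sketch, the vocabulary size grows roughly linearly with the sequence length, which is consistent with the abstract's remark that the Simon model does not reflect the vocabulary growth of natural language.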
