Evaluating Models of Robust Word Recognition with Serial Reproduction

01/24/2021
by   Stephan C. Meylan, et al.
1

Spoken communication occurs in a "noisy channel" characterized by high levels of environmental noise, variability within and between speakers, and lexical and syntactic ambiguity. Given these properties of the received linguistic input, robust spoken word recognition – and language processing more generally – relies heavily on listeners' prior knowledge to evaluate whether candidate interpretations of that input are more or less likely. Here we compare several broad-coverage probabilistic generative language models in their ability to capture human linguistic expectations. Serial reproduction, an experimental paradigm where spoken utterances are reproduced by successive participants similar to the children's game of "Telephone," is used to elicit a sample that reflects the linguistic expectations of English-speaking adults. When we evaluate a suite of probabilistic generative language models against the yielded chains of utterances, we find that those models that make use of abstract representations of preceding linguistic context (i.e., phrase structure) best predict the changes made by people in the course of serial reproduction. A logistic regression model predicting which words in an utterance are most likely to be lost or changed in the course of spoken transmission corroborates this result. We interpret these findings in light of research highlighting the interaction of memory-based constraints and representations in language processing.

READ FULL TEXT

page 5

page 6

page 20

page 22

page 30

research
02/06/2021

Child-directed Listening: How Caregiver Inference Enables Children's Early Verbal Communication

How do adults understand children's speech? Children's productions over ...
research
06/15/2022

How Adults Understand What Young Children Say

Children's early speech often bears little resemblance to adult speech i...
research
05/01/2021

It's not what you said, it's how you said it: discriminative perception of speech as a multichannel communication system

People convey information extremely effectively through spoken interacti...
research
01/01/1997

SCREEN: Learning a Flat Syntactic and Semantic Spoken Language Analysis Using Artificial Neural Networks

Previous approaches of analyzing spontaneously spoken language often hav...
research
05/22/2023

Prompt-based methods may underestimate large language models' linguistic generalizations

Prompting is now a dominant method for evaluating the linguistic knowled...
research
06/10/2019

Hierarchical Representation in Neural Language Models: Suppression and Recovery of Expectations

Deep learning sequence models have led to a marked increase in performan...
research
06/16/2021

Unsupervised Lexical Acquisition of Relative Spatial Concepts Using Spoken User Utterances

This paper proposes methods for unsupervised lexical acquisition for rel...

Please sign up or login with your details

Forgot password? Click here to reset