Do Acoustic Word Embeddings Capture Phonological Similarity? An Empirical Study

06/16/2021
by   Badr M. Abdullah, et al.
0

Several variants of deep neural networks have been successfully employed for building parametric models that project variable-duration spoken word segments onto fixed-size vector representations, or acoustic word embeddings (AWEs). However, it remains unclear to what degree we can rely on the distance in the emerging AWE space as an estimate of word-form similarity. In this paper, we ask: does the distance in the acoustic embedding space correlate with phonological dissimilarity? To answer this question, we empirically investigate the performance of supervised approaches for AWEs with different neural architectures and learning objectives. We train AWE models in controlled settings for two languages (German and Czech) and evaluate the embeddings on two tasks: word discrimination and phonological similarity. Our experiments show that (1) the distance in the embedding space in the best cases only moderately correlates with phonological distance, and (2) improving the performance on the word discrimination task does not necessarily yield models that better reflect word phonological similarity. Our findings highlight the necessity to rethink the current intrinsic evaluations for AWEs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/05/2015

Deep convolutional acoustic word embeddings using word-pair side information

Recent studies have been revisiting whole words as the basic modelling u...
research
09/21/2021

How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings

How do neural networks "perceive" speech sounds from unknown languages? ...
research
01/08/2023

Analyzing the Representational Geometry of Acoustic Word Embeddings

Acoustic word embeddings (AWEs) are vector representations such that dif...
research
09/14/2022

Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings

Models of acoustic word embeddings (AWEs) learn to map variable-length s...
research
09/30/2021

Phonetic Word Embeddings

This work presents a novel methodology for calculating the phonetic simi...
research
12/01/2020

Intrinsic analysis for dual word embedding space models

Recent word embeddings techniques represent words in a continuous vector...
research
07/26/2023

The flow of ideas in word embeddings

The flow of ideas has been extensively studied by physicists, psychologi...

Please sign up or login with your details

Forgot password? Click here to reset