Learning Dependencies of Discrete Speech Representations with Neural Hidden Markov Models

10/29/2022
by   Sung-Lin Yeh, et al.
0

While discrete latent variable models have had great success in self-supervised learning, most models assume that frames are independent. Due to the segmental nature of phonemes in speech perception, modeling dependencies among latent variables at the frame level can potentially improve the learned representations on phonetic-related tasks. In this work, we assume Markovian dependencies among latent variables, and propose to learn speech representations with neural hidden Markov models. Our general framework allows us to compare to self-supervised models that assume independence, while keeping the number of parameters fixed. The added dependencies improve the accessibility of phonetic information, phonetic segmentation, and the cluster purity of phones, showcasing the benefit of the assumed dependencies.

READ FULL TEXT
research
06/11/2020

Discrete Latent Variable Representations for Low-Resource Text Classification

While much work on deep latent variable models of text uses continuous l...
research
10/07/2022

GOLLIC: Learning Global Context beyond Patches for Lossless High-Resolution Image Compression

Neural-network-based approaches recently emerged in the field of data co...
research
06/08/2021

Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style

Self-supervised representation learning has shown remarkable success in ...
research
02/13/2020

Variational Conditional-Dependence Hidden Markov Models for Human Action Recognition

Hidden Markov Models (HMMs) are a powerful generative approach for model...
research
06/17/2022

Self-supervised speech unit discovery from articulatory and acoustic features using VQ-VAE

The human perception system is often assumed to recruit motor knowledge ...
research
11/10/2015

Anchored Discrete Factor Analysis

We present a semi-supervised learning algorithm for learning discrete fa...
research
08/20/2020

A Value of Information Framework for Latent Variable Models

In this paper, a general value of information (VoI) framework is formali...

Please sign up or login with your details

Forgot password? Click here to reset