Dynamical Variational Autoencoders: A Comprehensive Review

08/28/2020
by   Laurent Girin, et al.
0

The Variational Autoencoder (VAE) is a powerful deep generative model that is now extensively used to represent high-dimensional complex data via a low-dimensional latent space that is learned in an unsupervised manner. In the original VAE model, input data vectors are processed independently. In the recent years, a series of papers have presented different extensions of the VAE to sequential data, that not only model the latent space, but also model the temporal dependencies within a sequence of data vectors and/or corresponding latent vectors, relying on recurrent neural networks or state space models. In this paper we perform an extensive literature review of these models. Importantly, we introduce and discuss a general class of models called Dynamical Variational Autoencoders (DVAEs) that encompass a large subset of these temporal VAE extensions. Then we present in details seven different instances of DVAE that were recently proposed in the literature, with an effort to homogenize the notations and presentation lines, as well as to relate those models with existing classical temporal models (that are also presented for the sake of completeness). We reimplemented those seven DVAE models and we present the results of an experimental benchmark that we conducted on the speech analysis-resynthesis task (the PyTorch code will be made publicly available). An extensive discussion is presented at the end of the paper, aiming to comment on important issues concerning the DVAE class of models and to describe future research guidelines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2021

A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling

The Variational Autoencoder (VAE) is a powerful deep generative model th...
research
04/05/2023

Shape complexity estimation using VAE

In this paper, we compare methods for estimating the complexity of two-d...
research
03/07/2023

Speech Modeling with a Hierarchical Transformer Dynamical VAE

The dynamical variational autoencoders (DVAEs) are a family of latent-va...
research
11/25/2018

Sequential Variational Autoencoders for Collaborative Filtering

Variational autoencoders were proven successful in domains such as compu...
research
11/24/2020

AI Discovering a Coordinate System of Chemical Elements: Dual Representation by Variational Autoencoders

The periodic table is a fundamental representation of chemical elements ...
research
07/12/2022

Markovian Gaussian Process Variational Autoencoders

Deep generative models are widely used for modelling high-dimensional ti...
research
12/03/2021

Estimating the Value-at-Risk by Temporal VAE

Estimation of the value-at-risk (VaR) of a large portfolio of assets is ...

Please sign up or login with your details

Forgot password? Click here to reset