Deconvolutional Latent-Variable Model for Text Sequence Matching

09/21/2017
by Dinghan Shen, et al.
A latent-variable model is introduced for text matching, inferring sentence representations by jointly optimizing generative and discriminative objectives. To alleviate typical optimization challenges in latent-variable models for text, we employ deconvolutional networks as the sequence decoder (generator), providing learned latent codes with more semantic information and better generalization. Our model, trained in an unsupervised manner, yields stronger empirical predictive performance than a model with a decoder based on Long Short-Term Memory (LSTM), with fewer parameters and considerably faster training. Further, we apply it to text sequence-matching problems. The proposed model significantly outperforms several strong sentence-encoding baselines, especially in the semi-supervised setting.
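As a rough illustration of the deconvolutional decoder idea mentioned in the abstract, the sketch below shows how a transposed 1-D convolution expands a short code into a longer sequence. It is a minimal single-channel toy under stated assumptions, not the authors' implementation: the function name `deconv1d`, the kernel values, and the stride are all hypothetical, and a real decoder would use learned multi-channel filters and map the result to word probabilities.

```python
def deconv1d(x, kernel, stride=2):
    """Single-channel 1-D transposed convolution with no padding.

    Hypothetical helper for illustration only. Each input value
    "stamps" a scaled copy of the kernel into the output, and
    overlapping stamps are summed.
    """
    out_len = (len(x) - 1) * stride + len(kernel)
    out = [0.0] * out_len
    for i, xi in enumerate(x):
        for k, wk in enumerate(kernel):
            out[i * stride + k] += xi * wk
    return out


# Toy latent code of length 1, upsampled by two stacked layers.
z = [1.0]
h1 = deconv1d(z, [1.0, 2.0, 3.0])    # length (1 - 1) * 2 + 3 = 3
h2 = deconv1d(h1, [1.0, 1.0, 1.0])   # length (3 - 1) * 2 + 3 = 7
print(len(z), len(h1), len(h2))
```

Every output position is computed from the code by fixed-size filters rather than by stepping through time, so, unlike an LSTM decoder, the layers do not depend on previously generated tokens and can be evaluated in parallel across positions.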
