Learning to Compose Task-Specific Tree Structures

07/10/2017
by   Jihun Choi, et al.
0

For years, recursive neural networks (RvNNs) have been shown to be suitable for representing text into fixed-length vectors and achieved good performance on several natural language processing tasks. However, the main drawback of RvNNs is that they require structured input, which makes data preparation and model implementation hard. In this paper, we propose Gumbel Tree-LSTM, a novel tree-structured long short-term memory architecture that learns how to compose task-specific tree structures only from plain text data efficiently. Our model uses Straight-Through Gumbel-Softmax estimator to decide the parent node among candidates dynamically and to calculate gradients of the discrete decision. We evaluate the proposed model on natural language inference and sentiment analysis, and show that our model outperforms or is at least comparable to previous models. We also find that our model converges significantly faster than other models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/16/2015

Long Short-Term Memory Over Tree Structures

The chain-structured long short-term memory (LSTM) has showed to be effe...
research
09/24/2018

Text Summarization as Tree Transduction by Top-Down TreeLSTM

Extractive compression is a challenging natural language processing prob...
research
02/08/2023

Ordered Memory Baselines

Natural language semantics can be modeled using the phrase-structured mo...
research
11/09/2017

The Lifted Matrix-Space Model for Semantic Composition

Recent advances in tree structured sentence encoding models have shown t...
research
02/13/2020

A Framework for End-to-End Learning on Semantic Tree-Structured Data

While learning models are typically studied for inputs in the form of a ...
research
11/28/2016

Learning to Compose Words into Sentences with Reinforcement Learning

We use reinforcement learning to learn tree-structured neural networks f...
research
02/05/2019

Deep Tree Transductions - A Short Survey

The paper surveys recent extensions of the Long-Short Term Memory networ...

Please sign up or login with your details

Forgot password? Click here to reset