Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes

06/30/2016
by   Caglar Gulcehre, et al.
0

We extend neural Turing machine (NTM) model into a dynamic neural Turing machine (D-NTM) by introducing a trainable memory addressing scheme. This addressing scheme maintains for each memory cell two separate vectors, content and address vectors. This allows the D-NTM to learn a wide variety of location-based addressing strategies including both linear and nonlinear ones. We implement the D-NTM with both continuous, differentiable and discrete, non-differentiable read/write mechanisms. We investigate the mechanisms and effects of learning to read and write into a memory through experiments on Facebook bAbI tasks using both a feedforward and GRUcontroller. The D-NTM is evaluated on a set of Facebook bAbI tasks and shown to outperform NTM and LSTM baselines. We have done extensive analysis of our model and different variations of NTM on bAbI task. We also provide further experimental results on sequential pMNIST, Stanford Natural Language Inference, associative recall and copy tasks.

READ FULL TEXT
research
12/07/2016

Neural Turing Machines: Convergence of Copy Tasks

The architecture of neural Turing machines is differentiable end to end ...
research
10/27/2020

A short note on the decision tree based neural turing machine

Turing machine and decision tree have developed independently for a long...
research
02/28/2016

Lie Access Neural Turing Machine

Following the recent trend in explicit neural memory structures, we pres...
research
01/30/2017

Memory Augmented Neural Networks with Wormhole Connections

Recent empirical results on long-term dependency tasks have shown that n...
research
12/12/2016

Tracking the World State with Recurrent Entity Networks

We introduce a new model, the Recurrent Entity Network (EntNet). It is e...
research
10/12/2017

HyperENTM: Evolving Scalable Neural Turing Machines through HyperNEAT

Recent developments within memory-augmented neural networks have solved ...
research
06/13/2019

Multigrid Neural Memory

We introduce a novel architecture that integrates a large addressable me...

Please sign up or login with your details

Forgot password? Click here to reset