PonderNet: Learning to Ponder

07/12/2021
by   Andrea Banino, et al.
0

In standard neural networks the amount of computation used grows with the size of the inputs, but not with the complexity of the problem being learnt. To overcome this limitation we introduce PonderNet, a new algorithm that learns to adapt the amount of computation based on the complexity of the problem at hand. PonderNet learns end-to-end the number of computational steps to achieve an effective compromise between training prediction accuracy, computational cost and generalization. On a complex synthetic problem, PonderNet dramatically improves performance over previous adaptive computation methods and additionally succeeds at extrapolation tests where traditional neural networks fail. Also, our method matched the current state of the art results on a real world question and answering dataset, but using less compute. Finally, PonderNet reached state of the art results on a complex task designed to test the reasoning capabilities of neural networks.1

READ FULL TEXT
research
05/06/2022

Scalable computation of prediction intervals for neural networks via matrix sketching

Accounting for the uncertainty in the predictions of modern neural netwo...
research
07/05/2021

Training Adaptive Computation for Open-Domain Question Answering with Computational Constraints

Adaptive Computation (AC) has been shown to be effective in improving th...
research
01/07/2016

Learning to Compose Neural Networks for Question Answering

We describe a question answering model that applies to both images and s...
research
11/10/2020

Don't Read Too Much into It: Adaptive Computation for Open-Domain Question Answering

Most approaches to Open-Domain Question Answering consist of a light-wei...
research
02/13/2017

Multitask Learning with Deep Neural Networks for Community Question Answering

In this paper, we developed a deep neural network (DNN) that learns to s...
research
12/15/2017

Reducing Deep Network Complexity with Fourier Transform Methods

We propose a novel way that uses shallow densely connected neuron networ...
research
03/18/2019

Lorenz Trajectories Prediction: Travel Through Time

In this article the Lorenz dynamical system is revived and revisited and...

Please sign up or login with your details

Forgot password? Click here to reset