Improve Long-term Memory Learning Through Rescaling the Error Temporally

07/21/2023
by   Shida Wang, et al.
0

This paper studies the error metric selection for long-term memory learning in sequence modelling. We examine the bias towards short-term memory in commonly used errors, including mean absolute/squared error. Our findings show that all temporally positive-weighted errors are biased towards short-term memory in learning linear functionals. To reduce this bias and improve long-term memory learning, we propose the use of a temporally rescaled error. In addition to reducing the bias towards short-term memory, this approach can also alleviate the vanishing gradient issue. We conduct numerical experiments on different long-memory tasks and sequence models to validate our claims. Numerical results confirm the importance of appropriate temporally rescaled error for effective long-term memory learning. To the best of our knowledge, this is the first work that quantitatively analyzes different errors' memory bias towards short-term memory in sequence modelling.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2018

Automating Network Error Detection using Long-Short Term Memory Networks

In this work, we investigate the current flaws with identifying network-...
research
12/22/2017

Learning Based on CC1 and CC4 Neural Networks

We propose that a general learning system should have three kinds of age...
research
04/22/2010

Price Trackers Inspired by Immune Memory

In this paper we outline initial concepts for an immune inspired algorit...
research
07/27/2022

Explain My Surprise: Learning Efficient Long-Term Memory by Predicting Uncertain Outcomes

In many sequential tasks, a model needs to remember relevant events from...
research
02/04/2014

Short-term plasticity as cause-effect hypothesis testing in distal reward learning

Asynchrony, overlaps and delays in sensory-motor signals introduce ambig...
research
11/26/2018

Augmenting Robot Knowledge Consultants with Distributed Short Term Memory

Human-robot communication in situated environments involves a complex in...
research
03/14/2015

Dynamic Move Tables and Long Branches with Backtracking in Computer Chess

The idea of dynamic move chains has been described in a preceding paper ...

Please sign up or login with your details

Forgot password? Click here to reset