Faster Convergence in Deep-Predictive-Coding Networks to Learn Deeper Representations

01/18/2021
by   Isaac J. Sledge, et al.
21

Deep-predictive-coding networks (DPCNs) are hierarchical, generative models that rely on feed-forward and feed-back connections to modulate latent feature representations of stimuli in a dynamic and context-sensitive manner. A crucial element of DPCNs is a forward-backward inference procedure to uncover sparse states of a dynamic model, which are used for invariant feature extraction. However, this inference and the corresponding backwards network parameter updating are major computational bottlenecks. They severely limit the network depths that can be reasonably implemented and easily trained. We therefore propose an optimization strategy, with better empirical and theoretical convergence, based on accelerated proximal gradients. We demonstrate that the ability to construct deeper DPCNs leads to receptive fields that capture well the entire notions of objects on which the networks are trained. This improves the feature representations. It yields completely unsupervised classifiers that surpass convolutional and convolutional-recurrent autoencoders and are on par with convolutional networks trained in a supervised manner. This is despite the DPCNs having orders of magnitude fewer parameters.

READ FULL TEXT

page 6

page 7

page 8

page 9

research
01/16/2013

Deep Predictive Coding Networks

The quality of data representation in deep learning methods is directly ...
research
02/20/2019

Meaningful representations emerge from Sparse Deep Predictive Coding

Convolutional Neural Networks (CNNs) are the state-of-the-art algorithms...
research
07/31/2020

Feature Learning for Accelerometer based Gait Recognition

Recent advances in pattern matching, such as speech or object recognitio...
research
09/14/2019

Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise Modulations

Learning representations that accurately capture long-range dependencies...
research
12/01/2021

GANORCON: Are Generative Models Useful for Few-shot Segmentation?

Advances in generative modeling based on GANs has motivated the communit...
research
12/03/2020

Classification and reconstruction of optical quantum states with deep neural networks

We apply deep-neural-network-based techniques to quantum state classific...

Please sign up or login with your details

Forgot password? Click here to reset