Understanding Convolutional Neural Networks with A Mathematical Model

09/14/2016
by   C. -C. Jay Kuo, et al.
0

This work attempts to address two fundamental questions about the structure of the convolutional neural networks (CNN): 1) why a non-linear activation function is essential at the filter output of every convolutional layer? 2) what is the advantage of the two-layer cascade system over the one-layer system? A mathematical model called the "REctified-COrrelations on a Sphere" (RECOS) is proposed to answer these two questions. After the CNN training process, the converged filter weights define a set of anchor vectors in the RECOS model. Anchor vectors represent the frequently occurring patterns (or the spectral components). The necessity of rectification is explained using the RECOS model. Then, the behavior of a two-layer RECOS system is analyzed and compared with its one-layer counterpart. The LeNet-5 and the MNIST dataset are used to illustrate discussion points. Finally, the RECOS model is generalized to a multi-layer system with the AlexNet as an example. Keywords: Convolutional Neural Network (CNN), Nonlinear Activation, RECOS Model, Rectified Linear Unit (ReLU), MNIST Dataset.

READ FULL TEXT

page 8

page 14

research
08/22/2016

Local Binary Convolutional Neural Networks

We propose local binary convolution (LBC), an efficient alternative to c...
research
02/01/2017

Design, Analysis and Application of A Volumetric Convolutional Neural Network

The design, analysis and application of a volumetric convolutional neura...
research
05/21/2018

How Many Samples are Needed to Learn a Convolutional Neural Network?

A widespread folklore for explaining the success of convolutional neural...
research
01/30/2017

CNN as Guided Multi-layer RECOS Transform

There is a resurging interest in developing a neural-network-based solut...
research
10/29/2020

Over-parametrized neural networks as under-determined linear systems

We draw connections between simple neural networks and under-determined ...
research
06/26/2017

Between Homomorphic Signal Processing and Deep Neural Networks: Constructing Deep Algorithms for Polyphonic Music Transcription

This paper presents a new approach in understanding how deep neural netw...

Please sign up or login with your details

Forgot password? Click here to reset