On the Empirical Neural Tangent Kernel of Standard Finite-Width Convolutional Neural Network Architectures

06/24/2020
by   Maxim Samarin, et al.
0

The Neural Tangent Kernel (NTK) is an important milestone in the ongoing effort to build a theory for deep learning. Its prediction that sufficiently wide neural networks behave as kernel methods, or equivalently as random feature models, has been confirmed empirically for certain wide architectures. It remains an open question how well NTK theory models standard neural network architectures of widths common in practice, trained on complex datasets such as ImageNet. We study this question empirically for two well-known convolutional neural network architectures, namely AlexNet and LeNet, and find that their behavior deviates significantly from their finite-width NTK counterparts. For wider versions of these networks, where the number of channels and widths of fully-connected layers are increased, the deviation decreases.

READ FULL TEXT
research
07/21/2023

Local Kernel Renormalization as a mechanism for feature learning in overparametrized Convolutional Neural Networks

Feature learning, or the ability of deep neural networks to automaticall...
research
12/08/2020

Analyzing Finite Neural Networks: Can We Trust Neural Tangent Kernel Theory?

Neural Tangent Kernel (NTK) theory is widely used to study the dynamics ...
research
11/05/2020

Neural networks for classification of strokes in electrical impedance tomography on a 3D head model

We consider the problem of the detection of brain hemorrhages from three...
research
03/19/2019

Kernel-based Translations of Convolutional Networks

Convolutional Neural Networks, as most artificial neural networks, are c...
research
06/06/2023

Deep neural networks architectures from the perspective of manifold learning

Despite significant advances in the field of deep learning in ap-plicati...
research
04/26/2019

On Exact Computation with an Infinitely Wide Neural Net

How well does a classic deep net architecture like AlexNet or VGG19 clas...
research
05/18/2017

Building effective deep neural network architectures one feature at a time

Successful training of convolutional neural networks is often associated...

Please sign up or login with your details

Forgot password? Click here to reset