How the fundamental concepts of mathematics and physics explain deep learning

11/01/2018
by   Jean Thierry-Mieg, et al.
0

Starting from the Fermat's principle of least action, which governs classical and quantum mechanics and from the theory of exterior differential forms, which governs the geometry of curved manifolds, we show how to derive the equations governing neural networks in an intrinsic, coordinate invariant way, where the differential dW of the parameters W appears as the cotangent pullback of the differential of the loss function L: dW = -eta f*(dL) where f denotes the action of the network, and eta the learning rate. To be covariant, these equations imply a layer metric which is instrumental in pretraining and explains the role of conjugation when using complex numbers. The differential formalism also clarifies the relation of the gradient descent optimizer with Aristotelian and Newtonian mechanics and why large learning steps break the logic of the linearization procedure. We hope that this formal presentation of the differential geometry of neural networks will encourage some physicists to dive into deep learning, and reciprocally, that the specialists of deep learning will better appreciate the close interconnection of their subject with the foundations of classical and quantum field theory.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/21/2020

Generalized and graded geometry for mechanics: a comprehensive introduction

In this paper we make an overview of results relating the recent "discov...
research
09/09/2019

Differential equations as models of deep neural networks

In this work we systematically analyze general properties of differentia...
research
05/18/2021

Deep learning for solution and inversion of structural mechanics and vibrations

Deep learning has been the most popular machine learning method in the l...
research
11/27/2020

Deep Learning for Classical Mechanics

Deep learning has been widely and actively used in various research area...
research
07/26/2018

Discovering physical concepts with neural networks

The formalism of quantum physics is built upon that of classical mechani...
research
07/04/2019

Least Action Principles and Well-Posed Learning Problems

Machine Learning algorithms are typically regarded as appropriate optimi...
research
05/24/2019

Doctor of Crosswise: Reducing Over-parametrization in Neural Networks

Dr. of Crosswise proposes a new architecture to reduce over-parametrizat...

Please sign up or login with your details

Forgot password? Click here to reset