Inversion dynamics of class manifolds in deep learning reveals tradeoffs underlying generalisation

03/09/2023
by   Simone Ciceri, et al.
0

To achieve near-zero training error in a classification problem, the layers of a deep network have to disentangle the manifolds of data points with different labels, to facilitate the discrimination. However, excessive class separation can bring to overfitting since good generalisation requires learning invariant features, which involve some level of entanglement. We report on numerical experiments showing how the optimisation dynamics finds representations that balance these opposing tendencies with a non-monotonic trend. After a fast segregation phase, a slower rearrangement (conserved across data sets and architectures) increases the class entanglement. The training error at the inversion is remarkably stable under subsampling, and across network initialisations and optimisers, which characterises it as a property solely of the data structure and (very weakly) of the architecture. The inversion is the manifestation of tradeoffs elicited by well-defined and maximally stable elements of the training set, coined "stragglers", particularly influential for generalisation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2021

Deep-Learning Inversion Method for the Interpretation of Noisy Logging-While-Drilling Resistivity Measurements

Deep Learning (DL) inversion is a promising method for real time interpr...
research
02/13/2018

Homological analysis of multi-qubit entanglement

We propose the usage of persistent homologies to characterize multiparti...
research
02/05/2019

Analyzing and Improving Representations with the Soft Nearest Neighbor Loss

We explore and expand the Soft Nearest Neighbor Loss to measure the enta...
research
09/05/2022

Numerical dynamics of integrodifference equations: Hierarchies of invariant bundles in L^p(Ω)

We study how the "full hierarchy" of invariant manifolds for nonautonomo...
research
05/06/2021

Relative stability toward diffeomorphisms in deep nets indicates performance

Understanding why deep nets can classify data in large dimensions remain...
research
03/26/2018

Bridging Many-Body Quantum Physics and Deep Learning via Tensor Networks

The harnessing of modern computational abilities for many-body wave-func...

Please sign up or login with your details

Forgot password? Click here to reset