On the Joint Interaction of Models, Data, and Features

06/07/2023
by   Yiding Jiang, et al.
0

Learning features from data is one of the defining characteristics of deep learning, but our theoretical understanding of the role features play in deep learning is still rudimentary. To address this gap, we introduce a new tool, the interaction tensor, for empirically analyzing the interaction between data and model through features. With the interaction tensor, we make several key observations about how features are distributed in data and how models with different random seeds learn different features. Based on these observations, we propose a conceptual framework for feature learning. Under this framework, the expected accuracy for a single hypothesis and agreement for a pair of hypotheses can both be derived in closed-form. We demonstrate that the proposed framework can explain empirically observed phenomena, including the recently discovered Generalization Disagreement Equality (GDE) that allows for estimating the generalization error with only unlabeled data. Further, our theory also provides explicit construction of natural data distributions that break the GDE. Thus, we believe this work provides valuable new insight into our understanding of feature learning.

READ FULL TEXT

page 2

page 22

page 24

page 25

page 27

page 32

research
04/01/2015

A Theory of Feature Learning

Feature Learning aims to extract relevant information contained in data ...
research
03/15/2023

The Benefits of Mixup for Feature Learning

Mixup, a simple data augmentation method that randomly mixes two data po...
research
05/31/2021

Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning

How can neural networks trained by contrastive learning extract features...
research
02/07/2022

Diversify and Disambiguate: Learning From Underspecified Data

Many datasets are underspecified, which means there are several equally ...
research
12/28/2022

Feature learning in neural networks and kernel machines that recursively learn features

Neural networks have achieved impressive results on many technological a...
research
04/07/2015

Tensor machines for learning target-specific polynomial features

Recent years have demonstrated that using random feature maps can signif...
research
04/08/2019

Feature Learning Viewpoint of AdaBoost and a New Algorithm

The AdaBoost algorithm has the superiority of resisting overfitting. Und...

Please sign up or login with your details

Forgot password? Click here to reset