Domain-Adjusted Regression or: ERM May Already Learn Features Sufficient for Out-of-Distribution Generalization

02/14/2022
by   Elan Rosenfeld, et al.
0

A common explanation for the failure of deep networks to generalize out-of-distribution is that they fail to recover the "correct" features. Focusing on the domain generalization setting, we challenge this notion with a simple experiment which suggests that ERM already learns sufficient features and that the current bottleneck is not feature learning, but robust regression. We therefore argue that devising simpler methods for learning predictors on existing features is a promising direction for future research. Towards this end, we introduce Domain-Adjusted Regression (DARE), a convex objective for learning a linear predictor that is provably robust under a new model of distribution shift. Rather than learning one function, DARE performs a domain-specific adjustment to unify the domains in a canonical latent space and learns to predict in this space. Under a natural model, we prove that the DARE solution is the minimax-optimal predictor for a constrained set of test distributions. Further, we provide the first finite-environment convergence guarantee to the minimax risk, improving over existing results which show a "threshold effect". Evaluated on finetuned features, we find that DARE compares favorably to prior methods, consistently achieving equal or better performance.

READ FULL TEXT
research
06/28/2022

Domain Agnostic Few-shot Learning for Speaker Verification

Deep learning models for verification systems often fail to generalize t...
research
04/22/2023

Towards Understanding Feature Learning in Out-of-Distribution Generalization

A common explanation for the failure of out-of-distribution (OOD) genera...
research
07/14/2023

DISPEL: Domain Generalization via Domain-Specific Liberating

Domain generalization aims to learn a generalization model that can perf...
research
05/29/2022

The Missing Invariance Principle Found – the Reciprocal Twin of Invariant Risk Minimization

Machine learning models often generalize poorly to out-of-distribution (...
research
10/09/2017

Deeper, Broader and Artier Domain Generalization

The problem of domain generalization is to learn from multiple training ...
research
07/19/2023

Spuriosity Didn't Kill the Classifier: Using Invariant Predictions to Harness Spurious Features

To avoid failures on out-of-distribution data, recent works have sought ...

Please sign up or login with your details

Forgot password? Click here to reset