Distributional Anchor Regression

01/20/2021
by Lucas Kook, et al.

Prediction models often fail if train and test data do not stem from the same distribution. Out-of-distribution (OOD) generalization to unseen, perturbed test data is a desirable but difficult-to-achieve property for prediction models and in general requires strong assumptions on the data generating process (DGP). In a causally inspired perspective on OOD generalization, the test data arise from a specific class of interventions on exogenous random variables of the DGP, called anchors. Anchor regression models, introduced by Rothenhäusler et al. (2018), protect against distributional shifts in the test data by employing causal regularization. So far, however, anchor regression has only been used with a squared-error loss, which is inapplicable to common responses such as censored continuous or ordinal data. Here, we propose a distributional version of anchor regression that generalizes the method to potentially censored responses with at least an ordered sample space. To this end, we combine a flexible class of parametric transformation models for distributional regression with an appropriate causal regularizer under a more general notion of residuals. In an exemplary application and several simulation scenarios, we demonstrate the extent to which OOD generalization is possible.
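
For orientation, the squared-error anchor regression that the abstract takes as its starting point can be sketched in a few lines. This is a minimal illustration of the linear estimator of Rothenhäusler et al. (2018), not the distributional method proposed here. It assumes centered data with no intercept, and the simulated data, variable names, and the choice of gamma are all hypothetical. The estimator minimizes ||(I - P_A)(y - Xb)||^2 + gamma * ||P_A(y - Xb)||^2, where P_A projects onto the column space of the anchors A; this is equivalent to ordinary least squares on data transformed by W = I - (1 - sqrt(gamma)) P_A.

```python
import numpy as np


def anchor_regression(X, y, A, gamma):
    """Linear anchor regression with squared-error loss (illustrative sketch).

    Minimizes ||(I - P_A)(y - X b)||^2 + gamma * ||P_A (y - X b)||^2,
    assuming X, y and A are centered and no intercept is needed.
    """
    def project(M):
        # P_A @ M, computed by least squares instead of forming P_A explicitly
        coef, *_ = np.linalg.lstsq(A, M, rcond=None)
        return A @ coef

    # Transform the data with W = I - (1 - sqrt(gamma)) * P_A, then run OLS
    shrink = 1.0 - np.sqrt(gamma)
    X_t = X - shrink * project(X)
    y_t = y - shrink * project(y)
    beta, *_ = np.linalg.lstsq(X_t, y_t, rcond=None)
    return beta


def anchor_residual_norm(X, y, A, beta):
    """Squared norm of the part of the residuals explained by the anchors."""
    r = y - X @ beta
    coef, *_ = np.linalg.lstsq(A, r, rcond=None)
    return float(np.sum((A @ coef) ** 2))


# Hypothetical simulated data: the anchor A shifts X, a hidden variable H
# confounds X and y.
rng = np.random.default_rng(0)
n = 500
A = rng.normal(size=(n, 1))
H = rng.normal(size=n)
X = (A[:, 0] + H + rng.normal(size=n)).reshape(-1, 1)
y = 2.0 * X[:, 0] + H + rng.normal(size=n)

beta_ols = anchor_regression(X, y, A, gamma=1.0)      # gamma = 1 is plain OLS
beta_anchor = anchor_regression(X, y, A, gamma=10.0)  # stronger causal regularization
```

Setting gamma = 1 recovers ordinary least squares, while larger values penalize the part of the residuals that the anchors can explain, which is the sense in which the regularization is "causal". The distributional version described in the abstract replaces the squared-error loss and the ordinary residuals with a transformation-model likelihood and a more general notion of residuals; that step is not shown in this sketch.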
