Understanding Pathologies of Deep Heteroskedastic Regression

06/29/2023
by   Eliot Wong-Toi, et al.
0

Several recent studies have reported negative results when using heteroskedastic neural regression models to model real-world data. In particular, for overparameterized models, the mean and variance networks are powerful enough to either fit every single data point (while shrinking the predicted variances to zero), or to learn a constant prediction with an output variance exactly matching every predicted residual (i.e., explaining the targets as pure noise). This paper studies these difficulties from the perspective of statistical physics. We show that the observed instabilities are not specific to any neural network architecture but are already present in a field theory of an overparameterized conditional Gaussian likelihood model. Under light assumptions, we derive a nonparametric free energy that can be solved numerically. The resulting solutions show excellent qualitative agreement with empirical model fits on real-world data and, in particular, prove the existence of phase transitions, i.e., abrupt, qualitative differences in the behaviors of the regressors upon varying the regularization strengths on the two networks. Our work thus provides a theoretical explanation for the necessity to carefully regularize heteroskedastic regression models. Moreover, the insights from our theory suggest a scheme for optimizing this regularization which is quadratically more efficient than the naive approach.

READ FULL TEXT

page 7

page 17

page 18

page 19

research
04/25/2018

On nonparametric inference for spatial regression models under domain expanding and infill asymptotics

In this paper, we develop nonparametric inference on spatial regression ...
research
04/25/2018

On nonparametric inference for spatial regression models under domain expanding and infill observations

In this paper, we develop nonparametric inference on spatial regression ...
research
04/07/2021

Prediction with Missing Data

Missing information is inevitable in real-world data sets. While imputat...
research
05/30/2019

Regression with Conditional GAN

In recent years, impressive progress has been made in the design of impl...
research
03/10/2022

Bias-variance decomposition of overparameterized regression with random linear features

In classical statistics, the bias-variance trade-off describes how varyi...
research
05/04/2018

Distribution Assertive Regression

In regression modelling approach, the main step is to fit the regression...
research
02/13/2020

A Unifying Network Architecture for Semi-Structured Deep Distributional Learning

We propose a unifying network architecture for deep distributional learn...

Please sign up or login with your details

Forgot password? Click here to reset