Navigating the corporate disclosure gap: Modelling of Missing Not at Random Carbon Data

12/14/2021
by   Malgorzata Olesiewicz, et al.
0

Corporate carbon emissions data is disclosed by approximately 65 and mid-sized companies globally, despite being a key indicator of corporate climate performance. With investors increasingly looking to integrate climate risk into their investment strategies and risk reporting, this creates demand for robust prediction models that can generate reliable estimates for missing carbon disclosures. However, these estimates lack transparency and are frequently used in the investment decisions process with the same confidence as corporate reported data. As disclosures remain mostly voluntary and the propensity to disclose is shaped by several factors (e.g. size, sector, geography), missing emissions data should be assumed to be missing not at random (MNAR). However, widely used estimation methods (e.g. linear regression models) typically do not correct for MNAR bias and do not accurately reflect the uncertainty of estimated data. The objective of this paper is to address these issues: (1) account for the uncertainty of the missing data and thus obtain regression coefficients by multiple imputation (MI) (2) correct for potential bias by using MI algorithms based on Heckman's sample selection model introduced by Galimard et al. (3) estimate missing carbon disclosures with linear models based on MI and report on the uncertainty of predicted values, measured as the length of the prediction interval. In the simulation, our approach resulted in an accuracy gain based on root mean squared error of up to 30 applied to commercial data, the results suggested up to 20 proposed methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/18/2020

Robust Optimal Design when Missing Data Happen at Random

In this article, we investigate the robust optimal design problem for th...
research
01/03/2017

New Methods of Enhancing Prediction Accuracy in Linear Models with Missing Data

In this paper, prediction for linear systems with missing information is...
research
07/25/2018

Propensity score estimation using classification and regression trees in the presence of missing covariate data

Data mining and machine learning techniques such as classification and r...
research
01/09/2018

"Robust-squared" Imputation Models Using BART

Examples of "doubly robust" estimator for missing data include augmented...
research
03/29/2023

Correcting for Selection Bias and Missing Response in Regression using Privileged Information

When estimating a regression model, we might have data where some labels...
research
01/31/2023

Naive imputation implicitly regularizes high-dimensional linear models

Two different approaches exist to handle missing values for prediction: ...
research
05/26/2022

RIGID: Robust Linear Regression with Missing Data

We present a robust framework to perform linear regression with missing ...

Please sign up or login with your details

Forgot password? Click here to reset