Imputation with verifiable identification condition for nonignorable missing outcomes

04/22/2022
by   Kenji Beppu, et al.
0

Missing data often results in undesirable bias and loss of efficiency. These results become substantial problems when the response mechanism is nonignorable, such that the response model depends on the unobserved variable. It is often necessary to estimate the joint distribution of the unobserved variables and response indicators to further manage nonignorable nonresponse. However, model misspecification and identification issues prevent robust estimates, despite carefully estimating the target joint distribution. In this study we model the distribution of the observed parts and derived sufficient conditions for model identifiability, assuming a logistic distribution of the response mechanism and a generalized linear model as the main outcome model of interest. More importantly, the derived sufficient conditions are testable with the observed data and do not require any instrumental variables, which have often been assumed to guarantee model identifiability but cannot be practically determined beforehand. To analyse missing data, we propose a new fractional imputation method which incorporates verifiable identifiability using only the observed data. Furthermore, we present the performance of the proposed estimators in numerical studies and apply the proposed method to two sets of real data, namely, Opinion Poll for the 2022 South Korean Presidential Election, and public data collected from the US National Supported Work Evaluation Study.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2016

Recoverability of Joint Distribution from Missing Data

A probabilistic query may not be estimable from observed data corrupted ...
research
03/02/2021

Multiple imputation with missing data indicators

Multiple imputation is a well-established general technique for analyzin...
research
04/05/2023

Verifiable identification condition for nonignorable nonresponse data with categorical instrumental variables

We consider a model identification problem in which an outcome variable ...
research
11/13/2018

What is really needed to justify ignoring the response mechanism for modelling purposes?

With incomplete data, the standard argument for when the response mechan...
research
04/10/2020

Full Law Identification In Graphical Models Of Missing Data: Completeness Results

Missing data has the potential to affect analyses conducted in all field...
research
07/18/2022

A self-censoring model for multivariate nonignorable nonmonotone missing data

We introduce a self-censoring model for multivariate nonignorable nonmon...
research
12/05/2022

Identification of Unobservables in Observations

In empirical studies, the data usually don't include all the variables o...

Please sign up or login with your details

Forgot password? Click here to reset