Asymptotic Inference for Multi-Stage Stationary Treatment Policy with High Dimensional Features

01/29/2023
by   Daiqi Gao, et al.
0

Dynamic treatment rules or policies are a sequence of decision functions over multiple stages that are tailored to individual features. One important class of treatment policies for practice, namely multi-stage stationary treatment policies, prescribe treatment assignment probabilities using the same decision function over stages, where the decision is based on the same set of features consisting of both baseline variables (e.g., demographics) and time-evolving variables (e.g., routinely collected disease biomarkers). Although there has been extensive literature to construct valid inference for the value function associated with the dynamic treatment policies, little work has been done for the policies themselves, especially in the presence of high dimensional feature variables. We aim to fill in the gap in this work. Specifically, we first estimate the multistage stationary treatment policy based on an augmented inverse probability weighted estimator for the value function to increase the asymptotic efficiency, and further apply a penalty to select important feature variables. We then construct one-step improvement of the policy parameter estimators. Theoretically, we show that the improved estimators are asymptotically normal, even if nuisance parameters are estimated at a slow convergence rate and the dimension of the feature variables increases exponentially with the sample size. Our numerical studies demonstrate that the proposed method has satisfactory performance in small samples, and that the performance can be improved with a choice of the augmentation term that approximates the rewards or minimizes the variance of the value function.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/24/2019

Inference on weighted average value function in high-dimensional state space

This paper gives a consistent, asymptotically normal estimator of the ex...
research
06/25/2023

Inference for relative sparsity

In healthcare, there is much interest in estimating policies, or mapping...
research
11/25/2019

Resampling-based Confidence Intervals for Model-free Robust Inference on Optimal Treatment Regimes

Recently, there has been growing interest in estimating optimal treatmen...
research
09/12/2022

Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach

In an Markov decision process (MDP), unobservable confounders may exist ...
research
08/09/2021

Data-guided Treatment Recommendation with Feature Scores

In this paper, we consider the use of large-scale genomics data for trea...
research
10/17/2021

Rejoinder: Learning Optimal Distributionally Robust Individualized Treatment Rules

We thank the opportunity offered by editors for this discussion and the ...
research
12/15/2022

Comparing two spatial variables with the probability of agreement

Computing the agreement between two continuous sequences is of great int...

Please sign up or login with your details

Forgot password? Click here to reset