Double Machine Learning for Partially Linear Mixed-Effects Models with Repeated Measurements

08/31/2021
by Corinne Emmenegger, et al.

Traditionally, spline or kernel approaches in combination with parametric estimation are used to infer the linear coefficient (fixed effects) in a partially linear mixed-effects model (PLMM) for repeated measurements. Using machine learning algorithms allows us to incorporate more complex interaction structures and high-dimensional variables. We employ double machine learning to cope with the nonparametric part of the PLMM: the nonlinear variables are regressed out nonparametrically from both the linear variables and the response. This adjustment can be performed with any machine learning algorithm, for instance random forests. The adjusted variables satisfy a linear mixed-effects model, where the linear coefficient can be estimated with standard linear mixed-effects techniques. We prove that the estimated fixed effects coefficient converges at the parametric rate and is asymptotically Gaussian distributed and semiparametrically efficient. Empirical examples demonstrate our proposed algorithm. We present two simulation studies and analyze a dataset with repeated CD4 cell counts from HIV patients. Software code for our method is available in the R-package dmlalg.
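
The partialling-out step described above can be illustrated with a minimal sketch. It is not the dmlalg implementation, and it rests on several assumptions that do not come from the abstract: a k-nearest-neighbour smoother stands in for the machine learning algorithm (the abstract names random forests as one option), the residuals are cross-fitted on two folds that keep each group's repeated measurements together, and a plain least-squares fit on the residuals stands in for the linear mixed-effects fit. The simulated data, the function names (knn_predict, partial_out, simulate) and all parameter values are hypothetical.

```python
import math
import random


def knn_predict(w_train, v_train, w_query, k=15):
    """Estimate E[v | w] at each query point by averaging the k nearest training responses."""
    preds = []
    for wq in w_query:
        nearest = sorted(range(len(w_train)), key=lambda i: abs(w_train[i] - wq))[:k]
        preds.append(sum(v_train[i] for i in nearest) / len(nearest))
    return preds


def partial_out(w, v, groups, n_folds=2, k=15):
    """Return cross-fitted residuals v - E_hat[v | w].

    Assumption of this sketch: folds are formed from whole groups, so a
    subject's repeated measurements never appear in both the training and
    the evaluation part.
    """
    resid = [0.0] * len(v)
    for fold in range(n_folds):
        test = [i for i, g in enumerate(groups) if g % n_folds == fold]
        train = [i for i, g in enumerate(groups) if g % n_folds != fold]
        fitted = knn_predict(
            [w[i] for i in train], [v[i] for i in train], [w[i] for i in test], k
        )
        for i, f in zip(test, fitted):
            resid[i] = v[i] - f
    return resid


def simulate(n_groups=60, n_per_group=5, beta=1.5, seed=0):
    """Hypothetical toy data: Y = beta * X + g(W) + random intercept + noise."""
    rng = random.Random(seed)
    w, x, y, groups = [], [], [], []
    for g in range(n_groups):
        b = rng.gauss(0.0, 0.5)  # group-level random intercept
        for _ in range(n_per_group):
            wi = rng.uniform(-2.0, 2.0)
            xi = math.cos(wi) + rng.gauss(0.0, 1.0)  # X depends nonlinearly on W
            yi = beta * xi + wi ** 2 + b + rng.gauss(0.0, 0.5)
            w.append(wi)
            x.append(xi)
            y.append(yi)
            groups.append(g)
    return w, x, y, groups


w, x, y, groups = simulate()
rx = partial_out(w, x, groups)  # X with W regressed out
ry = partial_out(w, y, groups)  # Y with W regressed out

# Stand-in for the mixed-effects fit: least squares of ry on rx (no intercept).
beta_hat = sum(a * b for a, b in zip(rx, ry)) / sum(a * a for a in rx)
print(round(beta_hat, 3))
```

In this toy setup the printed estimate should land near the simulated coefficient of 1.5. The least-squares step ignores the within-group correlation, so it does not reproduce the efficiency result stated in the abstract; a faithful version would fit a linear mixed-effects model to rx and ry instead.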


