Main effects and interactions in mixed and incomplete data frames

06/26/2018
by   Geneviève Robin, et al.
0

A mixed data frame (MDF) is a table collecting categorical, numerical and count observations. The use of MDF is widespread in statistics and the applications are numerous from abundance data in ecology to recommender systems. In many cases, an MDF exhibits simultaneously main effects, such as row, column or group effects and interactions, for which a low-rank model has often been suggested. Although the literature on low-rank approximations is very substantial, with few exceptions, existing methods do not allow to incorporate main effects and interactions while providing statistical guarantees. The present work fills this gap. We propose an estimation method which allows to recover simultaneously the main effects and the interactions. We show that our method is near optimal under conditions which are met in our targeted applications. Numerical experiments using both simulated and survey data are provided to support our claims.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset