Convex Latent Effect Logit Model via Sparse and Low-rank Decomposition

08/22/2021
by   Hongyuan Zhan, et al.

In this paper, we propose a convex formulation for learning a logistic regression (logit) model with latent heterogeneous effects on sub-populations. In transportation, logistic regression and its variants are often interpreted as discrete choice models under utility theory (McFadden, 2001). Two prominent applications of logit models in the transportation domain are traffic accident analysis and choice modeling. In these applications, researchers often want to understand and capture individual variation under the same accident or choice scenario. The mixed effect logistic regression (mixed logit) is a popular model among transportation researchers. Estimating the distribution of mixed logit parameters requires solving a non-convex optimization problem with nested high-dimensional integrals, which is typically done with simulation-based optimization. Despite its popularity, the mixed logit approach for learning individual heterogeneity has several downsides. First, the parametric form of the distribution requires domain knowledge and assumptions imposed by users, although this issue can be addressed to some extent by using a non-parametric approach. Second, the optimization problems arising from parameter estimation for mixed logit and its non-parametric extensions are non-convex, which leads to unstable model interpretation. Third, the simulation size in simulation-assisted estimation lacks finite-sample theoretical guarantees and is chosen somewhat arbitrarily in practice. To address these issues, we develop a formulation that models latent individual heterogeneity while preserving convexity and avoiding the need for simulation-based approximation. Our setup decomposes the parameters into a sparse homogeneous component shared across the population and low-rank heterogeneous parts for each individual.
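
The sparse plus low-rank setup described in the abstract can be illustrated with a generic convex program. The sketch below is an assumption for illustration only, not the formulation from the paper: the symbols (x_{it}, y_{it}, \beta, \gamma_i, \Gamma, \lambda_1, \lambda_2) and the choice of an l1 penalty and a nuclear-norm penalty as convex surrogates for sparsity and low rank are all hypothetical, and the paper's exact model may differ.

```latex
% Illustrative sketch only; all notation is assumed, not taken from the paper.
% Individual i = 1,...,n has observations t = 1,...,T_i with features
% x_{it} \in \mathbb{R}^p and binary outcomes y_{it} \in \{0,1\}.
% Individual parameters split into a shared part and an individual part:
\[
  \theta_i = \beta + \gamma_i, \qquad
  \Gamma = [\gamma_1, \dots, \gamma_n] \in \mathbb{R}^{p \times n}.
\]
% Logistic loss with an l1 penalty (sparse \beta) and a nuclear-norm
% penalty (low-rank \Gamma):
\[
  \min_{\beta,\,\Gamma} \;
  \sum_{i=1}^{n} \sum_{t=1}^{T_i}
  \Bigl[ \log\bigl(1 + e^{x_{it}^{\top}(\beta + \gamma_i)}\bigr)
         - y_{it}\, x_{it}^{\top}(\beta + \gamma_i) \Bigr]
  + \lambda_1 \lVert \beta \rVert_1
  + \lambda_2 \lVert \Gamma \rVert_{*},
  \qquad \lambda_1, \lambda_2 \ge 0 .
\]
```

Under these assumptions the problem is convex: the logistic loss is convex in the linear predictor x_{it}^{\top}(\beta + \gamma_i), which is linear in (\beta, \Gamma), and both the l1 norm and the nuclear norm are convex. No integral over a parameter distribution appears, so no simulation step is needed in this sketch.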

research
10/26/2020

A Sparse β-Model with Covariates for Networks

Data in the form of networks are increasingly encountered in modern scie...
research
12/02/2019

Factor Analysis on Citation, Using a Combined Latent and Logistic Regression Model

We propose a combined model, which integrates the latent factor model an...
research
01/27/2021

An Early Stopping Bayesian Data Assimilation Approach for Mixed-Logit Estimation

The mixed-logit model is a flexible tool in transportation choice analys...
research
10/01/2016

Tuning Parameter Calibration in High-dimensional Logistic Regression With Theoretical Guarantees

Feature selection is a standard approach to understanding and modeling h...
research
11/09/2019

Tensor Regression Using Low-rank and Sparse Tucker Decompositions

This paper studies a tensor-structured linear regression model with a sc...
research
05/09/2017

SILVar: Single Index Latent Variable Models

A semi-parametric, non-linear regression model in the presence of latent...
research
04/15/2021

Heterogeneous Tensor Mixture Models in High Dimensions

We consider the problem of jointly modeling and clustering populations o...
