Sparse Markov Models for High-dimensional Inference

02/16/2022
by   Guilherme Ost, et al.
0

Finite order Markov models are theoretically well-studied models for dependent discrete data. Despite their generality, application in empirical work when the order is large is rare. Practitioners avoid using higher order Markov models because (1) the number of parameters grow exponentially with the order and (2) the interpretation is often difficult. Mixture of transition distribution models (MTD) were introduced to overcome both limitations. MTD represent higher order Markov models as a convex mixture of single step Markov chains, reducing the number of parameters and increasing the interpretability. Nevertheless, in practice, estimation of MTD models with large orders are still limited because of curse of dimensionality and high algorithm complexity. Here, we prove that if only few lags are relevant we can consistently and efficiently recover the lags and estimate the transition probabilities of high-dimensional MTD models. The key innovation is a recursive procedure for the selection of the relevant lags of the model. Our results are based on (1) a new structural result of the MTD and (2) an improved martingale concentration inequality. We illustrate our method using simulations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/25/2019

Estimation and selection for high-order Markov chains with Bayesian mixture transition distribution models

We develop two models for Bayesian estimation and selection in high-orde...
research
02/11/2022

Fitting Sparse Markov Models to Categorical Time Series Using Regularization

The major problem of fitting a higher order Markov model is the exponent...
research
04/20/2017

Retrospective Higher-Order Markov Processes for User Trails

Users form information trails as they browse the web, checkin with a geo...
research
04/03/2023

A mixture transition distribution modeling for higher-order circular Markov processes

The stationary higher-order Markov process for circular data is consider...
research
06/28/2019

Bias-Variance Trade-Off in Hierarchical Probabilistic Models Using Higher-Order Feature Interactions

Hierarchical probabilistic models are able to use a large number of para...
research
09/18/2007

Bayesian Classification and Regression with High Dimensional Features

This thesis responds to the challenges of using a large number, such as ...
research
09/08/2018

A high dimensional Central Limit Theorem for martingales, with applications to context tree models

We establish a central limit theorem for (a sequence of) multivariate ma...

Please sign up or login with your details

Forgot password? Click here to reset