High-dimensional changepoint estimation with heterogeneous missingness

08/03/2021
by   Bertille Follain, et al.
0

We propose a new method for changepoint estimation in partially-observed, high-dimensional time series that undergo a simultaneous change in mean in a sparse subset of coordinates. Our first methodological contribution is to introduce a 'MissCUSUM' transformation (a generalisation of the popular Cumulative Sum statistics), that captures the interaction between the signal strength and the level of missingness in each coordinate. In order to borrow strength across the coordinates, we propose to project these MissCUSUM statistics along a direction found as the solution to a penalised optimisation problem tailored to the specific sparsity structure. The changepoint can then be estimated as the location of the peak of the absolute value of the projected univariate series. In a model that allows different missingness probabilities in different component series, we identify that the key interaction between the missingness and the signal is a weighted sum of squares of the signal change in each coordinate, with weights given by the observation probabilities. More specifically, we prove that the angle between the estimated and oracle projection directions, as well as the changepoint location error, are controlled with high probability by the sum of two terms, both involving this weighted sum of squares, and representing the error incurred due to noise and the error due to missingness respectively. A lower bound confirms that our changepoint estimator, which we call 'MissInspect', is optimal up to a logarithmic factor. The striking effectiveness of the MissInspect methodology is further demonstrated both on simulated data, and on an oceanographic data set covering the Neogene period.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/19/2021

Estimation of high-dimensional change-points under a group sparsity structure

Change-points are a routine feature of 'big data' observed in the form o...
research
05/30/2017

High Dimensional Structured Superposition Models

High dimensional superposition models characterize observations using pa...
research
05/18/2023

Spectral Change Point Estimation for High Dimensional Time Series by Sparse Tensor Decomposition

We study the problem of change point (CP) detection with high dimensiona...
research
03/07/2020

High-dimensional, multiscale online changepoint detection

We introduce a new method for high-dimensional, online changepoint detec...
research
01/15/2020

Detecting Changes in the Second Moment Structure of High-Dimensional Sensor-Type Data in a K-Sample Setting

The K sample problem for high-dimensional vector time series is studied,...
research
03/21/2019

Estimating the three-month series of the Chilean Gross Domestic Product

In this paper the methodology proponed by Cerqueira et al, 2008; is appl...
research
06/07/2023

Efficient sparsity adaptive changepoint estimation

We propose a new, computationally efficient, sparsity adaptive changepoi...

Please sign up or login with your details

Forgot password? Click here to reset