Seeded Binary Segmentation: A general methodology for fast and optimal change point detection

02/16/2020
by Solt Kovács, et al.

In recent years, there has been an increasing demand for efficient algorithms for large-scale change point detection problems. To this end, we propose seeded binary segmentation, an approach relying on a deterministic construction of background intervals, called seeded intervals, in which single change points are searched. The final selection of change points based on the candidates from seeded intervals can be done in various ways, adapted to the problem at hand. Thus, seeded binary segmentation is easy to adapt to a wide range of change point detection problems, be they univariate, multivariate or even high-dimensional. We consider the univariate Gaussian change in mean setup in detail. For this specific case we show that seeded binary segmentation leads to a near-linear time approach (i.e. linear up to a logarithmic factor) independent of the underlying number of change points. Furthermore, using appropriate selection methods, the methodology is shown to be asymptotically minimax optimal. While the approach is computationally more efficient, its finite sample estimation performance remains competitive with state-of-the-art procedures. Moreover, we illustrate the methodology for high-dimensional settings with an inverse covariance change point detection problem, where our proposal leads to massive computational gains while still exhibiting good statistical performance.
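
To make the idea concrete, the following is a minimal Python sketch of the univariate Gaussian change in mean case. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the exact interval construction, so the layering rule here (interval lengths shrinking by a factor `decay`, evenly shifted within each layer) is assumed, as are the function names `seeded_intervals`, `best_split` and `detect`, the use of the standard CUSUM statistic to find the best single split in each interval, and a simple greedy rule with a user-chosen `threshold` as one of the possible selection methods.

```python
import math
import numpy as np


def seeded_intervals(n, decay=0.5, min_len=2):
    """Deterministic multi-layer intervals [start, end) over range(n).

    Assumed construction: layer lengths n, n*decay, n*decay^2, ...;
    each layer holds 2*ceil(n/length) - 1 evenly shifted intervals.
    """
    intervals = set()
    length = float(n)
    while length >= min_len:
        n_k = 2 * math.ceil(n / length) - 1
        shift = (n - length) / (n_k - 1) if n_k > 1 else 0.0
        for i in range(n_k):
            start = math.floor(i * shift)
            end = min(n, math.ceil(i * shift + length))
            if end - start >= min_len:
                intervals.add((start, end))
        length *= decay
    return sorted(intervals)


def best_split(x, start, end):
    """Best single split of x[start:end] by the CUSUM statistic.

    Returns (cp, stat), where cp is the index of the first observation
    of the right-hand segment.
    """
    seg = x[start:end]
    m = len(seg)
    csum = np.cumsum(seg)
    k = np.arange(1, m)  # candidate sizes of the left segment
    left_mean = csum[:-1] / k
    right_mean = (csum[-1] - csum[:-1]) / (m - k)
    stat = np.sqrt(k * (m - k) / m) * np.abs(left_mean - right_mean)
    j = int(np.argmax(stat))
    return start + int(k[j]), float(stat[j])


def detect(x, threshold, decay=0.5):
    """Greedy selection (one possible rule, assumed here).

    Visit candidates by decreasing statistic and skip any interval that
    already contains an accepted change point.
    """
    x = np.asarray(x, dtype=float)
    cands = [(*best_split(x, s, e), s, e)
             for s, e in seeded_intervals(len(x), decay)]
    cands.sort(key=lambda c: -c[1])
    found = []
    for cp, stat, s, e in cands:
        if stat < threshold:
            break
        if all(not (s < f < e) for f in found):
            found.append(cp)
    return sorted(found)


# Noise-free toy signal with a single mean shift at index 50.
signal = np.concatenate([np.zeros(50), 5.0 * np.ones(50)])
change_points = detect(signal, threshold=3.0)
```

In this sketch the interval set depends only on the series length, never on the data, and the total length of the intervals in each layer is at most about twice the series length. That is consistent with the near-linear run time claimed above, since there are only logarithmically many layers. The threshold value is an arbitrary choice for the toy signal; with noisy data it would need to be calibrated to the noise level.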

research
10/20/2020

Optimistic search strategy: Change point detection for large-scale data via adaptive logarithmic queries

As a classical and ever reviving topic, change point detection is often ...
research
12/28/2017

Optimal Covariance Change Point Detection in High Dimension

We study the problem of change point detection for covariance matrices i...
research
01/26/2023

Divide and Conquer Dynamic Programming: An Almost Linear Time Change Point Detection Methodology in High Dimensions

We develop a novel, general and computationally efficient framework, cal...
research
06/23/2020

Seeded intervals and noise level estimation in change point detection: A discussion of Fryzlewicz (2020)

In this discussion, we compare the choice of seeded intervals and that o...
research
07/19/2022

A unified framework for change point detection in high-dimensional linear models

In recent years, change point detection for high dimensional data has be...
research
12/23/2020

Data segmentation algorithms: Univariate mean change and beyond

Data segmentation a.k.a. multiple change point analysis has received con...
