Anomaly Detection in Stationary Settings: A Permutation-Based Higher Criticism Approach

09/07/2020
by   Ivo V. Stoepker, et al.

Anomaly detection across a large number of data streams is essential in a variety of applications, ranging from epidemiological studies to the monitoring of complex systems. High-dimensional scenarios are usually tackled with scan statistics and related methods, which require stringent modeling assumptions for proper calibration. In this work we take a non-parametric stance and propose a permutation-based variant of the higher criticism statistic that does not require knowledge of the null distribution. This results in an exact test in finite samples which is asymptotically optimal in the wide class of exponential models. We demonstrate that the power loss in finite samples is minimal with respect to the oracle test. Furthermore, since the proposed statistic does not rely on asymptotic approximations, it typically performs better than popular variants of higher criticism that rely on such approximations. We include recommendations so that the test can be readily applied in practice, and demonstrate its applicability in monitoring the daily number of COVID-19 cases in the Netherlands.
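
The general idea of calibrating a higher-criticism-type statistic by permutation can be illustrated with a minimal sketch. This is not the authors' exact statistic: the function names `hc_statistic` and `permutation_pvalue`, the use of per-stream means, the normal-score p-values, and the restriction to the smaller half of the p-values are all assumptions made for illustration. The sketch assumes that, under the null, all observations across all streams are exchangeable, so shuffling the pooled observations reproduces the null distribution of any statistic.

```python
import math
import random


def hc_statistic(streams):
    """Higher-criticism-style score built from per-stream means (illustrative)."""
    n, t = len(streams), len(streams[0])
    pooled = [x for s in streams for x in s]
    mu = sum(pooled) / len(pooled)
    var = sum((x - mu) ** 2 for x in pooled) / (len(pooled) - 1)
    se = math.sqrt(var / t)
    if se == 0.0:
        return 0.0
    # One-sided normal scores, used only to rank streams; the test's
    # validity comes from the permutation calibration below, not from
    # the accuracy of this approximation.
    pvals = sorted(
        0.5 * math.erfc(((sum(s) / t - mu) / se) / math.sqrt(2))
        for s in streams
    )
    best = -math.inf
    for i, p in enumerate(pvals[: max(1, n // 2)], start=1):
        p = min(max(p, 1e-12), 1 - 1e-12)  # guard against division by zero
        best = max(best, math.sqrt(n) * (i / n - p) / math.sqrt(p * (1 - p)))
    return best


def permutation_pvalue(streams, n_perm=200, seed=0):
    """Monte Carlo permutation p-value for hc_statistic."""
    rng = random.Random(seed)
    n, t = len(streams), len(streams[0])
    observed = hc_statistic(streams)
    pooled = [x for s in streams for x in s]  # copy; input is not modified
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        permuted = [pooled[i * t:(i + 1) * t] for i in range(n)]
        if hc_statistic(permuted) >= observed:
            exceed += 1
    # The +1 terms count the observed data as one of the permutations.
    return (1 + exceed) / (1 + n_perm)


if __name__ == "__main__":
    rng = random.Random(1)
    # 40 streams of 20 observations; the first 5 streams have a shifted mean.
    data = [[rng.gauss(0.0, 1.0) for _ in range(20)] for _ in range(40)]
    for i in range(5):
        data[i] = [x + 1.0 for x in data[i]]
    print(permutation_pvalue(data))
```

Because the observed statistic and every permuted statistic are computed by the same function, the resulting p-value is valid under exchangeability regardless of how the per-stream scores are defined; only the power depends on that choice.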


research
01/03/2023

A fast and accurate kernel-based independence test with applications to high-dimensional and functional data

Testing the dependency between two random variables is an important infe...
research
12/15/2017

Efficient Global Monitoring Statistics for High-Dimensional Data

Global monitoring statistics play an important role for developing effic...
research
11/27/2022

A Permutation-free Kernel Two-Sample Test

The kernel Maximum Mean Discrepancy (MMD) is a popular multivariate dist...
research
02/06/2015

Learning Efficient Anomaly Detectors from K-NN Graphs

We propose a non-parametric anomaly detection algorithm for high dimensi...
research
04/11/2019

Comparing a Large Number of Multivariate Distributions

In this paper, we propose a test for the equality of multiple distributi...
research
05/07/2019

Reduction of Monitoring Register on Software Defined Networks

Characterization of data network monitoring registers allows for reducti...
research
12/11/2013

Near-optimal Anomaly Detection in Graphs using Lovasz Extended Scan Statistic

The detection of anomalous activity in graphs is a statistical problem t...
