Context-Aware Drift Detection

03/16/2022
by   Oliver Cobb, et al.
0

When monitoring machine learning systems, two-sample tests of homogeneity form the foundation upon which existing approaches to drift detection build. They are used to test for evidence that the distribution underlying recent deployment data differs from that underlying the historical reference data. Often, however, various factors such as time-induced correlation mean that batches of recent deployment data are not expected to form an i.i.d. sample from the historical data distribution. Instead we may wish to test for differences in the distributions conditional on context that is permitted to change. To facilitate this we borrow machinery from the causal inference domain to develop a more general drift detection framework built upon a foundation of two-sample tests for conditional distributional treatment effects. We recommend a particular instantiation of the framework based on maximum conditional mean discrepancies. We then provide an empirical study demonstrating its effectiveness for various drift detection problems of practical interest, such as detecting drift in the distributions underlying subpopulations of data in a manner that is insensitive to their respective prevalences. The study additionally demonstrates applicability to ImageNet-scale vision problems.

READ FULL TEXT

page 7

page 17

page 21

research
10/16/2022

Class Distribution Monitoring for Concept Drift Detection

We introduce Class Distribution Monitoring (CDM), an effective concept-d...
research
09/07/2023

Uncovering Drift in Textual Data: An Unsupervised Method for Detecting and Mitigating Drift in Machine Learning Models

Drift in machine learning refers to the phenomenon where the statistical...
research
10/30/2017

Monotonicity and robustness in Wiener disorder detection

We study the problem of detecting a drift change of a Brownian motion un...
research
05/13/2022

Precise Change Point Detection using Spectral Drift Detection

The notion of concept drift refers to the phenomenon that the data gener...
research
04/11/2018

KS(conf ): A Light-Weight Test if a ConvNet Operates Outside of Its Specifications

Computer vision systems for automatic image categorization have become a...
research
06/24/2020

ANOVA exemplars for understanding data drift

The distributions underlying complex datasets, such as images, text or t...
research
08/16/2022

Temporal Concept Drift and Alignment: An empirical approach to comparing Knowledge Organization Systems over time

This research explores temporal concept drift and temporal alignment in ...

Please sign up or login with your details

Forgot password? Click here to reset