Computational Performance Aware Benchmarking of Unsupervised Concept Drift Detection

04/17/2023
by   Elias Werner, et al.
0

For many AI systems, concept drift detection is crucial to ensure the systems reliability. These systems often have to deal with large amounts of data or react in real time. Thus, drift detectors must meet computational requirements or constraints with a comprehensive performance evaluation. However, so far, the focus of developing drift detectors is on detection quality, e.g. accuracy, but not on computational performance, such as running time. We show that the previous works consider computational performance only as a secondary objective and do not have a benchmark for such evaluation. Hence, we propose a novel benchmark suite for drift detectors that accounts both detection quality and computational performance to ensure a detector's applicability in various AI systems. In this work, we focus on unsupervised drift detectors that are not restricted to the availability of labeled data and thus being widely applicable. Our benchmark suite supports configurable synthetic and real world data streams. Moreover, it provides means for simulating a machine learning model's output to unify the performance evaluation across different drift detectors. This allows a fair and comprehensive comparison of drift detectors proposed in related work. Our benchmark suite is integrated in the existing framework, Massive Online Analysis (MOA). To evaluate our benchmark suite's capability, we integrate two representative unsupervised drift detectors. Our work enables the scientific community to achieve a baseline for unsupervised drift detectors with respect to both detection quality and computational performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/20/2020

Adversarial Concept Drift Detection under Poisoning Attacks for Robust Data Stream Mining

Continuous learning from streaming data is among the most challenging to...
research
01/31/2022

Implicit Concept Drift Detection for Multi-label Data Streams

Many real-world applications adopt multi-label data streams as the need ...
research
03/31/2017

On the Reliable Detection of Concept Drift from Streaming Unlabeled Data

Classifiers deployed in the real world operate in a dynamic environment,...
research
06/25/2018

Request-and-Reverify: Hierarchical Hypothesis Testing for Concept Drift Detection with Expensive Labels

One important assumption underlying common classification models is the ...
research
06/18/2021

Labelling Drifts in a Fault Detection System for Wind Turbine Maintenance

A failure detection system is the first step towards predictive maintena...
research
02/23/2023

Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views

Temporal concept drift refers to the problem of data changing over time....
research
03/09/2023

Fast kernel methods for Data Quality Monitoring as a goodness-of-fit test

We here propose a machine learning approach for monitoring particle dete...

Please sign up or login with your details

Forgot password? Click here to reset