McDiarmid Drift Detection Methods for Evolving Data Streams

10/05/2017
by   Ali Pesaranghader, et al.
0

Increasingly, Internet of Things (IoT) domains, such as sensor networks, smart cities, and social networks, generate vast amounts of data. Such data are not only unbounded and rapidly evolving. Rather, the content thereof dynamically evolves over time, often in unforeseen ways. These variations are due to so-called concept drifts, caused by changes in the underlying data generation mechanisms. In a classification setting, concept drift causes the previously learned models to become inaccurate, unsafe and even unusable. Accordingly, concept drifts need to be detected, and handled, as soon as possible. In medical applications and military zones, for example, change in behaviors should be detected in near real-time, to avoid potential loss of life. To this end, we introduce the McDiarmid Drift Detection Method (MDDM), which utilizes McDiarmid's inequality in order to detect concept drift. The MDDM approach proceeds by sliding a window over prediction results, and associate window entries with weights. Higher weights are assigned to the most recent entries, in order to emphasize their importance. As instances are processed, the detection algorithm compares a weighted mean of elements inside the sliding window with the maximum weighted mean observed so far. A significant difference between the two weighted means, upper-bounded by the McDiarmid inequality, implies a concept drift. Our extensive experimentation against synthetic and real-world data streams show that our novel method outperforms the state-of-the-art. Specifically, MDDM yields shorter detection delays as well as lower false negative rates, while maintaining high classification accuracies.

READ FULL TEXT

page 9

page 10

page 12

research
05/19/2023

OPTWIN: Drift identification with optimal sub-windows

Online Learning (OL) is a field of research that is increasingly gaining...
research
04/19/2023

Advances on Concept Drift Detection in Regression Tasks using Social Networks Theory

Mining data streams is one of the main studies in machine learning area ...
research
07/05/2021

Detecting Concept Drift With Neural Network Model Uncertainty

Deployed machine learning models are confronted with the problem of chan...
research
04/13/2020

Diverse Instances-Weighting Ensemble based on Region Drift Disagreement for Concept Drift Adaptation

Concept drift refers to changes in the distribution of underlying data a...
research
11/03/2022

Demo: LE3D: A Privacy-preserving Lightweight Data Drift Detection Framework

This paper presents LE3D; a novel data drift detection framework for pre...
research
07/24/2019

Towards AutoML in the presence of Drift: first results

Research progress in AutoML has lead to state of the art solutions that ...
research
09/07/2017

Reservoir of Diverse Adaptive Learners and Stacking Fast Hoeffding Drift Detection Methods for Evolving Data Streams

The last decade has seen a surge of interest in adaptive learning algori...

Please sign up or login with your details

Forgot password? Click here to reset