Active Anomaly Detection via Ensembles

09/17/2018
by Shubhomoy Das, et al.

In critical applications of anomaly detection, including computer security and fraud prevention, the anomaly detector must be configurable by the analyst to minimize the effort spent on false positives. One important way to configure the anomaly detector is by providing true labels for a few instances. We study the problem of label-efficient active learning to automatically tune anomaly detection ensembles and make four main contributions. First, we present an important insight into how anomaly detector ensembles are naturally suited for active learning. This insight allows us to relate the greedy querying strategy to uncertainty sampling, with implications for label-efficiency. Second, we present a novel formalism called compact description to describe the discovered anomalies and show that it can also be employed to improve the diversity of the instances presented to the analyst without loss in the anomaly discovery rate. Third, we present a novel data drift detection algorithm that not only detects drift robustly but also allows us to take corrective actions to adapt the detector in a principled manner. Fourth, we present extensive experiments to evaluate our insights and algorithms in both batch and streaming settings. Our results show that, in addition to discovering significantly more anomalies than state-of-the-art unsupervised baselines, our active learning algorithms under the streaming-data setup are competitive with the batch setup.
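To make the greedy querying idea concrete, the following is a minimal sketch, not the authors' algorithm. It assumes the ensemble's output is a linear combination of member scores, that the analyst is modelled by a hypothetical `oracle` callable, and that weights are adjusted with a simple additive update followed by renormalization. The function name `greedy_active_loop`, the learning rate `lr`, and the update rule are all illustrative assumptions and are not taken from the paper.

```python
import numpy as np


def greedy_active_loop(scores, oracle, budget, lr=1.0):
    """Greedy label querying over an anomaly detector ensemble (illustrative).

    scores : (n, m) array, one column per ensemble member, higher = more anomalous.
    oracle : hypothetical callable, index -> bool (True if the analyst confirms an anomaly).
    budget : maximum number of label queries.
    Returns (queried indices, labels, final weights).
    """
    scores = np.asarray(scores, dtype=float)
    n, m = scores.shape
    # Start from uniform weights with unit norm: every member trusted equally.
    w = np.full(m, 1.0 / np.sqrt(m))
    unlabeled = np.ones(n, dtype=bool)
    queried, labels = [], []

    for _ in range(min(budget, n)):
        combined = scores @ w
        combined[~unlabeled] = -np.inf       # never re-query a labeled instance
        i = int(np.argmax(combined))         # greedy: top-ranked unlabeled instance
        y = bool(oracle(i))
        queried.append(i)
        labels.append(y)
        unlabeled[i] = False

        # Assumed update rule: move toward members that scored a confirmed
        # anomaly highly, and away from them after a false positive.
        w = w + lr * (scores[i] if y else -scores[i])
        norm = np.linalg.norm(w)
        # Fall back to uniform weights if the update cancels everything out.
        w = w / norm if norm > 0 else np.full(m, 1.0 / np.sqrt(m))

    return queried, labels, w
```

Under these assumptions, each confirmed anomaly shifts weight toward the ensemble members that ranked it highly, so later queries can differ from the fixed ranking an unsupervised, uniformly weighted ensemble would produce.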

research
01/23/2019

Active Anomaly Detection via Ensembles: Insights, Algorithms, and Interpretability

Anomaly detection (AD) task corresponds to identifying the true anomalie...
research
04/24/2021

Supervised Anomaly Detection via Conditional Generative Adversarial Network and Ensemble Active Learning

Anomaly detection has wide applications in machine intelligence but is s...
research
01/07/2023

How to Allocate your Label Budget? Choosing between Active Learning and Learning to Reject in Anomaly Detection

Anomaly detection attempts at finding examples that deviate from the exp...
research
10/28/2022

Learning to Detect Interesting Anomalies

Anomaly detection algorithms are typically applied to static, unchanging...
research
08/30/2017

Incorporating Feedback into Tree-based Anomaly Detection

Anomaly detectors are often used to produce a ranked list of statistical...
research
01/25/2022

Little Help Makes a Big Difference: Leveraging Active Learning to Improve Unsupervised Time Series Anomaly Detection

Key Performance Indicators (KPI), which are essentially time series data...
research
07/08/2022

Active Learning-based Isolation Forest (ALIF): Enhancing Anomaly Detection in Decision Support Systems

The detection of anomalous behaviours is an emerging need in many applic...
