Isolation forests: looking beyond tree depth

11/23/2021
by   David Cortes, et al.
0

The isolation forest algorithm for outlier detection exploits a simple yet effective observation: if taking some multivariate data and making uniformly random cuts across the feature space recursively, it will take fewer such random cuts for an outlier to be left alone in a given subspace as compared to regular observations. The original idea proposed an outlier score based on the tree depth (number of random cuts) required for isolation, but experiments here show that using information about the size of the feature space taken and the number of points assigned to it can result in improved results in many situations without any modification to the tree structure, especially in the presence of categorical features.

READ FULL TEXT
research
09/20/2023

Distribution and volume based scoring for Isolation Forests

We make two contributions to the Isolation Forest method for anomaly and...
research
10/26/2021

Revisiting randomized choices in isolation forests

Isolation forest or "iForest" is an intuitive and widely used algorithm ...
research
10/27/2019

Distance approximation using Isolation Forests

This work briefly explores the possibility of approximating spatial dist...
research
11/06/2018

Extended Isolation Forest

We present an extension to the model-free anomaly detection algorithm, I...
research
08/07/2020

A boosted outlier detection method based on the spectrum of the Laplacian matrix of a graph

This paper explores a new outlier detection algorithm based on the spect...
research
08/14/2023

Quantifying Outlierness of Funds from their Categories using Supervised Similarity

Mutual fund categorization has become a standard tool for the investment...
research
10/18/2022

Multivariate outlier explanations using Shapley values and Mahalanobis distances

For the purpose of explaining multivariate outlyingness, it is shown tha...

Please sign up or login with your details

Forgot password? Click here to reset