Unsupervised anomaly detection for discrete sequence healthcare data

07/20/2020
by Victoria Snorovikhina, et al.

Fraud in healthcare is widespread, as doctors can prescribe unnecessary treatments to inflate bills. Insurance companies want to detect these anomalous, fraudulent bills and reduce their losses. Traditional fraud detection methods rely on expert rules and manual data processing. More recently, machine learning techniques have automated this process, but hand-labeled data is extremely costly and usually out of date. For this reason, unsupervised fraud detection systems are of great importance in healthcare. However, there are almost no applications of unsupervised anomaly detection that process sequential data. To process sequential data, we propose two deep learning approaches: an LSTM neural network that predicts the next patient visit, and a seq2seq model. In both cases we assume that prediction errors indicate anomalous behavior, and we compare different ways to aggregate these errors into an anomaly score for the whole sequence. To normalize anomaly scores, we consider an Empirical Distribution Function (EDF) approach, which lets the algorithm cope with high class imbalance when aggregating errors. We use real data on sequences of patient visits from a major insurance company. The results show that both models outperform a baseline for unsupervised anomaly detection, and that our EDF approach improves the quality of the LSTM model. Moreover, both models provide new state-of-the-art results for unsupervised fraud detection in healthcare insurance.
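The abstract does not spell out how the EDF normalization or the error aggregation is implemented, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that some sequence model has already produced one prediction error per visit, that a pool of reference errors from training data is available, and that the maximum is used as the aggregate. The names `edf_normalize`, `sequence_score`, and `reference_errors`, and all numeric values, are hypothetical.

```python
from bisect import bisect_right


def edf_normalize(errors, reference_errors):
    """Map each error to the fraction of reference errors that are <= it.

    This is the empirical distribution function (EDF) of the reference
    errors, evaluated at each input error. Outputs lie in [0, 1].
    """
    ref = sorted(reference_errors)
    # bisect_right counts how many reference values are <= the given error.
    return [bisect_right(ref, e) / len(ref) for e in errors]


def sequence_score(step_errors, reference_errors, aggregate=max):
    """Aggregate EDF-normalized per-visit errors into one sequence score.

    Assumption: `aggregate=max` is just one possible choice; the abstract
    says several aggregation schemes were compared without naming them.
    """
    return aggregate(edf_normalize(step_errors, reference_errors))


# Hypothetical per-visit prediction errors collected on training sequences.
reference_errors = [0.1, 0.2, 0.2, 0.4, 0.5, 0.7, 0.9, 1.0, 1.5, 3.0]

typical = sequence_score([0.2, 0.4, 0.1], reference_errors)
suspicious = sequence_score([0.2, 2.0, 0.1], reference_errors)
print(typical, suspicious)
```

Because the normalized values are ranks rather than raw magnitudes, a single unusually large error raises the sequence score without letting its absolute scale dominate, which is one plausible reason such a normalization helps when anomalies are rare.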


