How effective are Graph Neural Networks in Fraud Detection for Network Data?

05/30/2021
by   Ronald D. R. Pereira, et al.
0

Graph-based Neural Networks (GNNs) are recent models created for learning representations of nodes (and graphs), which have achieved promising results when detecting patterns that occur in large-scale data relating different entities. Among these patterns, financial fraud stands out for its socioeconomic relevance and for presenting particular challenges, such as the extreme imbalance between the positive (fraud) and negative (legitimate transactions) classes, and the concept drift (i.e., statistical properties of the data change over time). Since GNNs are based on message propagation, the representation of a node is strongly impacted by its neighbors and by the network's hubs, amplifying the imbalance effects. Recent works attempt to adapt undersampling and oversampling strategies for GNNs in order to mitigate this effect without, however, accounting for concept drift. In this work, we conduct experiments to evaluate existing techniques for detecting network fraud, considering the two previous challenges. For this, we use real data sets, complemented by synthetic data created from a new methodology introduced here. Based on this analysis, we propose a series of improvement points that should be investigated in future research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/22/2021

Ego-GNNs: Exploiting Ego Structures in Graph Neural Networks

Graph neural networks (GNNs) have achieved remarkable success as a frame...
research
02/03/2023

LazyGNN: Large-Scale Graph Neural Networks via Lazy Propagation

Recent works have demonstrated the benefits of capturing long-distance d...
research
10/22/2021

Distance-wise Prototypical Graph Neural Network in Node Imbalance Classification

Recent years have witnessed the significant success of applying graph ne...
research
06/18/2022

Beyond Real-world Benchmark Datasets: An Empirical Study of Node Classification with GNNs

Graph Neural Networks (GNNs) have achieved great success on a node class...
research
09/02/2022

Rethinking Efficiency and Redundancy in Training Large-scale Graphs

Large-scale graphs are ubiquitous in real-world scenarios and can be tra...
research
02/12/2021

Two Sides of the Same Coin: Heterophily and Oversmoothing in Graph Convolutional Neural Networks

Most graph neural networks (GNN) perform poorly in graphs where neighbor...
research
08/02/2023

Exploiting Synthetic Data for Data Imbalance Problems: Baselines from a Data Perspective

We live in a vast ocean of data, and deep neural networks are no excepti...

Please sign up or login with your details

Forgot password? Click here to reset