Deep Learning for Malicious Flow Detection

02/09/2018
by   Yun-Chun Chen, et al.
0

Cyber security has grown up to be a hot issue in recent years. How to identify potential malware becomes a challenging task. To tackle this challenge, we adopt deep learning approaches and perform flow detection on real data. However, real data often encounters an issue of imbalanced data distribution which will lead to a gradient dilution issue. When training a neural network, this problem will not only result in a bias toward the majority class but show the inability to learn from the minority classes. In this paper, we propose an end-to-end trainable Tree-Shaped Deep Neural Network (TSDNN) which classifies the data in a layer-wise manner. To better learn from the minority classes, we propose a Quantity Dependent Backpropagation (QDBP) algorithm which incorporates the knowledge of the disparity between classes. We evaluate our method on an imbalanced data set. Experimental result demonstrates that our approach outperforms the state-of-the-art methods and justifies that the proposed method is able to overcome the difficulty of imbalanced learning. We also conduct a partial flow experiment which shows the feasibility of real-time detection and a zero-shot learning experiment which justifies the generalization capability of deep learning in cyber security.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/31/2020

Deep Learning based Frameworks for Handling Imbalance in DGA, Email, and URL Data Analysis

Deep learning is a state of the art method for a lot of applications. Th...
research
10/13/2022

Rebalanced Zero-shot Learning

Zero-shot learning (ZSL) aims to identify unseen classes with zero sampl...
research
12/08/2017

Class Rectification Hard Mining for Imbalanced Deep Learning

Recognising detailed facial or clothing attributes in images of people i...
research
10/11/2022

Efficient Gaussian Process Model on Class-Imbalanced Datasets for Generalized Zero-Shot Learning

Zero-Shot Learning (ZSL) models aim to classify object classes that are ...
research
05/23/2019

SelectNet: Learning to Sample from the Wild for Imbalanced Data Training

Supervised learning from training data with imbalanced class sizes, a co...
research
07/17/2018

Pseudo-Feature Generation for Imbalanced Data Analysis in Deep Learning

We generate pseudo-features by multivariate probability distributions ob...
research
05/19/2022

Learning-based AC-OPF Solvers on Realistic Network and Realistic Loads

Deep learning approaches for the Alternating Current-Optimal Power Flow ...

Please sign up or login with your details

Forgot password? Click here to reset