A Feature Set of Small Size for the PDF Malware Detection

08/09/2023
by   Ran Liu, et al.
0

Machine learning (ML)-based malware detection systems are becoming increasingly important as malware threats increase and get more sophisticated. PDF files are often used as vectors for phishing attacks because they are widely regarded as trustworthy data resources, and are accessible across different platforms. Therefore, researchers have developed many different PDF malware detection methods. Performance in detecting PDF malware is greatly influenced by feature selection. In this research, we propose a small features set that don't require too much domain knowledge of the PDF file. We evaluate proposed features with six different machine learning models. We report the best accuracy of 99.75 set, which consists of just 12 features, is one of the most conciseness in the field of PDF malware detection. Despite its modest size, we obtain comparable results to state-of-the-art that employ a much larger set of features.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/12/2022

Machine Learning for Detecting Malware in PE Files

The increasing number of sophisticated malware poses a major cybersecuri...
research
05/03/2023

Can Feature Engineering Help Quantum Machine Learning for Malware Detection?

With the increasing number and sophistication of malware attacks, malwar...
research
07/05/2022

Malware and Ransomware Detection Models

Cybercrime is one of the major digital threats of this century. In parti...
research
08/01/2019

KiloGrams: Very Large N-Grams for Malware Classification

N-grams have been a common tool for information retrieval and machine le...
research
12/10/2017

Improving Malware Detection Accuracy by Extracting Icon Information

Detecting PE malware files is now commonly approached using statistical ...
research
06/12/2018

Static Malware Detection & Subterfuge: Quantifying the Robustness of Machine Learning and Current Anti-Virus

As machine-learning (ML) based systems for malware detection become more...
research
01/22/2021

A novel DL approach to PE malware detection: exploring Glove vectorization, MCC_RCNN and feature fusion

In recent years, malware becomes more threatening. Concerning the increa...

Please sign up or login with your details

Forgot password? Click here to reset