Windows Pe Malware Detection Using Ensemble Learning

Loading...
Publication Logo

Date

2021

Journal Title

Journal ISSN

Volume Title

Publisher

Mdpi

Open Access Color

GOLD

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 1%
Influence
Top 10%
Popularity
Top 1%

Research Projects

Journal Issue

Abstract

In this Internet age, there are increasingly many threats to the security and safety of users daily. One of such threats is malicious software otherwise known as malware (ransomware, Trojans, viruses, etc.). The effect of this threat can lead to loss or malicious replacement of important information (such as bank account details, etc.). Malware creators have been able to bypass traditional methods of malware detection, which can be time-consuming and unreliable for unknown malware. This motivates the need for intelligent ways to detect malware, especially new malware which have not been evaluated or studied before. Machine learning provides an intelligent way to detect malware and comprises two stages: feature extraction and classification. This study suggests an ensemble learning-based method for malware detection. The base stage classification is done by a stacked ensemble of fully-connected and one-dimensional convolutional neural networks (CNNs), whereas the end-stage classification is done by a machine learning algorithm. For a meta-learner, we analyzed and compared 15 machine learning classifiers. For comparison, five machine learning algorithms were used: naive Bayes, decision tree, random forest, gradient boosting, and AdaBoosting. The results of experiments made on the Windows Portable Executable (PE) malware dataset are presented. The best results were obtained by an ensemble of seven neural networks and the ExtraTrees classifier as a final-stage classifier.

Description

Damaševičius, Robertas/0000-0001-9990-1084; Misra, Sanjay/0000-0002-3556-9331; azeez, nureni ayofe/0000-0002-1475-2612

Keywords

malware detection, deep learning, ensemble learning, stacking, malware detection, QA75 Electronic computers. Computer science, stacking, deep learning, ensemble learning, Q Science (General), Information technology, T58.5-58.64

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

Citation

WoS Q

Q3

Scopus Q

Q2
OpenCitations Logo
OpenCitations Citation Count
74

Source

Informatics

Volume

8

Issue

1

Start Page

10

End Page

Collections

PlumX Metrics
Citations

CrossRef : 76

Scopus : 105

Captures

Mendeley Readers : 188

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
11.9873

Sustainable Development Goals

SDG data is not available