Predictive Modeling for Malware Detection in Windows Systems using Ensemble Learning
Figure 1: Diagram of Structure of MALPRED
You can refer to data_cleaning/
for the data cleaning system. However, for a basic idea, refer to the following diagram:
Figure 2: Diagram of Feature Engineering in MALPRED
Models | AUC-ROC | Accuracy | Recall | Precision |
---|---|---|---|---|
Stack Model | 73.03 | 66.45 | 66.25 | 66.49 |
XGBoost | 72.19 | 65.79 | 65.18 | 65.96 |
LightGBM | 71.25 | 65.08 | 64.21 | 65.33 |
CatBoost | 70.92 | 64.78 | 63.50 | 65.15 |
MLP | 70.72 | 64.62 | 63.56 | 64.94 |
GBC | 69.50 | 63.57 | 64.22 | 63.38 |
ADA | 68.75 | 63.14 | 64.87 | 62.68 |
LDA | 66.72 | 61.68 | 57.79 | 62.64 |
QDA | 66.23 | 56.98 | 65.85 | 61.51 |
DT | 57.60 | 57.60 | 57.73 | 57.56 |
Dummy | 50.00 | 50.02 | 00.00 | 00.00 |
Ridge | - | 61.68 | 57.79 | 62.64 |
SVM | - | 61.34 | 51.93 | 63.97 |