Reassessing feature-based Android malware detection in a contemporary context
Table 33
Summary of results, comparing originally published accuracies, F1 scores and TPRs (Original) against those of our reimplementations (Ours) and ensemble models.
Where multiple models were evaluated in a study, only the best result is shown for each metric. Where a metric was not reported in the original study, we have indicated —. Bold highlighting is used to show whether the original or reimplemented study produced a better result for each metric. The overall best values for each metric are underlined.