UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Drift Forensics of Malware Classifiers

Chow, Theo; Kan, Zeliang; Linhardt, Lorenz; Cavallaro, Lorenzo; Arp, Daniel; Pierazzi, Fabio; (2023) Drift Forensics of Malware Classifiers. In: Proceedings of the 16th ACM Workshop on Artificial Intelligence and Security. (pp. pp. 197-207). ACM: Copenhagen, Denmark. Green open access

[thumbnail of Drift Forensics of Malware Classifiers.pdf]
Preview
Text
Drift Forensics of Malware Classifiers.pdf - Accepted Version

Download (1MB) | Preview

Abstract

The widespread occurrence of mobile malware still poses a significant security threat to billions of smartphone users. To counter this threat, several machine learning-based detection systems have been proposed within the last decade. These methods have achieved impressive detection results in many settings, without requiring the manual crafting of signatures. Unfortunately, recent research has demonstrated that these systems often suffer from significant performance drops over time if the underlying distribution changes—a phenomenon referred to as concept drift. So far, however, it is still an open question which main factors cause the drift in the data and, in turn, the drop in performance of current detection systems. To address this question, we present a framework for the indepth analysis of dataset affected by concept drift. The framework allows gaining a better understanding of the root causes of concept drift, a fundamental stepping stone for building robust detection methods. To examine the effectiveness of our framework, we use it to analyze a commonly used dataset for Android malware detection as a first case study. Our analysis yields two key insights into the drift that affects several state-of-the-art methods. First, we find that most of the performance drop can be explained by the rise of two malware families in the dataset. Second, we can determine how the evolution of certain malware families and even goodware samples affects the classifier’s performance. Our findings provide a novel perspective on previous evaluations conducted using this dataset and, at the same time, show the potential of the proposed framework to obtain a better understanding of concept drift in mobile malware and related settings.

Type: Proceedings paper
Title: Drift Forensics of Malware Classifiers
Event: CCS '23: ACM SIGSAC Conference on Computer and Communications Security
Open access status: An open access version is available from UCL Discovery
DOI: 10.1145/3605764.3623918
Publisher version: https://doi.org/10.1145/3605764.3623918
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10182372
Downloads since deposit
34Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item