Drift Forensics of Malware Classifiers


연구 분야: Strategies



학회: AISec '23: Proceedings of the 16th ACM Workshop on Artificial Intelligence and Security


초록

The widespread occurrence of mobile malware still poses a significant security threat to billions of smartphone users. To counter this threat, several machine learning-based detection systems have been proposed within the last decade. These methods have achieved impressive detection results in many settings, without requiring the manual crafting of signatures. Unfortunately, recent research has demonstrated that these systems often suffer from significant performance drops over time if the underlying distribution changes---a phenomenon referred to as concept drift. So far, however, it is still an open question which main factors cause the drift in the data and, in turn, the drop in performance of current detection systems. To address this question, we present a framework for the in-depth analysis of dataset affected by concept drift. The framework allows gaining a better understanding of the root causes of concept drift, a fundamental stepping stone for building robust detection methods. To examine the effectiveness of our framework, we use it to analyze a commonly used dataset for Android malware detection as a first case study. Our analysis yields two key insights into the drift that affects several state-of-the-art methods. First, we find that most of the performance drop can be explained by the rise of two malware families in the dataset. Second, we can determine how the evolution of certain malware families and even goodware samples affects the classifier's performance. Our findings provide a novel perspective on previous evaluations conducted using this dataset and, at the same time, show the potential of the proposed framework to obtain a better understanding of concept drift in mobile malware and related settings.


Author Profile
Theo Chow

King's College London London United Kingdom

United Kingdom
Author Profile
Zeliang Kan

King's College London & University College London London United Kingdom

United Kingdom
Author Profile
Lorenz Linhardt

TU Berlin & BIFOLD Berlin United Kingdom

United Kingdom

📄 논문 정보

발행 연도 2023년
인용수 7
출판 국가 United Kingdom
사이트 ACM
좋아요 수 0

연관 논문 목록 (351건)