Classification and online clustering of zero-day malware


연구 분야: Safety



학회: Journal of Computer Virology and Hacking Techniques


초록

A large amount of new malware is constantly being generated, which must not only be distinguished from benign samples, but also classified into malware families. For this purpose, investigating how existing malware families are developed and examining emerging families need to be explored. This paper focuses on the online processing of incoming malicious samples to assign them to existing families or, in the case of samples from new families, to cluster them. We experimented with seven prevalent malware families from the EMBER dataset, four in the training set and three additional new families in the test set. The features were extracted by static analysis of portable executable files for the Windows operating system. Based on the classification score of the multilayer perceptron, we determined which samples would be classified and which would be clustered into new malware families. We classified 97.21% of streaming data with a balanced accuracy of 95.33%. Then, we clustered the remaining data using a self-organizing map, achieving a purity from 47.61% for four clusters to 77.68% for ten clusters. These results indicate that our approach has the potential to be applied to the classification and clustering of zero-day malware into malware families.


Author Profile
Martin Jureček

Faculty of Information Technology Czech Technical University in Prague Prague Czechia

India
Author Profile
Olha Jurečková

Faculty of Information Technology Czech Technical University in Prague Prague Czechia

India
Author Profile
Mark Stamp

Department of Computer Science San Jose State University San Jose CA USA

Canada

📄 논문 정보

발행 연도 2024년
인용수 0
출판 국가 India, Canada
사이트 Springer
좋아요 수 0

연관 논문 목록 (271건)