Explainable AI model for PDFMal detection based on gradient boosting model


연구 분야: Strategies



학회: Neural Computing and Applications


초록

Portable document formats (PDFs) are widely used for document exchange due to their widespread usage and versatility. However, PDFs are highly vulnerable to malware attacks, which pose significant security risks. Existing defense mechanisms often struggle to effectively detect and mitigate these threats, highlighting the need for more robust solutions. This paper introduces a robust framework that uses advanced tree-based ensemble models to detect malicious PDFs using the Evasive-PDFMal2022 dataset. The proposed model achieves a recall rate of 100%, an accuracy rate of 99.95%, and a fast inference time of 0.1723 s. Furthermore, the framework exhibits minimal false positive and false negative rates, ensuring a high level of precision in distinguishing between malicious and benign PDFs. Shapley additive explanations are used to improve the interpretability and reliability of the model’s predictions. The results highlight the effectiveness of the proposed model in improving PDF document security and addressing the challenges posed by malware attacks.


Author Profile
Mona Elattar

Department of Computer Science Faculty of Science Tanta University Tanta 31527 Egypt

Egypt
Author Profile
Ahmed Younes

Department of Mathematics and Computer Science Faculty of Science Alexandria University Alexandria 21511 Egypt

Andorra
Author Profile
Ibrahim Gad

Department of Mathematics and Computer Science Faculty of Science Alexandria University Alexandria 21511 Egypt

Andorra

📄 논문 정보

발행 연도 2024년
인용수 0
출판 국가 Egypt, Andorra
사이트 Springer
좋아요 수 0

연관 논문 목록 (13건)