Proactively Identifying Emerging Hacker Threats from the Dark Web: A Diachronic Graph Embedding Framework (D-GEF)


연구 분야: Safety



학회: ACM Transactions on Privacy and Security (TOPS), Volume 23, Issue 4


초록

Cybersecurity experts have appraised the total global cost of malicious hacking activities to be $450 billion annually. Cyber Threat Intelligence (CTI) has emerged as a viable approach to combat this societal issue. However, existing processes are criticized as inherently reactive to known threats. To combat these concerns, CTI experts have suggested proactively examining emerging threats in the vast, international online hacker community. In this study, we aim to develop proactive CTI capabilities by exploring online hacker forums to identify emerging threats in terms of popularity and tool functionality. To achieve these goals, we create a novel Diachronic Graph Embedding Framework (D-GEF). D-GEF operates on a Graph-of-Words (GoW) representation of hacker forum text to generate word embeddings in an unsupervised manner. Semantic displacement measures adopted from diachronic linguistics literature identify how terminology evolves. A series of benchmark experiments illustrate D-GEF's ability to generate higher quality than state-of-the-art word embedding models (e.g., word2vec) in tasks pertaining to semantic analogy, clustering, and threat classification. D-GEF's practical utility is illustrated with in-depth case studies on web application and denial of service threats targeting PHP and Windows technologies, respectively. We also discuss the implications of the proposed framework for strategic, operational, and tactical CTI scenarios. All datasets and code are publicly released to facilitate scientific reproducibility and extensions of this work.


Author Profile
Sagar Samtani

Department of Operations and Decision Technologies Indiana University Bloomington Indiana

Andorra
Author Profile
Hongyi Zhu

Department of Information Systems and Cyber Security University of Texas at San Antonio San Antonio TX

Andorra
Author Profile
Hsinchun Chen

Department of Management Information Systems University of Arizona Tucson AZ

Azerbaijan

📄 논문 정보

발행 연도 2020년
인용수 56
출판 국가 Azerbaijan, Andorra
사이트 ACM
좋아요 수 0

연관 논문 목록 (548건)