연구 분야: Safety
학회: 2024 11th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI)
This research enhances the development of natural language processing (NLP) by integrating part of speech (POS) tagging and named entity recognition (NER) techniques to annotate unstructured data from advanced persistent threat (APT) news. This paper uses combination of these two techniques with deep learning methods, specifically bidirectional long short-term memory (BiLSTM) and conditional random field (CRF) to automate data annotation. The objective is to automatically convert the unstructured APT data into structured threat information expression (STIX) format. The proposed approach can be further developed into cyber threat intelligence (CTI) system for information exchange and early warning against cybercrime. This research is currently in the preliminary stage with progress including data collection and preprocessing, and BiLSTM model building. The expected outcome is a model that capable of APT data labeling to enhance CTI systems.
| 발행 연도 | 2024년 |
|---|---|
| 인용수 | 101 |
| 출판 국가 | Indonesia, Andorra |
| 사이트 | IEEE |
| 좋아요 수 | 0 |