DrSec: Flexible Distributed Representations for Efficient Endpoint Security


연구 분야: Safety



학회: 2024 IEEE Symposium on Security and Privacy (SP)


초록

The increasing complexity of attacks has given rise to varied security applications tackling profound tasks, ranging from alert triage to attack reconstruction. Yet, security products, such as Endpoint Detection and Response, bring together applications that are developed in isolation, trigger many false positives, miss actual attacks, and produce limited labels useful in supervised learning schemes. To address these challenges, we propose DrSec—a system employing self-supervised learning to pre-train foundation language models (LMs) that ingest event-sequence data and emit distributed representations for processes. Once pre-trained, the LMs can be adapted to solve different downstream tasks with limited to no supervision, helping unify the currently fractured application ecosystem. We trained DrSec with two LM types on a real-world dataset containing ∼91M processes and ∼2.55B events, and tested it in three application domains. We found that DrSec enables accurate, unsupervised process identification; outperforms leading methods on alert triage to reduce alert fatigue (e.g., 75.11% vs. ≤64.31% precision-recall area under curve); and accurately learns expert-developed rules, allowing tuning incident detectors to control false positives and negatives.


Author Profile
Mahmood Sharif

Tel Aviv University

정보 없음
Author Profile
Pubali Datta

University of Massachusetts Amherst

정보 없음
Author Profile
Andy Riddle

University of Illinois Urbana-Champaign

정보 없음

📄 논문 정보

발행 연도 2024년
인용수 3
출판 국가
사이트 IEEE
좋아요 수 0

연관 논문 목록 (393건)