PERFUME: Programmatic Extraction and Refinement for Usability of Mathematical Expression


연구 분야: Safety



학회: Checkmate '21: Proceedings of the 2021 Research on offensive and defensive techniques in the Context of Man At The End (MATE) Attacks


초록

Algorithmic identification is the crux for several binary analysis applications, including malware analysis, vulnerability discovery, and embedded firmware reverse engineering. However, data-driven and signature-based approaches often break down when encountering outlier realizations of a particular algorithm. Moreover, reverse engineering of domain-specific binaries often requires collaborative analysis between reverse engineers and domain experts. Communicating the behavior of an unidentified binary program to non-reverse engineers necessitates the recovery of algorithmic semantics in a human-digestible form. This paper presents PERFUME, a framework that extracts symbolic math expressions from low-level binary representations of an algorithm. PERFUME works by translating a symbolic output representation of a binary function to a high-level mathematical expression. In particular, we detail how source and target representations are generated for training a machine translation model. We integrate PERFUME as a plug-in for Ghidra--an open-source reverse engineering framework. We present our preliminary findings for domain-specific use cases and formalize open challenges in mathematical expression extraction from algorithmic implementations.


Author Profile
Nicolaas Weideman

University of Southern California Marina Del Rey CA USA

Canada
Author Profile
Virginia K Felkner

University of Southern California Marina Del Rey CA USA

Canada
Author Profile
Weicheng Wu

University of Southern California Marina Del Rey CA USA

Canada

📄 논문 정보

발행 연도 2021년
인용수 4
출판 국가 Canada
사이트 ACM
좋아요 수 0

연관 논문 목록 (297건)