Improving Security Tasks Using Compiler Provenance Information Recovered At the Binary-Level


연구 분야: Safety



학회: CCS '23: Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security


초록

The complex optimizations supported by modern compilers allow for compiler provenance recovery at many levels. For instance, it is possible to identify the compiler family and optimization level used when building a binary, as well as the individual compiler passes applied to functions within the binary. Yet, many downstream applications of compiler provenance remain unexplored. To bridge that gap, we train and evaluate a multi-label compiler provenance model on data collected from over 27,000 programs built using LLVM 14, and apply the model to a number of security-related tasks. Our approach considers 68 distinct compiler passes and achieves an average F-1 score of 84.4%. We first use the model to examine the magnitude of compiler-induced vulnerabilities, identifying 53 information leak bugs in 10 popular projects. We also show that several compiler optimization passes introduce a substantial amount of functional code reuse gadgets that negatively impact security. Beyond vulnerability detection, we evaluate other security applications, including using recovered provenance information to verify the correctness of Rich header data in Windows binaries (e.g., forensic analysis), as well as for binary decomposition tasks (e.g., third party library detection).


Author Profile
Yufei Du

Georgia Institute of Technology Atlanta GA USA

Gabon
Author Profile
Omar Alrawi

Georgia Institute of Technology Atlanta GA USA

Gabon
Author Profile
Kevin Z Snow

Zeropoint Dynamics Chapel Hill NC USA

New Caledonia

📄 논문 정보

발행 연도 2023년
인용수 5
출판 국가 Gabon, New Caledonia
사이트 ACM
좋아요 수 0

연관 논문 목록 (114건)