A lightweight framework for function name reassignment based on large-scale stripped binaries


Research area: Analysis



Conference: ISSTA 2021: Proceedings of the 30th ACM SIGSOFT International Symposium on Software Testing and Analysis


Abstract

Software in the wild is usually released as stripped binaries that contain no debug information (e.g., function names). This paper studies the issue of reassigning descriptive names for functions to help facilitate reverse engineering. Since the essence of this issue is a data-driven prediction task, persuasive research should be based on sufficiently large-scale and diverse data. However, prior studies can only be based on small-scale datasets because their techniques suffer from heavyweight binary analysis, making them powerless in the face of big-size and large-scale binaries. This paper presents the Neural Function Rename Engine (NFRE), a lightweight framework for function name reassignment that utilizes both sequential and structural information of assembly code. NFRE uses fine-grained and easily acquired features to model assembly code, making it more effective and efficient than existing techniques. In addition, we construct a large-scale dataset and present two data-preprocessing approaches to help improve its usability. Benefiting from the lightweight design, NFRE can be efficiently trained on the large-scale dataset, thereby having better generalization capability for unknown functions. The comparative experiments show that NFRE outperforms two existing techniques by a relative improvement of 32% and 16%, respectively, while the time cost for binary analysis is much less.


Author Profile
Han Gao

University of Science and Technology of China, China

Shaoyin Cheng

University of Science and Technology of China, China

Yinxing Xue

University of Science and Technology of China, China


📄 Paper Information

Publication year: 2021
Citations: 24
Publication country: Andorra
Site: ACM

Related papers (33)