Teaching AI the ‘Why’ and ‘How’ of Software Vulnerability Fixes


연구 분야: Strategies



학회: Proceedings of the ACM on Software Engineering, Volume 2, Issue FSE


초록

Understanding software vulnerabilities and their resolutions is crucial for securing modern software systems. This study presents a novel traceability model that links a pair of sentences describing at least one of the three types of semantics (triggers, crash phenomenon and fix action) for a vulnerability in natural language (NL) vulnerability artifacts, to their corresponding pair of code statements. Different from the traditional traceability models, our trace links between a pair of related NL sentences and a pair of code statements can recover the semantic relationship between code statements so that the specific role played by each code statement in a vulnerability can be automatically identified. Our end-to-end approach is implemented in two key steps: VulnExtract and VulnTrace. VulnExtract automatically extracts sentences describing triggers, crash phenomenon and/or fix action for a vulnerability using 37 discourse patterns derived from NL artifacts (CVE summary, bug reports and commit messages). VulnTrace employs pre-trained code search models to trace these sentences to corresponding code statements. Our empirical study, based on 341 CVEs and their associated code snippets, demonstrates the effectiveness of our approach, with recall exceeding 90% in most cases for NL sentence extraction. VulnTrace achieves a Top5 accuracy of over 68.2% for mapping a pair of related NL sentences to corresponding pair of code statements. The end-to-end combined VulnExtract+VulnTrace achieves a Top5 accuracy of 59.6% and 53.1% for mapping two pairs of NL sentences to code statements. These results highlight the potential of our method in automating vulnerability comprehension and reducing manual effort.


Author Profile
Amiao Gao

Southern Methodist University Dallas USA

United States
Author Profile
Zenong Zhang

University of Texas at Dallas Richardson USA

Austria
Author Profile
Simin Wang

Southern Methodist University Dallas USA

United States

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 United States, Austria
사이트 ACM
좋아요 수 0

연관 논문 목록 (102건)