Harden and Catch for Just-in-Time Assured LLM-Based Software Testing: Open Research Challenges


연구 분야: Verification



학회: FSE Companion '25: Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering


초록

Despite decades of research and practice in automated software testing, several fundamental concepts remain ill-defined and under-explored, yet offer enormous potential real-world impact. We show that these concepts raise exciting new challenges in the context of Large Language Models for software test generation. More specifically, we formally define and investigate the properties of hardening and catching tests. A hardening test is one that seeks to protect against future regressions, while a catching test is one that catches such a regression or a fault in new functionality introduced by a code change. Hardening tests can be generated at any time and may become catching tests when a future regression is caught. We also define and motivate the Catching 'Just-in-Time' (JiTTest) Challenge, in which tests are generated 'just-in-time' to catch new faults before they land into production. We show that any solution to Catching JiTTest generation can also be repurposed to catch latent faults in legacy code. We enumerate possible outcomes for hardening and catching tests and JiTTests, and discuss open research problems, deployment options, and initial results from our work on automated LLM-based hardening at Meta. This paper1 was written to accompany the keynote by the authors at the ACM International Conference on the Foundations of Software Engineering (FSE) 2025.


Author Profile
Mark Harman

Meta platforms and UCL London United Kingdom

Andorra
Author Profile
Peter W O'Hearn

Meta platforms and UCL London United Kingdom

Andorra
Author Profile
Shubho Sengupta

Independent consultant San Francisco USA

United States

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 Andorra, United States
사이트 ACM
좋아요 수 0

연관 논문 목록 (58건)