Harden and Catch for Just-in-Time Assured LLM-Based Software Testing: Open Research Challenges

April 23, 2025 · Declared Dead · 🏛 SIGSOFT FSE Companion

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Mark Harman, Peter O'Hearn, Shubho Sengupta arXiv ID 2504.16472 Category cs.SE: Software Engineering Cross-listed cs.AI Citations 3 Venue SIGSOFT FSE Companion Last Checked 3 months ago

Abstract

Despite decades of research and practice in automated software testing, several fundamental concepts remain ill-defined and under-explored, yet offer enormous potential real-world impact. We show that these concepts raise exciting new challenges in the context of Large Language Models for software test generation. More specifically, we formally define and investigate the properties of hardening and catching tests. A hardening test is one that seeks to protect against future regressions, while a catching test is one that catches such a regression or a fault in new functionality introduced by a code change. Hardening tests can be generated at any time and may become catching tests when a future regression is caught. We also define and motivate the Catching 'Just-in-Time' (JiTTest) Challenge, in which tests are generated 'just-in-time' to catch new faults before they land into production. We show that any solution to Catching JiTTest generation can also be repurposed to catch latent faults in legacy code. We enumerate possible outcomes for hardening and catching tests and JiTTests, and discuss open research problems, deployment options, and initial results from our work on automated LLM-based hardening at Meta. This paper was written to accompany the keynote by the authors at the ACM International Conference on the Foundations of Software Engineering (FSE) 2025. Author order is alphabetical. The corresponding author is Mark Harman.