| CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification | Feb 12, 2025 | 16k, 4k | —Unverified | 0 |
| Identifying Flaky Tests in Quantum Code: A Machine Learning Approach | Feb 6, 2025 | software testing | —Unverified | 0 |
| A Systematic Approach for Assessing Large Language Models' Test Case Generation Capability | Feb 5, 2025 | software testing, Test Case Creation | —Unverified | 0 |
| Assessing Data Augmentation-Induced Bias in Training and Testing of Machine Learning Models | Feb 3, 2025 | Data Augmentation, software testing | Code Available | 0 |
| Toward Neurosymbolic Program Comprehension | Feb 3, 2025 | Code Generation, software testing | —Unverified | 0 |
| Many-Objective Neuroevolution for Testing Games | Jan 14, 2025 | software testing | —Unverified | 0 |
| An efficient approach to represent enterprise web application structure using Large Language Model in the service of Intelligent Quality Engineering | Jan 12, 2025 | Few-Shot Learning, In-Context Learning | —Unverified | 0 |
| The Potential of LLMs in Automating Software Testing: From Generation to Reporting | Dec 31, 2024 | software testing | —Unverified | 0 |
| Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation | Dec 18, 2024 | software testing | —Unverified | 0 |
| Design choices made by LLM-based test generators prevent them from finding bugs | Dec 18, 2024 | software testing | —Unverified | 0 |