| Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees | Jun 17, 2025 | Code TranslationHumanEval | —Unverified | 0 |
| Navigating the growing field of research on AI for software testing -- the taxonomy for AI-augmented software testing and an ontology-driven literature survey | Jun 17, 2025 | software testing | CodeCode Available | 0 |
| IntenTest: Stress Testing for Intent Integrity in API-Calling LLM Agents | Jun 9, 2025 | software testing | —Unverified | 0 |
| The Impact of Software Testing with Quantum Optimization Meets Machine Learning | Jun 2, 2025 | Defect Detectionsoftware testing | —Unverified | 0 |
| EvoGPT: Enhancing Test Suite Robustness via LLM-Based Generation and Genetic Optimization | May 18, 2025 | DiversityFault Detection | —Unverified | 0 |
| On the Need for a Statistical Foundation in Scenario-Based Testing of Autonomous Vehicles | May 4, 2025 | Autonomous Vehiclessoftware testing | —Unverified | 0 |
| Automated Unit Test Case Generation: A Systematic Literature Review | Apr 29, 2025 | software testingSystematic Literature Review | —Unverified | 0 |
| Test It Before You Trust It: Applying Software Testing for Trustworthy In-context Learning | Apr 26, 2025 | In-Context LearningPhilosophy | CodeCode Available | 0 |
| Harden and Catch for Just-in-Time Assured LLM-Based Software Testing: Open Research Challenges | Apr 23, 2025 | software testing | —Unverified | 0 |
| Expectations vs Reality -- A Secondary Study on AI Adoption in Software Testing | Apr 7, 2025 | software testing | —Unverified | 0 |
| From Code Generation to Software Testing: AI Copilot with Context-Based RAG | Apr 2, 2025 | ChatbotCode Generation | —Unverified | 0 |
| Towards Trustworthy GUI Agents: A Survey | Mar 30, 2025 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Integrating Artificial Intelligence with Human Expertise: An In-depth Analysis of ChatGPT's Capabilities in Generating Metamorphic Relations | Mar 28, 2025 | software testing | —Unverified | 0 |
| Vulnerability Detection: From Formal Verification to Large Language Models and Hybrid Approaches: A Comprehensive Overview | Mar 13, 2025 | Automated Theorem Provingsoftware testing | —Unverified | 0 |
| Rule-Guided Reinforcement Learning Policy Evaluation and Improvement | Mar 12, 2025 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| ToolFuzz -- Automated Agent Tool Testing | Mar 6, 2025 | Large Language ModelPrompt Engineering | —Unverified | 0 |
| WIP: Assessing the Effectiveness of ChatGPT in Preparatory Testing Activities | Mar 5, 2025 | software testing | —Unverified | 0 |
| Towards Reliable LLM-Driven Fuzz Testing: Vision and Road Ahead | Mar 2, 2025 | software testingvalid | —Unverified | 0 |
| CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification | Feb 12, 2025 | 16k4k | —Unverified | 0 |
| Identifying Flaky Tests in Quantum Code: A Machine Learning Approach | Feb 6, 2025 | software testing | —Unverified | 0 |
| A Systematic Approach for Assessing Large Language Models' Test Case Generation Capability | Feb 5, 2025 | software testingTest Case Creation | —Unverified | 0 |
| Assessing Data Augmentation-Induced Bias in Training and Testing of Machine Learning Models | Feb 3, 2025 | Data Augmentationsoftware testing | CodeCode Available | 0 |
| Toward Neurosymbolic Program Comprehension | Feb 3, 2025 | Code Generationsoftware testing | —Unverified | 0 |
| Many-Objective Neuroevolution for Testing Games | Jan 14, 2025 | software testing | —Unverified | 0 |
| An efficient approach to represent enterprise web application structure using Large Language Model in the service of Intelligent Quality Engineering | Jan 12, 2025 | Few-Shot LearningIn-Context Learning | —Unverified | 0 |
| The Potential of LLMs in Automating Software Testing: From Generation to Reporting | Dec 31, 2024 | software testing | —Unverified | 0 |
| Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation | Dec 18, 2024 | software testing | —Unverified | 0 |
| Design choices made by LLM-based test generators prevent them from finding bugs | Dec 18, 2024 | software testing | —Unverified | 0 |
| CPP-UT-Bench: Can LLMs Write Complex Unit Tests in C++? | Dec 3, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Software testing for project report. | Nov 20, 2024 | software testing | —Unverified | 0 |
| VALTEST: Automated Validation of Language Model Generated Test Cases | Nov 13, 2024 | HumanEvalLanguage Modeling | —Unverified | 0 |
| Can Search-Based Testing with Pareto Optimization Effectively Cover Failure-Revealing Test Inputs? | Oct 15, 2024 | software testing | CodeCode Available | 0 |
| TAEGAN: Generating Synthetic Tabular Data For Data Augmentation | Oct 2, 2024 | Data AugmentationGenerative Adversarial Network | —Unverified | 0 |
| On the Effectiveness of LLMs for Manual Test Verifications | Sep 19, 2024 | 4ksoftware testing | —Unverified | 0 |
| Computer Vision Intelligence Test Modeling and Generation: A Case Study on Smart OCR | Sep 14, 2024 | 3D ClassificationOptical Character Recognition | —Unverified | 0 |
| Exploring the Integration of Large Language Models in Industrial Test Maintenance Processes | Sep 10, 2024 | software testing | —Unverified | 0 |
| The Future of Software Testing: AI-Powered Test Case Generation and Validation | Sep 9, 2024 | Overall - Testsoftware testing | —Unverified | 0 |
| The Role of Artificial Intelligence and Machine Learning in Software Testing | Sep 4, 2024 | Defect Detectionsoftware testing | —Unverified | 0 |
| Testing and Evaluation of Large Language Models: Correctness, Non-Toxicity, and Fairness | Aug 31, 2024 | FairnessLanguage Modeling | —Unverified | 0 |
| Leveraging Large Language Models for Enhancing the Understandability of Generated Unit Tests | Aug 21, 2024 | Bug fixingDescriptive | CodeCode Available | 1 |
| A System for Automated Unit Test Generation Using Large Language Models and Assessment of Generated Test Suites | Aug 14, 2024 | Code Generationsoftware testing | —Unverified | 0 |
| MAO: A Framework for Process Model Generation with Multi-Agent Orchestration | Aug 4, 2024 | Hallucinationsoftware testing | —Unverified | 0 |
| FuzzTheREST: An Intelligent Automated Black-box RESTful API Fuzzer | Jul 19, 2024 | Reinforcement Learning (RL)software testing | —Unverified | 0 |
| SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents | Jun 18, 2024 | Code GenerationCode Repair | CodeCode Available | 2 |
| Data Augmentation by Fuzzing for Neural Test Generation | Jun 12, 2024 | Code GenerationData Augmentation | —Unverified | 0 |
| BugBlitz-AI: An Intelligent QA Assistant | May 17, 2024 | software testing | —Unverified | 0 |
| Artificial intelligence for context-aware visual change detection in software test automation | May 1, 2024 | Change Detectionsoftware testing | —Unverified | 0 |
| Fuzzy Inference System for Test Case Prioritization in Software Testing | Apr 25, 2024 | software testing | —Unverified | 0 |
| LLM-Powered Test Case Generation for Detecting Bugs in Plausible Programs | Apr 16, 2024 | software testing | CodeCode Available | 0 |
| Tasks People Prompt: A Taxonomy of LLM Downstream Tasks in Software Verification and Falsification Approaches | Apr 14, 2024 | ArticlesFault localization | —Unverified | 0 |