| SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents | Jun 18, 2024 | Code Generation, Code Repair | Code Available | 2 | 5 |
| GPflow: A Gaussian process library using TensorFlow | Oct 27, 2016 | Gaussian Processes, GPU | Code Available | 2 | 5 |
| On the Challenges of Fuzzing Techniques via Large Language Models | Feb 1, 2024 | software testing, Survey | Code Available | 2 | 5 |
| CoverUp: Effective High Coverage Test Generation for Python | Mar 24, 2024 | software testing | Code Available | 2 | 5 |
| Boosting Synthetic Data Generation with Effective Nonlinear Causal Discovery | Jan 18, 2023 | Causal Discovery, software testing | Code Available | 1 | 5 |
| Towards Principled Representation Learning from Videos for Reinforcement Learning | Mar 20, 2024 | Contrastive Learning, reinforcement-learning | Code Available | 1 | 5 |
| Perfect is the enemy of test oracle | Feb 3, 2023 | software testing | Code Available | 1 | 5 |
| Leveraging Large Language Models for Enhancing the Understandability of Generated Unit Tests | Aug 21, 2024 | Bug fixing, Descriptive | Code Available | 1 | 5 |
| Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond | Apr 11, 2023 | software testing | Code Available | 1 | 5 |
| Software Engineering for AI-Based Systems: A Survey | May 5, 2021 | Autonomous Driving, software testing | Code Available | 1 | 5 |
| Black-box Explanation of Object Detectors via Saliency Maps | Jun 5, 2020 | Object, object-detection | Code Available | 1 | 5 |
| Efficient and Effective Generation of Test Cases for Pedestrian Detection -- Search-based Software Testing of Baidu Apollo in SVL | Sep 16, 2021 | Autonomous Driving, Autonomous Vehicles | Code Available | 1 | 5 |
| QuanTest: Entanglement-Guided Testing of Quantum Neural Network Systems | Feb 20, 2024 | software testing | Code Available | 0 | 5 |
| Smoke Testing for Machine Learning: Simple Tests to Discover Severe Defects | Sep 3, 2020 | BIG-bench Machine Learning, software testing | Code Available | 0 | 5 |
| Boosting Operational DNN Testing Efficiency through Conditioning | Jun 6, 2019 | DNN Testing, software testing | Code Available | 0 | 5 |
| Can Search-Based Testing with Pareto Optimization Effectively Cover Failure-Revealing Test Inputs? | Oct 15, 2024 | software testing | Code Available | 0 | 5 |
| Business Negotiation Definition Language | Jan 4, 2020 | software testing | Code Available | 0 | 5 |
| Fairness-aware Configuration of Machine Learning Libraries | Feb 13, 2022 | BIG-bench Machine Learning, Fairness | Code Available | 0 | 5 |
| Differential testing for machine learning: an analysis for classification algorithms beyond deep learning | Jul 25, 2022 | Deep Learning, software testing | Code Available | 0 | 5 |
| TensorFuzz: Debugging Neural Networks with Coverage-Guided Fuzzing | Jul 28, 2018 | software testing | Code Available | 0 | 5 |
| SECBENCH: A Database of Real Security Vulnerabilities | Oct 31, 2017 | software testing | Code Available | 0 | 5 |
| Test Case Recommendations with Distributed Representation of Code Syntactic Features | Oct 4, 2023 | software testing | Code Available | 0 | 5 |
| Reasoning-Based Software Testing | Mar 2, 2023 | Causal Discovery, software testing | Code Available | 0 | 5 |
| Towards Trustworthy GUI Agents: A Survey | Mar 30, 2025 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |
| A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks | Aug 25, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Recurrent Neural Networks for Fuzz Testing Web Browsers | Dec 12, 2018 | Model Selection, software testing | Code Available | 0 | 5 |
| Testing the Channels of Convolutional Neural Networks | Mar 6, 2023 | channel selection, software testing | Code Available | 0 | 5 |
| Navigating the growing field of research on AI for software testing -- the taxonomy for AI-augmented software testing and an ontology-driven literature survey | Jun 17, 2025 | software testing | Code Available | 0 | 5 |
| Assessing Data Augmentation-Induced Bias in Training and Testing of Machine Learning Models | Feb 3, 2025 | Data Augmentation, software testing | Code Available | 0 | 5 |
| An Efficiency Study for SPLADE Models | Jul 8, 2022 | Retrieval, software testing | Code Available | 0 | 5 |
| Genetic Micro-Programs for Automated Software Testing with Large Path Coverage | Feb 14, 2023 | software testing | Code Available | 0 | 5 |
| InterEvo-TR: Interactive Evolutionary Test Generation With Readability Assessment | Jan 13, 2024 | software testing | Code Available | 0 | 5 |
| Comprehensive Evaluation and Insights into the Use of Large Language Models in the Automation of Behavior-Driven Development Acceptance Test Formulation | Mar 22, 2024 | Few-Shot Learning, In-Context Learning | Code Available | 0 | 5 |
| An Autonomous Performance Testing Framework using Self-Adaptive Fuzzy Reinforcement Learning | Aug 19, 2019 | reinforcement-learning, Reinforcement Learning | Code Available | 0 | 5 |
| Generative AI to Generate Test Data Generators | Jan 31, 2024 | software testing | Code Available | 0 | 5 |
| IRG: Generating Synthetic Relational Databases using Deep Learning with Insightful Relational Understanding | Dec 23, 2023 | Data Augmentation, Generative Adversarial Network | Code Available | 0 | 5 |
| Pipelines for Social Bias Testing of Large Language Models | May 1, 2022 | software testing | Code Available | 0 | 5 |
| LLM-Powered Test Case Generation for Detecting Bugs in Plausible Programs | Apr 16, 2024 | software testing | Code Available | 0 | 5 |
| Test It Before You Trust It: Applying Software Testing for Trustworthy In-context Learning | Apr 26, 2025 | In-Context Learning, Philosophy | Code Available | 0 | 5 |
| Comparative Study of Machine Learning Test Case Prioritization for Continuous Integration Testing | Apr 22, 2022 | BIG-bench Machine Learning, software testing | Unverified | 0 | 0 |
| Column Generation for Interaction Coverage in Combinatorial Software Testing | Dec 19, 2017 | software testing | Unverified | 0 | 0 |
| Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM | Jan 31, 2024 | software testing | Unverified | 0 | 0 |
| Artificial Intelligence in Software Testing : Impact, Problems, Challenges and Prospect | Jan 14, 2022 | Autonomous Vehicles, software testing | Unverified | 0 | 0 |
| An Algorithm for Generating Gap-Fill Multiple Choice Questions of an Expert System | Sep 17, 2021 | Multiple-choice, software testing | Unverified | 0 | 0 |
| CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification | Feb 12, 2025 | 16k, 4k | Unverified | 0 | 0 |
| Artificial intelligence for context-aware visual change detection in software test automation | May 1, 2024 | Change Detection, software testing | Unverified | 0 | 0 |
| Can ChatGPT advance software testing intelligence? An experience report on metamorphic testing | Oct 30, 2023 | Chatbot, software testing | Unverified | 0 | 0 |
| Failed Disruption Propagation in Integer Genetic Programming | Apr 4, 2022 | Diversity, software testing | Unverified | 0 | 0 |
| Exploring the Integration of Large Language Models in Industrial Test Maintenance Processes | Sep 10, 2024 | software testing | Unverified | 0 | 0 |
| A Novel Multiple Ensemble Learning Models Based on Different Datasets for Software Defect Prediction | Aug 30, 2020 | Ensemble Learning, software testing | Unverified | 0 | 0 |