| Test It Before You Trust It: Applying Software Testing for Trustworthy In-context Learning | Apr 26, 2025 | In-Context Learning, Philosophy | Code Available | 0 |
| Generative AI to Generate Test Data Generators | Jan 31, 2024 | software testing | Code Available | 0 |
| Comprehensive Evaluation and Insights into the Use of Large Language Models in the Automation of Behavior-Driven Development Acceptance Test Formulation | Mar 22, 2024 | Few-Shot Learning, In-Context Learning | Code Available | 0 |
| Towards Trustworthy GUI Agents: A Survey | Mar 30, 2025 | Decision Making, Sequential Decision Making | Code Available | 0 |
| SECBENCH: A Database of Real Security Vulnerabilities | Oct 31, 2017 | software testing | Code Available | 0 |
| TensorFuzz: Debugging Neural Networks with Coverage-Guided Fuzzing | Jul 28, 2018 | software testing | Code Available | 0 |
| A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks | Aug 25, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Smoke Testing for Machine Learning: Simple Tests to Discover Severe Defects | Sep 3, 2020 | BIG-bench Machine Learning, software testing | Code Available | 0 |
| Assessing Data Augmentation-Induced Bias in Training and Testing of Machine Learning Models | Feb 3, 2025 | Data Augmentation, software testing | Code Available | 0 |
| Test Case Recommendations with Distributed Representation of Code Syntactic Features | Oct 4, 2023 | software testing | Code Available | 0 |