| CoreCodeBench: A Configurable Multi-Scenario Repository-Level Benchmark | Jul 4, 2025 | Bug fixingCode Generation | CodeCode Available | 1 | 5 |
| Unit Test Case Generation with Transformers and Focal Context | Sep 11, 2020 | Denoisingtest driven development | CodeCode Available | 1 | 5 |
| SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner | Jun 10, 2025 | test driven development | CodeCode Available | 1 | 5 |
| TDD-Bench Verified: Can LLMs Generate Tests for Issues Before They Get Resolved? | Dec 3, 2024 | test driven development | CodeCode Available | 1 | 5 |
| Otter: Generating Tests from Issues to Validate SWE Patches | Feb 7, 2025 | test driven development | CodeCode Available | 1 | 5 |
| Comprehensive Evaluation and Insights into the Use of Large Language Models in the Automation of Behavior-Driven Development Acceptance Test Formulation | Mar 22, 2024 | Few-Shot LearningIn-Context Learning | CodeCode Available | 0 | 5 |
| Safety and Performance, Why Not Both? Bi-Objective Optimized Model Compression against Heterogeneous Attacks Toward AI Software Deployment | Jan 2, 2024 | Inference AttackMembership Inference Attack | CodeCode Available | 0 | 5 |
| Safety and Performance, Why not Both? Bi-Objective Optimized Model Compression toward AI Software Deployment | Aug 11, 2022 | Inference AttackMembership Inference Attack | CodeCode Available | 0 | 5 |
| Open Source Evolutionary Computation with Chips-n-Salsa | Dec 2, 2024 | Evolutionary Algorithmstest driven development | —Unverified | 0 | 0 |
| Test-Driven Development for Code Generation | Feb 21, 2024 | Code GenerationHumanEval | —Unverified | 0 | 0 |
| Test-Driven Development of ontologies (extended version) | Dec 19, 2015 | test driven development | —Unverified | 0 | 0 |
| Testing LLMs on Code Generation with Varying Levels of Prompt Specificity | Nov 10, 2023 | Code GenerationSpecificity | —Unverified | 0 | 0 |
| Tests as Prompt: A Test-Driven-Development Benchmark for LLM Code Generation | May 13, 2025 | Code GenerationIn-Context Learning | —Unverified | 0 | 0 |
| TimeGym: Debugging for Time Series Modeling in Python | May 4, 2021 | test driven developmentTime Series | —Unverified | 0 | 0 |
| Unit Testing in ASP Revisited: Language and Test-Driven Development Environment | Jan 4, 2024 | test driven development | —Unverified | 0 | 0 |
| A Comparative Study on the Impact of Test-Driven Development (TDD) and Behavior-Driven Development (BDD) on Enterprise Software Delivery Effectiveness | Nov 5, 2024 | test driven development | —Unverified | 0 | 0 |
| Use Property-Based Testing to Bridge LLM Code Generation and Validation | Jun 23, 2025 | Code Generationtest driven development | —Unverified | 0 | 0 |
| Evaluation-Driven Development of LLM Agents: A Process Model and Reference Architecture | Nov 21, 2024 | test driven development | —Unverified | 0 | 0 |
| Apertium-fin-eng--Rule-based Shallow Machine Translation for WMT 2019 Shared Task | Aug 1, 2019 | Machine Translationtest driven development | —Unverified | 0 | 0 |
| Applied Awareness: Test-Driven GUI Development using Computer Vision and Cryptography | Jun 5, 2020 | test driven development | —Unverified | 0 | 0 |
| From Defects to Demands: A Unified, Iterative, and Heuristically Guided LLM-Based Framework for Automated Software Repair and Requirement Realization | Dec 6, 2024 | Ingenuitytest driven development | —Unverified | 0 | 0 |
| Generating Automotive Code: Large Language Models for Software Development and Verification in Safety-Critical Systems | Jun 4, 2025 | BenchmarkingCode Generation | —Unverified | 0 | 0 |
| LLM4TDD: Best Practices for Test Driven Development Using Large Language Models | Dec 7, 2023 | Program Synthesistest driven development | —Unverified | 0 | 0 |
| More Effective Ontology Authoring with Test-Driven Development | Dec 14, 2018 | test driven development | —Unverified | 0 | 0 |