| Unit Test Case Generation with Transformers and Focal Context | Sep 11, 2020 | Denoising, test driven development | Code Available | 1 | 5 |
| SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner | Jun 10, 2025 | test driven development | Code Available | 1 | 5 |
| TDD-Bench Verified: Can LLMs Generate Tests for Issues Before They Get Resolved? | Dec 3, 2024 | test driven development | Code Available | 1 | 5 |
| Otter: Generating Tests from Issues to Validate SWE Patches | Feb 7, 2025 | test driven development | Code Available | 1 | 5 |
| CoreCodeBench: A Configurable Multi-Scenario Repository-Level Benchmark | Jul 4, 2025 | Bug fixing, Code Generation | Code Available | 1 | 5 |
| Comprehensive Evaluation and Insights into the Use of Large Language Models in the Automation of Behavior-Driven Development Acceptance Test Formulation | Mar 22, 2024 | Few-Shot Learning, In-Context Learning | Code Available | 0 | 5 |
| Safety and Performance, Why not Both? Bi-Objective Optimized Model Compression toward AI Software Deployment | Aug 11, 2022 | Inference Attack, Membership Inference Attack | Code Available | 0 | 5 |
| Safety and Performance, Why Not Both? Bi-Objective Optimized Model Compression against Heterogeneous Attacks Toward AI Software Deployment | Jan 2, 2024 | Inference Attack, Membership Inference Attack | Code Available | 0 | 5 |
| From Defects to Demands: A Unified, Iterative, and Heuristically Guided LLM-Based Framework for Automated Software Repair and Requirement Realization | Dec 6, 2024 | Ingenuity, test driven development | Unverified | 0 | 0 |
| Apertium-fin-eng--Rule-based Shallow Machine Translation for WMT 2019 Shared Task | Aug 1, 2019 | Machine Translation, test driven development | Unverified | 0 | 0 |