| AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct | May 23, 2024 | Class-level Code Generation, Code Completion | Code Available | 4 | 5 |
| Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving | Jul 8, 2025 | Code Repair, Transfer Learning | Code Available | 3 | 5 |
| OctoPack: Instruction Tuning Code Large Language Models | Aug 14, 2023 | Code Generation, Code Repair | Code Available | 3 | 5 |
| Guiding Language Models of Code with Global Context using Monitors | Jun 19, 2023 | Code Completion, Code Generation | Code Available | 2 | 5 |
| SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents | Jun 18, 2024 | Code Generation, Code Repair | Code Available | 2 | 5 |
| Fortran2CPP: Automating Fortran-to-C++ Translation using LLMs via Multi-Turn Dialogue and Dual-Agent Integration | Dec 27, 2024 | C++ code, Code Repair | Code Available | 1 | 5 |
| CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation | Feb 9, 2021 | BIG-bench Machine Learning, Clone Detection | Code Available | 1 | 5 |
| COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis | Aug 9, 2024 | Code Generation, Code Repair | Code Available | 1 | 5 |
| MACER: A Modular Framework for Accelerated Compilation Error Repair | May 28, 2020 | 4k, Code Repair | Code Available | 1 | 5 |
| Learning Performance-Improving Code Edits | Feb 15, 2023 | Code Generation, Code Repair | Code Available | 1 | 5 |
| Break-It-Fix-It: Unsupervised Learning for Program Repair | Jun 11, 2021 | C++ code, Code Repair | Code Available | 1 | 5 |
| INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair | Nov 16, 2023 | Code Generation, Code Repair | Code Available | 1 | 5 |
| CraftRTL: High-quality Synthetic Data Generation for Verilog Code Models with Correct-by-Construction Non-Textual Representations and Targeted Code Repair | Sep 19, 2024 | Code Generation, Code Repair | Code Available | 0 | 5 |
| Why Stop at One Error? Benchmarking LLMs as Data Science Code Debuggers for Multi-Hop and Multi-Bug Errors | Mar 28, 2025 | Benchmarking, Code Generation | Code Available | 0 | 5 |
| CrashFixer: A crash resolution agent for the Linux kernel | Apr 29, 2025 | Code Repair | Unverified | 0 | 0 |
| DeepCode AI Fix: Fixing Security Vulnerabilities with Large Language Models | Feb 19, 2024 | Code Repair, Few-Shot Learning | Unverified | 0 | 0 |
| Investigating the Transferability of Code Repair for Low-Resource Programming Languages | Jun 21, 2024 | Code Generation, Code Repair | Unverified | 0 | 0 |
| Enhanced Automated Code Vulnerability Repair using Large Language Models | Jan 8, 2024 | C++ code, Code Repair | Unverified | 0 | 0 |
| Enhancing Large Language Models for Secure Code Generation: A Dataset-driven Study on Vulnerability Mitigation | Oct 25, 2023 | Code Generation, Code Repair | Unverified | 0 | 0 |
| Enhancing Source Code Security with LLMs: Demystifying The Challenges and Generating Reliable Repairs | Sep 1, 2024 | Code Repair | Unverified | 0 | 0 |
| Code Repair with LLMs gives an Exploration-Exploitation Tradeoff | May 26, 2024 | Code Repair, Language Modeling | Unverified | 0 | 0 |
| Fix Bugs with Transformer through a Neural-Symbolic Edit Grammar | Nov 16, 2021 | Code Repair | Unverified | 0 | 0 |
| Fix Bugs with Transformer through a Neural-Symbolic Edit Grammar | Apr 13, 2022 | Code Repair | Unverified | 0 | 0 |
| CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks | Jul 14, 2025 | Benchmarking, Code Generation | Unverified | 0 | 0 |
| Breakpoint: Scalable evaluation of system-level reasoning in LLM code agents | May 30, 2025 | Benchmarking, Code Repair | Unverified | 0 | 0 |