| SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering | May 6, 2024 | Bug fixingLanguage Modeling | CodeCode Available | 11 |
| AutoCodeRover: Autonomous Program Improvement | Apr 8, 2024 | Bug fixingCode Search | CodeCode Available | 7 |
| GPT-4 Technical Report | Mar 15, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| SWE-bench: Can Language Models Resolve Real-World GitHub Issues? | Oct 10, 2023 | Bug fixingCode Generation | CodeCode Available | 4 |
| CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and Reranking | Dec 1, 2024 | Bug fixingCode Generation | CodeCode Available | 2 |
| CodeR: Issue Resolving with Multi-Agent and Task Graphs | Jun 3, 2024 | Bug fixing | CodeCode Available | 2 |
| SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development | May 22, 2025 | Bug fixingChatbot | CodeCode Available | 2 |
| From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging | Oct 2, 2024 | Auto DebuggingBug fixing | CodeCode Available | 2 |
| MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMs | Nov 5, 2024 | Bug fixingCode Generation | CodeCode Available | 1 |
| Leveraging Large Language Models for Enhancing the Understandability of Generated Unit Tests | Aug 21, 2024 | Bug fixingDescriptive | CodeCode Available | 1 |
| CoditT5: Pretraining for Source Code and Natural Language Editing | Aug 10, 2022 | Bug fixingLanguage Modeling | CodeCode Available | 1 |
| RoPGen: Towards Robust Code Authorship Attribution via Automatic Coding Style Transformation | Feb 12, 2022 | Authorship AttributionBug fixing | CodeCode Available | 1 |
| Empirical Study of Transformers for Source Code | Oct 15, 2020 | Bug fixingCode Completion | CodeCode Available | 1 |
| FixEval: Execution-based Evaluation of Program Fixes for Programming Problems | Jun 15, 2022 | Bug fixing | CodeCode Available | 1 |
| D2A: A Dataset Built for AI-Based Vulnerability Detection Methods Using Differential Analysis | Feb 16, 2021 | Bug fixingVulnerability Detection | CodeCode Available | 1 |
| A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source Code | Oct 23, 2020 | Bug fixingCode Completion | CodeCode Available | 1 |
| CoreCodeBench: A Configurable Multi-Scenario Repository-Level Benchmark | Jul 4, 2025 | Bug fixingCode Generation | CodeCode Available | 1 |
| Neural Transfer Learning for Repairing Security Vulnerabilities in C Code | Apr 16, 2021 | Bug fixingC++ code | CodeCode Available | 1 |
| Leveraging Causal Inference for Explainable Automatic Program Repair | May 26, 2022 | Bug fixingCausal Inference | —Unverified | 0 |
| VeriDebug: A Unified LLM for Verilog Debugging via Contrastive Embedding and Guided Correction | Apr 27, 2025 | Bug fixing | —Unverified | 0 |
| An Empirical Investigation into Learning Bug-Fixing Patches in the Wild via Neural Machine Translation | Sep 7, 2018 | Bug fixingDecoder | —Unverified | 0 |
| An Empirical Study on LLM-based Agents for Automated Bug Fixing | Nov 15, 2024 | Bug fixingFault localization | —Unverified | 0 |
| APE-Bench I: Towards File-level Automated Proof Engineering of Formal Math Libraries | Apr 27, 2025 | Automated Theorem ProvingBug fixing | —Unverified | 0 |
| A Study of Vulnerability Repair in JavaScript Programs with Large Language Models | Mar 19, 2024 | Bug fixingCode Generation | —Unverified | 0 |
| Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol | Mar 7, 2025 | BenchmarkingBug fixing | —Unverified | 0 |
| Bug Fix Time Optimization Using Matrix Factorization and Iterative Gale-Shaply Algorithms | Jul 14, 2022 | Bug fixingInformation Retrieval | —Unverified | 0 |
| Characterising Open Source Co-opetition in Company-hosted Open Source Software Projects: The Cases of PyTorch, TensorFlow, and Transformers | Oct 23, 2024 | Bug fixing | —Unverified | 0 |
| Code Comparison Tuning for Code Large Language Models | Mar 28, 2024 | Bug fixing | —Unverified | 0 |
| Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming | Feb 22, 2024 | Bug fixingCode Generation | —Unverified | 0 |
| Debug Smarter, Not Harder: AI Agents for Error Resolution in Computational Notebooks | Oct 18, 2024 | AI AgentBug fixing | —Unverified | 0 |
| Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT5 | Nov 27, 2022 | Bug fixingLanguage Modeling | —Unverified | 0 |
| Empirical evaluation of LLMs in predicting fixes of Configuration bugs in Smart Home System | Feb 16, 2025 | Bug fixing | —Unverified | 0 |
| Enabling Automatic Repair of Source Code Vulnerabilities Using Data-Driven Methods | Feb 7, 2022 | Bug fixingProgram Repair | —Unverified | 0 |
| EnHMM: On the Use of Ensemble HMMs and Stack Traces to Predict the Reassignment of Bug Report Fields | Mar 15, 2021 | Bug fixing | —Unverified | 0 |
| Fix-Filter-Fix: Intuitively Connect Any Models for Effective Bug Fixing | Nov 1, 2021 | Bug fixingMachine Translation | —Unverified | 0 |
| GrACE: Generation using Associated Code Edits | May 23, 2023 | Bug fixingCode Generation | —Unverified | 0 |
| GRAPHIX: A Pre-trained Graph Edit Model for Automated Program Repair | Sep 29, 2021 | Bug fixingDecoder | —Unverified | 0 |
| LongCodeBench: Evaluating Coding LLMs at 1M Context Windows | May 12, 2025 | Bug fixing | —Unverified | 0 |
| MarsCode Agent: AI-native Automated Bug Fixing | Sep 2, 2024 | Bug fixingCode Completion | —Unverified | 0 |
| Model Card and Evaluations for Claude Models | Jul 11, 2023 | Arithmetic ReasoningBug fixing | —Unverified | 0 |
| On Learning Meaningful Code Changes via Neural Machine Translation | Jan 25, 2019 | Bug fixingMachine Translation | —Unverified | 0 |
| On Simulation-Guided LLM-based Code Generation for Safe Autonomous Driving Software | Apr 2, 2025 | Autonomous DrivingBug fixing | —Unverified | 0 |
| PDC & DM-SFT: A Road for LLM SQL Bug-Fix Enhancing | Nov 11, 2024 | Bug fixingCode Generation | —Unverified | 0 |
| RAPGen: An Approach for Fixing Code Inefficiencies in Zero-Shot | Jun 29, 2023 | Bug fixingLanguage Modeling | —Unverified | 0 |
| SecureFalcon: Are We There Yet in Automated Software Vulnerability Detection with LLMs? | Jul 13, 2023 | Binary ClassificationBug fixing | —Unverified | 0 |
| Tea: Program Repair Using Neural Network Based on Program Information Attention Matrix | Jul 17, 2021 | Bug fixingProgram Repair | —Unverified | 0 |
| The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries | Jun 14, 2025 | Bug fixingInference Optimization | —Unverified | 0 |
| Untangling Knots: Leveraging LLM for Error Resolution in Computational Notebooks | Mar 26, 2024 | Bug fixing | —Unverified | 0 |
| A Comprehensive Survey of AI-Driven Advancements and Techniques in Automated Program Repair and Code Generation | Nov 12, 2024 | Bug fixingCode Generation | —Unverified | 0 |
| Repository-level Code Search with Neural Retrieval Methods | Feb 10, 2025 | Bug fixingCode Search | CodeCode Available | 0 |