| AutoCodeRover: Autonomous Program Improvement | Apr 8, 2024 | Bug fixingCode Search | CodeCode Available | 7 |
| Agentless: Demystifying LLM-based Software Engineering Agents | Jul 1, 2024 | Program Repair | CodeCode Available | 7 |
| HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale | Sep 9, 2024 | Code GenerationFault localization | CodeCode Available | 3 |
| From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging | Oct 2, 2024 | Auto DebuggingBug fixing | CodeCode Available | 2 |
| RepairAgent: An Autonomous, LLM-Based Agent for Program Repair | Mar 25, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| CoCoNuT: Combining Context-Aware Neural Translation Models using Ensemble for Program Repair | Jul 18, 2020 | Ensemble LearningMachine Translation | CodeCode Available | 1 |
| o3-mini vs DeepSeek-R1: Which One is Safer? | Jan 30, 2025 | Code GenerationProgram Repair | CodeCode Available | 1 |
| CURE: Code-Aware Neural Machine Translation for Automatic Program Repair | Feb 26, 2021 | Machine TranslationNMT | CodeCode Available | 1 |
| TFix: Learning to Fix Coding Errors with a Text-to-Text Transformer | Jul 18, 2021 | Code GenerationMulti-Task Learning | CodeCode Available | 1 |
| Global Relational Models of Source Code | May 1, 2020 | Inductive BiasProgram Repair | CodeCode Available | 1 |
| Neural Program Repair by Jointly Learning to Localize and Repair | Apr 3, 2019 | Program RepairVariable misuse | CodeCode Available | 1 |
| CoSIL: Software Issue Localization via LLM-Driven Code Repository Graph Searching | Mar 28, 2025 | Program Repair | CodeCode Available | 1 |
| RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program Repair | Dec 25, 2023 | HumanEvalparameter-efficient fine-tuning | CodeCode Available | 1 |
| A Syntax-Guided Edit Decoder for Neural Program Repair | Jun 15, 2021 | Code CompletionCode Generation | CodeCode Available | 1 |
| Integrating Various Software Artifacts for Better LLM-based Bug Localization and Program Repair | Dec 5, 2024 | Fault localizationProgram Repair | CodeCode Available | 1 |
| KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair | Feb 3, 2023 | DecoderProgram Repair | CodeCode Available | 1 |
| How Effective Are Neural Networks for Fixing Security Vulnerabilities | May 29, 2023 | Code CompletionProgram Repair | CodeCode Available | 1 |
| Planning-Driven Programming: A Large Language Model Programming Workflow | Nov 21, 2024 | Code GenerationHumanEval | CodeCode Available | 1 |
| SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning | Jun 3, 2024 | Code CompletionCode Generation | CodeCode Available | 1 |
| xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval | Mar 6, 2023 | Program RepairProgram Synthesis | CodeCode Available | 1 |
| Enhancing Genetic Improvement Mutations Using Large Language Models | Oct 18, 2023 | Program Repair | CodeCode Available | 1 |
| Unified Pre-training for Program Understanding and Generation | Mar 10, 2021 | Clone DetectionCode Generation | CodeCode Available | 1 |
| Aligning the Objective of LLM-based Program Repair | Apr 13, 2024 | Fault localizationProgram Repair | CodeCode Available | 1 |
| Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair | Sep 1, 2023 | Code GenerationProgram Repair | CodeCode Available | 1 |
| Break-It-Fix-It: Unsupervised Learning for Program Repair | Jun 11, 2021 | C++ codeCode Repair | CodeCode Available | 1 |
| Graph-based, Self-Supervised Program Repair from Diagnostic Feedback | May 20, 2020 | Code GenerationDiagnostic | CodeCode Available | 1 |
| RepairBench: Leaderboard of Frontier Models for Program Repair | Sep 27, 2024 | Program Repair | CodeCode Available | 1 |
| Conversational Automated Program Repair | Jan 30, 2023 | Program Repair | —Unverified | 0 |
| ConDefects: A New Dataset to Address the Data Leakage Concern for LLM-based Fault Localization and Program Repair | Oct 25, 2023 | BenchmarkingFault localization | —Unverified | 0 |
| Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code | Oct 13, 2024 | Code GenerationHallucination | —Unverified | 0 |
| A Study of Vulnerability Repair in JavaScript Programs with Large Language Models | Mar 19, 2024 | Bug fixingCode Generation | —Unverified | 0 |
| To Err is Machine: Vulnerability Detection Challenges LLM Reasoning | Mar 25, 2024 | Code GenerationIn-Context Learning | —Unverified | 0 |
| Fully Autonomous Programming with Large Language Models | Apr 20, 2023 | Program RepairProgram Synthesis | —Unverified | 0 |
| Agentic Bug Reproduction for Effective Automated Program Repair at Google | Feb 3, 2025 | Large Language ModelProgram Repair | —Unverified | 0 |
| Enhancing Automated Program Repair with Solution Design | Aug 22, 2024 | Program Repair | —Unverified | 0 |
| BigIssue: A Realistic Bug Localization Benchmark | Jul 21, 2022 | BIG-bench Machine LearningDiversity | —Unverified | 0 |
| RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair | Sep 12, 2023 | Language ModellingProgram Repair | —Unverified | 0 |
| Generating Bug-Fixes Using Pretrained Transformers | Apr 16, 2021 | DenoisingProgram Repair | —Unverified | 0 |
| An LLM-as-Judge Metric for Bridging the Gap with Human Evaluation in SE Tasks | May 27, 2025 | Code GenerationCode Summarization | —Unverified | 0 |
| Automatic Programming: Large Language Models and Beyond | May 3, 2024 | Program Repair | —Unverified | 0 |
| AdaptivePaste: Code Adaptation through Learning Semantics-aware Variable Usage Representations | May 23, 2022 | Program Repair | —Unverified | 0 |
| Fairness-guided SMT-based Rectification of Decision Trees and Random Forests | Nov 22, 2020 | BIG-bench Machine LearningDecision Making | —Unverified | 0 |
| Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems | Jun 20, 2025 | Program Repair | —Unverified | 0 |
| Dynamic Neural Program Embeddings for Program Repair | Jan 1, 2018 | Code CompletionFault localization | —Unverified | 0 |
| Enabling Automatic Repair of Source Code Vulnerabilities Using Data-Driven Methods | Feb 7, 2022 | Bug fixingProgram Repair | —Unverified | 0 |
| ENCORE: Ensemble Learning using Convolution Neural Machine Translation for Automatic Program Repair | Jun 20, 2019 | Ensemble LearningMachine Translation | —Unverified | 0 |
| Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT5 | Nov 27, 2022 | Bug fixingLanguage Modeling | —Unverified | 0 |
| Automated Program Repair: Emerging trends pose and expose problems for benchmarks | May 8, 2024 | Machine TranslationProgram Repair | —Unverified | 0 |
| Evaluating Agent-based Program Repair at Google | Jan 13, 2025 | Code GenerationProgram Repair | —Unverified | 0 |
| An Exploratory Literature Study on Sharing and Energy Use of Language Models for Source Code | Jul 5, 2023 | Program Repair | —Unverified | 0 |