| Arachne: Search Based Repair of Deep Neural Networks | Dec 28, 2019 | FairnessGender Classification | CodeCode Available | 0 | 5 |
| Teaching Large Language Models to Self-Debug | Apr 11, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 0 | 5 |
| Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data | May 12, 2025 | Program RepairSynthetic Data Generation | —Unverified | 0 | 0 |
| Exploring the Potential of Conversational Test Suite Based Program Repair on SWE-bench | Oct 6, 2024 | Program Repairvalid | —Unverified | 0 | 0 |
| Fairness-guided SMT-based Rectification of Decision Trees and Random Forests | Nov 22, 2020 | BIG-bench Machine LearningDecision Making | —Unverified | 0 | 0 |
| Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code | Oct 13, 2024 | Code GenerationHallucination | —Unverified | 0 | 0 |
| Frustrated with Code Quality Issues? LLMs can Help! | Sep 22, 2023 | Instruction FollowingProgram Repair | —Unverified | 0 | 0 |
| Fully Autonomous Programming with Large Language Models | Apr 20, 2023 | Program RepairProgram Synthesis | —Unverified | 0 | 0 |
| Generating Bug-Fixes Using Pretrained Transformers | Apr 16, 2021 | DenoisingProgram Repair | —Unverified | 0 | 0 |
| BigIssue: A Realistic Bug Localization Benchmark | Jul 21, 2022 | BIG-bench Machine LearningDiversity | —Unverified | 0 | 0 |
| Gradient-Based Program Repair: Fixing Bugs in Continuous Program Spaces | May 23, 2025 | Program Repair | —Unverified | 0 | 0 |
| Grammar-Based Patches Generation for Automated Program Repair | Aug 1, 2021 | Program Repair | —Unverified | 0 | 0 |
| Automatic Programming: Large Language Models and Beyond | May 3, 2024 | Program Repair | —Unverified | 0 | 0 |
| GRAPHIX: A Pre-trained Graph Edit Model for Automated Program Repair | Sep 29, 2021 | Bug fixingDecoder | —Unverified | 0 | 0 |
| T^3: Multi-level Tree-based Automatic Program Repair with Large Language Models | Jun 26, 2025 | Program Repair | —Unverified | 0 | 0 |
| Automated Program Repair: Emerging trends pose and expose problems for benchmarks | May 8, 2024 | Machine TranslationProgram Repair | —Unverified | 0 | 0 |
| SpecRover: Code Intent Extraction via LLMs | Aug 5, 2024 | Code SearchLarge Language Model | —Unverified | 0 | 0 |
| Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering | Apr 16, 2023 | Program RepairPrompt Engineering | —Unverified | 0 | 0 |
| Improving Automated Program Repair with Domain Adaptation | Dec 21, 2022 | Domain AdaptationProgram Repair | —Unverified | 0 | 0 |
| In-Context Code-Text Learning for Bimodal Software Engineering | Oct 8, 2024 | Clone DetectionIn-Context Learning | —Unverified | 0 | 0 |
| Automated C/C++ Program Repair for High-Level Synthesis via Large Language Models | Jul 4, 2024 | C++ codeCode Generation | —Unverified | 0 | 0 |
| Tea: Program Repair Using Neural Network Based on Program Information Attention Matrix | Jul 17, 2021 | Bug fixingProgram Repair | —Unverified | 0 | 0 |
| A Comprehensive Survey of AI-Driven Advancements and Techniques in Automated Program Repair and Code Generation | Nov 12, 2024 | Bug fixingCode Generation | —Unverified | 0 | 0 |
| Is ChatGPT the Ultimate Programming Assistant -- How far is it? | Apr 24, 2023 | Code GenerationCode Summarization | —Unverified | 0 | 0 |
| Keep the Conversation Going: Fixing 162 out of 337 bugs for $0.42 each using ChatGPT | Apr 1, 2023 | Program Repair | —Unverified | 0 | 0 |
| Automated Bug Generation in the era of Large Language Models | Oct 3, 2023 | Program Repair | —Unverified | 0 | 0 |
| The Art of Repair: Optimizing Iterative Program Repair with Instruction-Tuned Models | May 5, 2025 | HumanEvalProgram Repair | —Unverified | 0 | 0 |
| Learning to Fix Build Errors with Graph2Diff Neural Networks | Nov 4, 2019 | DiagnosticGraph Neural Network | —Unverified | 0 | 0 |
| The Impact of Input Order Bias on Large Language Models for Software Fault Localization | Dec 25, 2024 | Fault localizationMemorization | —Unverified | 0 | 0 |
| LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks | Feb 10, 2025 | Code GenerationProgram Repair | —Unverified | 0 | 0 |
| Leveraging Causal Inference for Explainable Automatic Program Repair | May 26, 2022 | Bug fixingCausal Inference | —Unverified | 0 | 0 |
| Better patching using LLM prompting, via Self-Consistency | May 31, 2023 | Program Repair | —Unverified | 0 | 0 |
| Mapping the Structure and Evolution of Software Testing Research Over the Past Three Decades | Sep 9, 2021 | BIG-bench Machine LearningProgram Repair | —Unverified | 0 | 0 |
| MdEval: Massively Multilingual Code Debugging | Nov 4, 2024 | Program Repair | —Unverified | 0 | 0 |
| MergeRepair: An Exploratory Study on Merging Task-Specific Adapters in Code LLMs for Automated Program Repair | Aug 18, 2024 | parameter-efficient fine-tuningProgram Repair | —Unverified | 0 | 0 |
| MultiFix: Learning to Repair Multiple Errors by Optimal Alignment Learning | Nov 1, 2021 | Program Repair | —Unverified | 0 | 0 |
| NARRepair: Non-Autoregressive Code Generation Model for Automatic Program Repair | Jun 24, 2024 | Code GenerationProgram Repair | —Unverified | 0 | 0 |
| Attention Pruning: Automated Fairness Repair of Language Models via Surrogate Simulated Annealing | Mar 20, 2025 | FairnessProgram Repair | —Unverified | 0 | 0 |
| Neural Program Repair: Systems, Challenges and Solutions | Feb 22, 2022 | DecoderProgram Repair | —Unverified | 0 | 0 |
| NExT: Teaching Large Language Models to Reason about Code Execution | Apr 23, 2024 | HumanEvalmbpp | —Unverified | 0 | 0 |
| Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning | Nov 22, 2023 | Code GenerationCode Translation | —Unverified | 0 | 0 |
| A Study of Vulnerability Repair in JavaScript Programs with Large Language Models | Mar 19, 2024 | Bug fixingCode Generation | —Unverified | 0 | 0 |
| Obstacles in Fully Automatic Program Repair: A survey | Nov 5, 2020 | Program RepairSurvey | —Unverified | 0 | 0 |
| Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs | May 7, 2025 | Program Repair | —Unverified | 0 | 0 |
| Towards Mixed Optimization for Reinforcement Learning with Program Synthesis | Jul 1, 2018 | Deep Reinforcement LearningProgram Repair | —Unverified | 0 | 0 |
| Peer-aided Repairer: Empowering Large Language Models to Repair Advanced Student Assignments | Apr 2, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| An LLM-as-Judge Metric for Bridging the Gap with Human Evaluation in SE Tasks | May 27, 2025 | Code GenerationCode Summarization | —Unverified | 0 | 0 |
| Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories | Jun 23, 2025 | Large Language ModelProgram Repair | —Unverified | 0 | 0 |
| Program Repair with Minimal Edits Using CodeT5 | Sep 26, 2023 | Program Repair | —Unverified | 0 | 0 |
| Program Repair with Repeated Learning | Apr 24, 2021 | Program Repair | —Unverified | 0 | 0 |