Program Repair

Task of teaching ML models to modify an existing program to fix a bug in a given code.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 132 papers

Title	Date	Tasks	Status
MdEval: Massively Multilingual Code Debugging	Nov 4, 2024	Program Repair	—Unverified
Semantic-guided Search for Efficient Program Repair with Large Language Models	Oct 22, 2024	GPUHumanEval	—Unverified
Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code	Oct 13, 2024	Code GenerationHallucination	—Unverified
In-Context Code-Text Learning for Bimodal Software Engineering	Oct 8, 2024	Clone DetectionIn-Context Learning	—Unverified
Exploring the Potential of Conversational Test Suite Based Program Repair on SWE-bench	Oct 6, 2024	Program Repairvalid	—Unverified
Can GPT-O1 Kill All Bugs? An Evaluation of GPT-Family LLMs on QuixBugs	Sep 16, 2024	AllProgram Repair	CodeCode Available
Enhancing Automated Program Repair with Solution Design	Aug 22, 2024	Program Repair	—Unverified
RePair: Automated Program Repair with Process-based Feedback	Aug 21, 2024	Program Repair	CodeCode Available
MergeRepair: An Exploratory Study on Merging Task-Specific Adapters in Code LLMs for Automated Program Repair	Aug 18, 2024	parameter-efficient fine-tuningProgram Repair	—Unverified
SpecRover: Code Intent Extraction via LLMs	Aug 5, 2024	Code SearchLarge Language Model	—Unverified
Automated C/C++ Program Repair for High-Level Synthesis via Large Language Models	Jul 4, 2024	C++ codeCode Generation	—Unverified
NARRepair: Non-Autoregressive Code Generation Model for Automatic Program Repair	Jun 24, 2024	Code GenerationProgram Repair	—Unverified
Automated Program Repair: Emerging trends pose and expose problems for benchmarks	May 8, 2024	Machine TranslationProgram Repair	—Unverified
Benchmarking Educational Program Repair	May 8, 2024	BenchmarkingProgram Repair	CodeCode Available
Automatic Programming: Large Language Models and Beyond	May 3, 2024	Program Repair	—Unverified
NExT: Teaching Large Language Models to Reason about Code Execution	Apr 23, 2024	HumanEvalmbpp	—Unverified
Peer-aided Repairer: Empowering Large Language Models to Repair Advanced Student Assignments	Apr 2, 2024	Language ModellingLarge Language Model	—Unverified
To Err is Machine: Vulnerability Detection Challenges LLM Reasoning	Mar 25, 2024	Code GenerationIn-Context Learning	—Unverified
A Study of Vulnerability Repair in JavaScript Programs with Large Language Models	Mar 19, 2024	Bug fixingCode Generation	—Unverified
Towards Reliable Evaluation of Neural Program Repair with Natural Robustness Testing	Feb 19, 2024	Program Repair	CodeCode Available
DeepCode AI Fix: Fixing Security Vulnerabilities with Large Language Models	Feb 19, 2024	Code RepairFew-Shot Learning	—Unverified
A Novel Approach for Automatic Program Repair using Round-Trip Translation with Large Language Models	Jan 15, 2024	HumanEvalLanguage Modelling	CodeCode Available
Breaking the Silence: the Threats of Using LLMs in Software Engineering	Dec 13, 2023	Code CompletionCode Summarization	CodeCode Available
Out of Context: How important is Local Context in Neural Program Repair?	Dec 8, 2023	Program Repair	CodeCode Available
Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning	Nov 22, 2023	Code GenerationCode Translation	—Unverified
ConDefects: A New Dataset to Address the Data Leakage Concern for LLM-based Fault Localization and Program Repair	Oct 25, 2023	BenchmarkingFault localization	—Unverified
Automated Bug Generation in the era of Large Language Models	Oct 3, 2023	Program Repair	—Unverified
Program Repair with Minimal Edits Using CodeT5	Sep 26, 2023	Program Repair	—Unverified
Frustrated with Code Quality Issues? LLMs can Help!	Sep 22, 2023	Instruction FollowingProgram Repair	—Unverified
RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair	Sep 12, 2023	Language ModellingProgram Repair	—Unverified
Graph Neural Networks For Mapping Variables Between Programs -- Extended Version	Jul 24, 2023	Clone DetectionProgram Repair	CodeCode Available
An Exploratory Literature Study on Sharing and Energy Use of Language Models for Source Code	Jul 5, 2023	Program Repair	—Unverified
Better patching using LLM prompting, via Self-Consistency	May 31, 2023	Program Repair	—Unverified
Is ChatGPT the Ultimate Programming Assistant -- How far is it?	Apr 24, 2023	Code GenerationCode Summarization	—Unverified
Fully Autonomous Programming with Large Language Models	Apr 20, 2023	Program RepairProgram Synthesis	—Unverified
Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering	Apr 16, 2023	Program RepairPrompt Engineering	—Unverified
Teaching Large Language Models to Self-Debug	Apr 11, 2023	Code GenerationLanguage Modeling	CodeCode Available
RunBugRun -- An Executable Dataset for Automated Program Repair	Apr 3, 2023	Program Repair	—Unverified
Keep the Conversation Going: Fixing 162 out of 337 bugs for $0.42 each using ChatGPT	Apr 1, 2023	Program Repair	—Unverified
Revisiting the Plastic Surgery Hypothesis via Large Language Models	Mar 18, 2023	Program Repair	—Unverified
Conversational Automated Program Repair	Jan 30, 2023	Program Repair	—Unverified
Invalidator: Automated Patch Correctness Assessment via Semantic and Syntactic Reasoning	Jan 3, 2023	Language ModellingProgram Repair	CodeCode Available
Improving Automated Program Repair with Domain Adaptation	Dec 21, 2022	Domain AdaptationProgram Repair	—Unverified
Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT5	Nov 27, 2022	Bug fixingLanguage Modeling	—Unverified
Repairing Bugs in Python Assignments Using Large Language Models	Sep 29, 2022	ChunkingLanguage Modeling	—Unverified
Repair Is Nearly Generation: Multilingual Program Repair with LLMs	Aug 24, 2022	Language ModellingLarge Language Model	—Unverified
BigIssue: A Realistic Bug Localization Benchmark	Jul 21, 2022	BIG-bench Machine LearningDiversity	—Unverified
InvAASTCluster: On Applying Invariant-Based Program Clustering to Introductory Programming Assignments	Jun 28, 2022	ClusteringProgram Repair	CodeCode Available
C-Pack of IPAs: A C90 Program Benchmark of Introductory Programming Assignments	Jun 17, 2022	Program Repair	CodeCode Available
Leveraging Causal Inference for Explainable Automatic Program Repair	May 26, 2022	Bug fixingCausal Inference	—Unverified

Show:10 25 50

← PrevPage 2 of 3Next →

All datasets DeepFix GitHub-Python HumanEvalPack TFix's Code Patches Data

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	DrRepair + BIFI	Average Success Rate	71.7	—	Unverified
2	DrRepair	Average Success Rate	68.2	—	Unverified
3	SampleFix	Average Success Rate	45.3	—	Unverified
4	RLAssist	Average Success Rate	26.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Transformer + BIFI	Accuracy (%)	90.5	—	Unverified
2	Transformer	Accuracy (%)	62	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MGDebugger (DeepSeek-Coder-V2-Lite)	Pass@1	97.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TFix	Error Removal	678	—	Unverified