SOTAVerified

Program Repair

Task of teaching ML models to modify an existing program to fix a bug in a given code.

Papers

Showing 51100 of 132 papers

TitleStatusHype
Peer-aided Repairer: Empowering Large Language Models to Repair Advanced Student Assignments0
To Err is Machine: Vulnerability Detection Challenges LLM Reasoning0
RepairAgent: An Autonomous, LLM-Based Agent for Program RepairCode2
A Study of Vulnerability Repair in JavaScript Programs with Large Language Models0
DeepCode AI Fix: Fixing Security Vulnerabilities with Large Language Models0
Towards Reliable Evaluation of Neural Program Repair with Natural Robustness TestingCode0
A Novel Approach for Automatic Program Repair using Round-Trip Translation with Large Language ModelsCode0
RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program RepairCode1
Breaking the Silence: the Threats of Using LLMs in Software EngineeringCode0
Out of Context: How important is Local Context in Neural Program Repair?Code0
Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning0
ConDefects: A New Dataset to Address the Data Leakage Concern for LLM-based Fault Localization and Program Repair0
Enhancing Genetic Improvement Mutations Using Large Language ModelsCode1
Automated Bug Generation in the era of Large Language Models0
Program Repair with Minimal Edits Using CodeT50
Frustrated with Code Quality Issues? LLMs can Help!0
RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair0
Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program RepairCode1
Graph Neural Networks For Mapping Variables Between Programs -- Extended VersionCode0
An Exploratory Literature Study on Sharing and Energy Use of Language Models for Source Code0
Better patching using LLM prompting, via Self-Consistency0
How Effective Are Neural Networks for Fixing Security VulnerabilitiesCode1
Is ChatGPT the Ultimate Programming Assistant -- How far is it?0
Fully Autonomous Programming with Large Language Models0
Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering0
Teaching Large Language Models to Self-DebugCode0
RunBugRun -- An Executable Dataset for Automated Program Repair0
Keep the Conversation Going: Fixing 162 out of 337 bugs for $0.42 each using ChatGPT0
Revisiting the Plastic Surgery Hypothesis via Large Language Models0
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and RetrievalCode1
KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program RepairCode1
Conversational Automated Program Repair0
Invalidator: Automated Patch Correctness Assessment via Semantic and Syntactic ReasoningCode0
Improving Automated Program Repair with Domain Adaptation0
Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT50
Repairing Bugs in Python Assignments Using Large Language Models0
Repair Is Nearly Generation: Multilingual Program Repair with LLMs0
BigIssue: A Realistic Bug Localization Benchmark0
InvAASTCluster: On Applying Invariant-Based Program Clustering to Introductory Programming AssignmentsCode0
C-Pack of IPAs: A C90 Program Benchmark of Introductory Programming AssignmentsCode0
Leveraging Causal Inference for Explainable Automatic Program Repair0
AdaptivePaste: Code Adaptation through Learning Semantics-aware Variable Usage Representations0
Neural Program Repair: Systems, Challenges and Solutions0
Enabling Automatic Repair of Source Code Vulnerabilities Using Data-Driven Methods0
MultiFix: Learning to Repair Multiple Errors by Optimal Alignment Learning0
GRAPHIX: A Pre-trained Graph Edit Model for Automated Program Repair0
Mapping the Structure and Evolution of Software Testing Research Over the Past Three Decades0
Grammar-Based Patches Generation for Automated Program Repair0
TFix: Learning to Fix Coding Errors with a Text-to-Text TransformerCode1
Tea: Program Repair Using Neural Network Based on Program Information Attention Matrix0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DrRepair + BIFIAverage Success Rate71.7Unverified
2DrRepairAverage Success Rate68.2Unverified
3SampleFixAverage Success Rate45.3Unverified
4RLAssistAverage Success Rate26.6Unverified
#ModelMetricClaimedVerifiedStatus
1Transformer + BIFIAccuracy (%)90.5Unverified
2TransformerAccuracy (%)62Unverified
#ModelMetricClaimedVerifiedStatus
1MGDebugger (DeepSeek-Coder-V2-Lite)Pass@197.6Unverified
#ModelMetricClaimedVerifiedStatus
1TFixError Removal678Unverified