SOTAVerified

Program Repair

Task of teaching ML models to modify an existing program to fix a bug in a given code.

Papers

Showing 76100 of 132 papers

TitleStatusHype
RunBugRun -- An Executable Dataset for Automated Program Repair0
SampleFix: Learning to Generate Functionally Diverse Fixes0
SCELMo: Source Code Embeddings from Language Models0
SemAgent: A Semantics Aware Program Repair Agent0
Semantic-guided Search for Efficient Program Repair with Large Language Models0
SmartPaste: Learning to Adapt Source Code0
SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs0
Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data0
T^3: Multi-level Tree-based Automatic Program Repair with Large Language Models0
Tea: Program Repair Using Neural Network Based on Program Information Attention Matrix0
The Art of Repair: Optimizing Iterative Program Repair with Instruction-Tuned Models0
The Impact of Input Order Bias on Large Language Models for Software Fault Localization0
Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs0
Towards Mixed Optimization for Reinforcement Learning with Program Synthesis0
Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories0
Using ML filters to help automated vulnerability repairs: when it helps and when it doesn't0
Where's the Bug? Attention Probing for Scalable Fault Localization0
Improving Automated Program Repair with Domain Adaptation0
In-Context Code-Text Learning for Bimodal Software Engineering0
Is ChatGPT the Ultimate Programming Assistant -- How far is it?0
Keep the Conversation Going: Fixing 162 out of 337 bugs for $0.42 each using ChatGPT0
Learning to Fix Build Errors with Graph2Diff Neural Networks0
LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks0
Leveraging Causal Inference for Explainable Automatic Program Repair0
Better patching using LLM prompting, via Self-Consistency0
Show:102550
← PrevPage 4 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DrRepair + BIFIAverage Success Rate71.7Unverified
2DrRepairAverage Success Rate68.2Unverified
3SampleFixAverage Success Rate45.3Unverified
4RLAssistAverage Success Rate26.6Unverified
#ModelMetricClaimedVerifiedStatus
1Transformer + BIFIAccuracy (%)90.5Unverified
2TransformerAccuracy (%)62Unverified
#ModelMetricClaimedVerifiedStatus
1MGDebugger (DeepSeek-Coder-V2-Lite)Pass@197.6Unverified
#ModelMetricClaimedVerifiedStatus
1TFixError Removal678Unverified