SOTAVerified

Program Repair

Task of teaching ML models to modify an existing program to fix a bug in a given code.

Papers

Showing 51100 of 132 papers

TitleStatusHype
Attention Pruning: Automated Fairness Repair of Language Models via Surrogate Simulated Annealing0
Automated Bug Generation in the era of Large Language Models0
Automated C/C++ Program Repair for High-Level Synthesis via Large Language Models0
Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering0
Automated Program Repair: Emerging trends pose and expose problems for benchmarks0
Automatic Programming: Large Language Models and Beyond0
BigIssue: A Realistic Bug Localization Benchmark0
Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code0
ConDefects: A New Dataset to Address the Data Leakage Concern for LLM-based Fault Localization and Program Repair0
Conversational Automated Program Repair0
RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair0
Counterexample Guided Program Repair Using Zero-Shot Learning and MaxSAT-based Fault Localization0
DeepCode AI Fix: Fixing Security Vulnerabilities with Large Language Models0
DeepDebug: Fixing Python Bugs Using Stack Traces, Backtranslation, and Code Skeletons0
Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT50
Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems0
Dynamic Neural Program Embeddings for Program Repair0
Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning0
Obstacles in Fully Automatic Program Repair: A survey0
Peer-aided Repairer: Empowering Large Language Models to Repair Advanced Student Assignments0
Program Repair with Minimal Edits Using CodeT50
Program Repair with Repeated Learning0
Repairing Bugs in Python Assignments Using Large Language Models0
Repair Is Nearly Generation: Multilingual Program Repair with LLMs0
Revisiting the Plastic Surgery Hypothesis via Large Language Models0
RunBugRun -- An Executable Dataset for Automated Program Repair0
SampleFix: Learning to Generate Functionally Diverse Fixes0
SCELMo: Source Code Embeddings from Language Models0
SemAgent: A Semantics Aware Program Repair Agent0
Semantic-guided Search for Efficient Program Repair with Large Language Models0
SmartPaste: Learning to Adapt Source Code0
SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs0
Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data0
T^3: Multi-level Tree-based Automatic Program Repair with Large Language Models0
Tea: Program Repair Using Neural Network Based on Program Information Attention Matrix0
The Art of Repair: Optimizing Iterative Program Repair with Instruction-Tuned Models0
The Impact of Input Order Bias on Large Language Models for Software Fault Localization0
Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs0
Towards Mixed Optimization for Reinforcement Learning with Program Synthesis0
Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories0
Using ML filters to help automated vulnerability repairs: when it helps and when it doesn't0
Where's the Bug? Attention Probing for Scalable Fault Localization0
Improving Automated Program Repair with Domain Adaptation0
In-Context Code-Text Learning for Bimodal Software Engineering0
Is ChatGPT the Ultimate Programming Assistant -- How far is it?0
Keep the Conversation Going: Fixing 162 out of 337 bugs for $0.42 each using ChatGPT0
Learning to Fix Build Errors with Graph2Diff Neural Networks0
LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks0
Leveraging Causal Inference for Explainable Automatic Program Repair0
Better patching using LLM prompting, via Self-Consistency0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DrRepair + BIFIAverage Success Rate71.7Unverified
2DrRepairAverage Success Rate68.2Unverified
3SampleFixAverage Success Rate45.3Unverified
4RLAssistAverage Success Rate26.6Unverified
#ModelMetricClaimedVerifiedStatus
1Transformer + BIFIAccuracy (%)90.5Unverified
2TransformerAccuracy (%)62Unverified
#ModelMetricClaimedVerifiedStatus
1MGDebugger (DeepSeek-Coder-V2-Lite)Pass@197.6Unverified
#ModelMetricClaimedVerifiedStatus
1TFixError Removal678Unverified