SOTAVerified

Program Repair

Program repair is the task of training ML models to modify an existing program so that a bug in its code is fixed.

Papers

Showing 51–100 of 132 papers

| Title | Status | Hype |
|---|---|---|
| Arachne: Search Based Repair of Deep Neural Networks | Code | 0 |
| Teaching Large Language Models to Self-Debug | Code | 0 |
| Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data | | 0 |
| Exploring the Potential of Conversational Test Suite Based Program Repair on SWE-bench | | 0 |
| Fairness-guided SMT-based Rectification of Decision Trees and Random Forests | | 0 |
| Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code | | 0 |
| Frustrated with Code Quality Issues? LLMs can Help! | | 0 |
| Fully Autonomous Programming with Large Language Models | | 0 |
| Generating Bug-Fixes Using Pretrained Transformers | | 0 |
| BigIssue: A Realistic Bug Localization Benchmark | | 0 |
| Gradient-Based Program Repair: Fixing Bugs in Continuous Program Spaces | | 0 |
| Grammar-Based Patches Generation for Automated Program Repair | | 0 |
| Automatic Programming: Large Language Models and Beyond | | 0 |
| GRAPHIX: A Pre-trained Graph Edit Model for Automated Program Repair | | 0 |
| T^3: Multi-level Tree-based Automatic Program Repair with Large Language Models | | 0 |
| Automated Program Repair: Emerging trends pose and expose problems for benchmarks | | 0 |
| SpecRover: Code Intent Extraction via LLMs | | 0 |
| Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering | | 0 |
| Improving Automated Program Repair with Domain Adaptation | | 0 |
| In-Context Code-Text Learning for Bimodal Software Engineering | | 0 |
| Automated C/C++ Program Repair for High-Level Synthesis via Large Language Models | | 0 |
| Tea: Program Repair Using Neural Network Based on Program Information Attention Matrix | | 0 |
| A Comprehensive Survey of AI-Driven Advancements and Techniques in Automated Program Repair and Code Generation | | 0 |
| Is ChatGPT the Ultimate Programming Assistant -- How far is it? | | 0 |
| Keep the Conversation Going: Fixing 162 out of 337 bugs for $0.42 each using ChatGPT | | 0 |
| Automated Bug Generation in the era of Large Language Models | | 0 |
| The Art of Repair: Optimizing Iterative Program Repair with Instruction-Tuned Models | | 0 |
| Learning to Fix Build Errors with Graph2Diff Neural Networks | | 0 |
| The Impact of Input Order Bias on Large Language Models for Software Fault Localization | | 0 |
| LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks | | 0 |
| Leveraging Causal Inference for Explainable Automatic Program Repair | | 0 |
| Better patching using LLM prompting, via Self-Consistency | | 0 |
| Mapping the Structure and Evolution of Software Testing Research Over the Past Three Decades | | 0 |
| MdEval: Massively Multilingual Code Debugging | | 0 |
| MergeRepair: An Exploratory Study on Merging Task-Specific Adapters in Code LLMs for Automated Program Repair | | 0 |
| MultiFix: Learning to Repair Multiple Errors by Optimal Alignment Learning | | 0 |
| NARRepair: Non-Autoregressive Code Generation Model for Automatic Program Repair | | 0 |
| Attention Pruning: Automated Fairness Repair of Language Models via Surrogate Simulated Annealing | | 0 |
| Neural Program Repair: Systems, Challenges and Solutions | | 0 |
| NExT: Teaching Large Language Models to Reason about Code Execution | | 0 |
| Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning | | 0 |
| A Study of Vulnerability Repair in JavaScript Programs with Large Language Models | | 0 |
| Obstacles in Fully Automatic Program Repair: A survey | | 0 |
| Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs | | 0 |
| Towards Mixed Optimization for Reinforcement Learning with Program Synthesis | | 0 |
| Peer-aided Repairer: Empowering Large Language Models to Repair Advanced Student Assignments | | 0 |
| An LLM-as-Judge Metric for Bridging the Gap with Human Evaluation in SE Tasks | | 0 |
| Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories | | 0 |
| Program Repair with Minimal Edits Using CodeT5 | | 0 |
| Program Repair with Repeated Learning | | 0 |

Benchmark Results

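| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DrRepair + BIFI | Average Success Rate | 71.7 | | Unverified |
| 2 | DrRepair | Average Success Rate | 68.2 | | Unverified |
| 3 | SampleFix | Average Success Rate | 45.3 | | Unverified |
| 4 | RLAssist | Average Success Rate | 26.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Transformer + BIFI | Accuracy (%) | 90.5 | | Unverified |
| 2 | Transformer | Accuracy (%) | 62 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MGDebugger (DeepSeek-Coder-V2-Lite) | Pass@1 | 97.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TFix | Error Removal | 678 | | Unverified |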