SOTAVerified

Program Repair

Program repair is the task of teaching ML models to modify an existing program so that a bug in the given code is fixed.
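As a rough illustration of the task, the sketch below pairs a buggy function with a candidate patch and checks both against a small test suite. The functions `buggy_max` and `repaired_max`, the `TESTS` list, and the `passes_all` helper are all hypothetical names invented for this example; they are not taken from any of the listed papers or benchmarks, and real systems evaluate patches in more elaborate ways.

```python
from typing import Callable, List, Tuple

# Hypothetical buggy program: meant to return the largest element of a list,
# but the comparison is inverted, so it returns the smallest instead.
def buggy_max(values: List[int]) -> int:
    best = values[0]
    for v in values[1:]:
        if v < best:  # bug: should be `>`
            best = v
    return best

# Candidate patch of the kind a repair model might propose: a one-token edit.
def repaired_max(values: List[int]) -> int:
    best = values[0]
    for v in values[1:]:
        if v > best:
            best = v
    return best

# Hypothetical test suite: (input, expected output) pairs.
TESTS: List[Tuple[List[int], int]] = [
    ([1, 3, 2], 3),
    ([5], 5),
    ([-4, -1, -9], -1),
]

def passes_all(program: Callable[[List[int]], int]) -> bool:
    """Return True if the program gives the expected output on every test."""
    return all(program(list(inp)) == expected for inp, expected in TESTS)

if __name__ == "__main__":
    print("buggy passes:   ", passes_all(buggy_max))
    print("repaired passes:", passes_all(repaired_max))
```

Here a patch counts as a successful repair only if it passes every test. This is one common way to judge a fix, and test-based metrics such as Pass@1 build on the same idea.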

Papers

Showing 21-30 of 132 papers

| Title | Status | Hype |
| --- | --- | --- |
| o3-mini vs DeepSeek-R1: Which One is Safer? | Code | 1 |
| Evaluating Agent-based Program Repair at Google | | 0 |
| The Impact of Input Order Bias on Large Language Models for Software Fault Localization | | 0 |
| Counterexample Guided Program Repair Using Zero-Shot Learning and MaxSAT-based Fault Localization | | 0 |
| Integrating Various Software Artifacts for Better LLM-based Bug Localization and Program Repair | Code | 1 |
| Planning-Driven Programming: A Large Language Model Programming Workflow | Code | 1 |
| A Comprehensive Survey of AI-Driven Advancements and Techniques in Automated Program Repair and Code Generation | | 0 |
| MdEval: Massively Multilingual Code Debugging | | 0 |
| Semantic-guided Search for Efficient Program Repair with Large Language Models | | 0 |
| Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code | | 0 |
Page 3 of 14

Benchmark Results

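| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DrRepair + BIFI | Average Success Rate | 71.7 | | Unverified |
| 2 | DrRepair | Average Success Rate | 68.2 | | Unverified |
| 3 | SampleFix | Average Success Rate | 45.3 | | Unverified |
| 4 | RLAssist | Average Success Rate | 26.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Transformer + BIFI | Accuracy (%) | 90.5 | | Unverified |
| 2 | Transformer | Accuracy (%) | 62 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MGDebugger (DeepSeek-Coder-V2-Lite) | Pass@1 | 97.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | TFix | Error Removal | 678 | | Unverified |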