SOTAVerified

Program Repair

Task of teaching ML models to modify an existing program to fix a bug in a given code.

Papers

Showing 101132 of 132 papers

TitleStatusHype
An Exploratory Literature Study on Sharing and Energy Use of Language Models for Source Code0
To Err is Machine: Vulnerability Detection Challenges LLM Reasoning0
A Multi-Dataset Evaluation of Models for Automated Vulnerability Repair0
Repairing Bugs in Python Assignments Using Large Language Models0
Repair Is Nearly Generation: Multilingual Program Repair with LLMs0
Agentic Bug Reproduction for Effective Automated Program Repair at Google0
Revisiting the Plastic Surgery Hypothesis via Large Language Models0
Using ML filters to help automated vulnerability repairs: when it helps and when it doesn't0
RunBugRun -- An Executable Dataset for Automated Program Repair0
SampleFix: Learning to Generate Functionally Diverse Fixes0
SCELMo: Source Code Embeddings from Language Models0
Where's the Bug? Attention Probing for Scalable Fault Localization0
SemAgent: A Semantics Aware Program Repair Agent0
CORE: Benchmarking LLMs Code Reasoning Capabilities through Static Analysis Tasks0
Counterexample Guided Program Repair Using Zero-Shot Learning and MaxSAT-based Fault Localization0
Semantic-guided Search for Efficient Program Repair with Large Language Models0
Conversational Automated Program Repair0
DeepCode AI Fix: Fixing Security Vulnerabilities with Large Language Models0
DeepDebug: Fixing Python Bugs Using Stack Traces, Backtranslation, and Code Skeletons0
AdaptivePaste: Code Adaptation through Learning Semantics-aware Variable Usage Representations0
RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair0
Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT50
Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems0
SmartPaste: Learning to Adapt Source Code0
Dynamic Neural Program Embeddings for Program Repair0
Enabling Automatic Repair of Source Code Vulnerabilities Using Data-Driven Methods0
ENCORE: Ensemble Learning using Convolution Neural Machine Translation for Automatic Program Repair0
ConDefects: A New Dataset to Address the Data Leakage Concern for LLM-based Fault Localization and Program Repair0
Enhancing Automated Program Repair with Solution Design0
Evaluating Agent-based Program Repair at Google0
SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs0
Evaluating the Generalizability of LLMs in Automated Program Repair0
Show:102550
← PrevPage 3 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DrRepair + BIFIAverage Success Rate71.7Unverified
2DrRepairAverage Success Rate68.2Unverified
3SampleFixAverage Success Rate45.3Unverified
4RLAssistAverage Success Rate26.6Unverified
#ModelMetricClaimedVerifiedStatus
1Transformer + BIFIAccuracy (%)90.5Unverified
2TransformerAccuracy (%)62Unverified
#ModelMetricClaimedVerifiedStatus
1MGDebugger (DeepSeek-Coder-V2-Lite)Pass@197.6Unverified
#ModelMetricClaimedVerifiedStatus
1TFixError Removal678Unverified