SOTAVerified

Code Repair

Papers

Showing 26-39 of 39 papers

Title | Status | Hype
RocketPPA: Code-Level Power, Performance, and Area Prediction via LLM and Mixture of Experts | - | 0
SolBench: A Dataset and Benchmark for Evaluating Functional Correctness in Solidity Code Completion and Repair | - | 0
AuPair: Golden Example Pairs for Code Repair | - | 0
FlakyFix: Using Large Language Models for Predicting Flaky Test Fix Categories and Test Code Repair | - | 0
Break it - Message it - Fix it : Learning to Repair Python Programs using Error Messages without Labelled Data | - | 0
Breakpoint: Scalable evaluation of system-level reasoning in LLM code agents | - | 0
CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks | - | 0
Code Repair with LLMs gives an Exploration-Exploitation Tradeoff | - | 0
Code Security Vulnerability Repair Using Reinforcement Learning with Large Language Models | - | 0
CrashFixer: A crash resolution agent for the Linux kernel | - | 0
DeepCode AI Fix: Fixing Security Vulnerabilities with Large Language Models | - | 0
Investigating the Transferability of Code Repair for Low-Resource Programming Languages | - | 0
Why Stop at One Error? Benchmarking LLMs as Data Science Code Debuggers for Multi-Hop and Multi-Bug Errors | Code | 0
CraftRTL: High-quality Synthetic Data Generation for Verilog Code Models with Correct-by-Construction Non-Textual Representations and Targeted Code Repair | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NSEdit | Accuracy (medium) | 13.87 | - | Unverified
2 | CodeBERT | Accuracy (medium) | 5.2 | - | Unverified