SOTAVerified

Bug fixing

Papers

Showing 150 of 62 papers

TitleStatusHype
CoreCodeBench: A Configurable Multi-Scenario Repository-Level BenchmarkCode1
The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries0
SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software DevelopmentCode2
LongCodeBench: Evaluating Coding LLMs at 1M Context Windows0
APE-Bench I: Towards File-level Automated Proof Engineering of Formal Math Libraries0
VeriDebug: A Unified LLM for Verilog Debugging via Contrastive Embedding and Guided Correction0
On Simulation-Guided LLM-based Code Generation for Safe Autonomous Driving Software0
Less is More: Adaptive Program Repair with Bug Localization and Preference LearningCode0
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol0
Empirical evaluation of LLMs in predicting fixes of Configuration bugs in Smart Home System0
Repository-level Code Search with Neural Retrieval MethodsCode0
GREEN-CODE: Learning to Optimize Energy Efficiency in LLM-based Code GenerationCode0
CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and RerankingCode2
An Empirical Study on LLM-based Agents for Automated Bug Fixing0
A Comprehensive Survey of AI-Driven Advancements and Techniques in Automated Program Repair and Code Generation0
PDC & DM-SFT: A Road for LLM SQL Bug-Fix Enhancing0
MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMsCode1
Characterising Open Source Co-opetition in Company-hosted Open Source Software Projects: The Cases of PyTorch, TensorFlow, and Transformers0
Debug Smarter, Not Harder: AI Agents for Error Resolution in Computational Notebooks0
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical DebuggingCode2
MarsCode Agent: AI-native Automated Bug Fixing0
Leveraging Large Language Models for Enhancing the Understandability of Generated Unit TestsCode1
Patched RTC: evaluating LLMs for diverse software development tasksCode0
CodeR: Issue Resolving with Multi-Agent and Task GraphsCode2
SWE-agent: Agent-Computer Interfaces Enable Automated Software EngineeringCode11
Unraveling Code Clone Dynamics in Deep Learning FrameworksCode0
AutoCodeRover: Autonomous Program ImprovementCode7
Code Comparison Tuning for Code Large Language Models0
Untangling Knots: Leveraging LLM for Error Resolution in Computational Notebooks0
A Study of Vulnerability Repair in JavaScript Programs with Large Language Models0
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming0
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?Code4
Bug Characterization in Machine Learning-based SystemsCode0
SecureFalcon: Are We There Yet in Automated Software Vulnerability Detection with LLMs?0
Model Card and Evaluations for Claude Models0
RAPGen: An Approach for Fixing Code Inefficiencies in Zero-Shot0
GrACE: Generation using Associated Code Edits0
GPT-4 Technical ReportCode6
Automating Code-Related Tasks Through Transformers: The Impact of Pre-trainingCode0
Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT50
Using Developer Discussions to Guide Fixing Bugs in SoftwareCode0
ADPTriage: Approximate Dynamic Programming for Bug TriageCode0
CoditT5: Pretraining for Source Code and Natural Language EditingCode1
Bug Fix Time Optimization Using Matrix Factorization and Iterative Gale-Shaply Algorithms0
FixEval: Execution-based Evaluation of Program Fixes for Programming ProblemsCode1
Leveraging Causal Inference for Explainable Automatic Program Repair0
S-DABT: Schedule and Dependency-Aware Bug Triage in Open-Source Bug Tracking SystemsCode0
RoPGen: Towards Robust Code Authorship Attribution via Automatic Coding Style TransformationCode1
Enabling Automatic Repair of Source Code Vulnerabilities Using Data-Driven Methods0
Fix-Filter-Fix: Intuitively Connect Any Models for Effective Bug Fixing0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.