SOTAVerified

Bug fixing

Papers

Showing 150 of 62 papers

TitleStatusHype
SWE-agent: Agent-Computer Interfaces Enable Automated Software EngineeringCode11
AutoCodeRover: Autonomous Program ImprovementCode7
GPT-4 Technical ReportCode6
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?Code4
CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and RerankingCode2
CodeR: Issue Resolving with Multi-Agent and Task GraphsCode2
SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software DevelopmentCode2
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical DebuggingCode2
MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMsCode1
Leveraging Large Language Models for Enhancing the Understandability of Generated Unit TestsCode1
CoditT5: Pretraining for Source Code and Natural Language EditingCode1
RoPGen: Towards Robust Code Authorship Attribution via Automatic Coding Style TransformationCode1
Empirical Study of Transformers for Source CodeCode1
FixEval: Execution-based Evaluation of Program Fixes for Programming ProblemsCode1
D2A: A Dataset Built for AI-Based Vulnerability Detection Methods Using Differential AnalysisCode1
A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source CodeCode1
CoreCodeBench: A Configurable Multi-Scenario Repository-Level BenchmarkCode1
Neural Transfer Learning for Repairing Security Vulnerabilities in C CodeCode1
Leveraging Causal Inference for Explainable Automatic Program Repair0
VeriDebug: A Unified LLM for Verilog Debugging via Contrastive Embedding and Guided Correction0
An Empirical Investigation into Learning Bug-Fixing Patches in the Wild via Neural Machine Translation0
An Empirical Study on LLM-based Agents for Automated Bug Fixing0
APE-Bench I: Towards File-level Automated Proof Engineering of Formal Math Libraries0
A Study of Vulnerability Repair in JavaScript Programs with Large Language Models0
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol0
Bug Fix Time Optimization Using Matrix Factorization and Iterative Gale-Shaply Algorithms0
Characterising Open Source Co-opetition in Company-hosted Open Source Software Projects: The Cases of PyTorch, TensorFlow, and Transformers0
Code Comparison Tuning for Code Large Language Models0
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming0
Debug Smarter, Not Harder: AI Agents for Error Resolution in Computational Notebooks0
Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT50
Empirical evaluation of LLMs in predicting fixes of Configuration bugs in Smart Home System0
Enabling Automatic Repair of Source Code Vulnerabilities Using Data-Driven Methods0
EnHMM: On the Use of Ensemble HMMs and Stack Traces to Predict the Reassignment of Bug Report Fields0
Fix-Filter-Fix: Intuitively Connect Any Models for Effective Bug Fixing0
GrACE: Generation using Associated Code Edits0
GRAPHIX: A Pre-trained Graph Edit Model for Automated Program Repair0
LongCodeBench: Evaluating Coding LLMs at 1M Context Windows0
MarsCode Agent: AI-native Automated Bug Fixing0
Model Card and Evaluations for Claude Models0
On Learning Meaningful Code Changes via Neural Machine Translation0
On Simulation-Guided LLM-based Code Generation for Safe Autonomous Driving Software0
PDC & DM-SFT: A Road for LLM SQL Bug-Fix Enhancing0
RAPGen: An Approach for Fixing Code Inefficiencies in Zero-Shot0
SecureFalcon: Are We There Yet in Automated Software Vulnerability Detection with LLMs?0
Tea: Program Repair Using Neural Network Based on Program Information Attention Matrix0
The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries0
Untangling Knots: Leveraging LLM for Error Resolution in Computational Notebooks0
A Comprehensive Survey of AI-Driven Advancements and Techniques in Automated Program Repair and Code Generation0
Repository-level Code Search with Neural Retrieval MethodsCode0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.