SOTAVerified

Bug fixing

Papers

Showing 125 of 62 papers

TitleStatusHype
SWE-agent: Agent-Computer Interfaces Enable Automated Software EngineeringCode11
AutoCodeRover: Autonomous Program ImprovementCode7
GPT-4 Technical ReportCode6
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?Code4
SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software DevelopmentCode2
CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and RerankingCode2
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical DebuggingCode2
CodeR: Issue Resolving with Multi-Agent and Task GraphsCode2
CoreCodeBench: A Configurable Multi-Scenario Repository-Level BenchmarkCode1
MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMsCode1
Leveraging Large Language Models for Enhancing the Understandability of Generated Unit TestsCode1
CoditT5: Pretraining for Source Code and Natural Language EditingCode1
FixEval: Execution-based Evaluation of Program Fixes for Programming ProblemsCode1
RoPGen: Towards Robust Code Authorship Attribution via Automatic Coding Style TransformationCode1
Neural Transfer Learning for Repairing Security Vulnerabilities in C CodeCode1
D2A: A Dataset Built for AI-Based Vulnerability Detection Methods Using Differential AnalysisCode1
A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source CodeCode1
Empirical Study of Transformers for Source CodeCode1
The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries0
LongCodeBench: Evaluating Coding LLMs at 1M Context Windows0
APE-Bench I: Towards File-level Automated Proof Engineering of Formal Math Libraries0
VeriDebug: A Unified LLM for Verilog Debugging via Contrastive Embedding and Guided Correction0
On Simulation-Guided LLM-based Code Generation for Safe Autonomous Driving Software0
Less is More: Adaptive Program Repair with Bug Localization and Preference LearningCode0
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.