| SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering | May 6, 2024 | Bug fixing, Language Modeling | Code Available | 11 | 5 |
| AutoCodeRover: Autonomous Program Improvement | Apr 8, 2024 | Bug fixing, Code Search | Code Available | 7 | 5 |
| GPT-4 Technical Report | Mar 15, 2023 | answerability prediction, Arithmetic Reasoning | Code Available | 6 | 5 |
| SWE-bench: Can Language Models Resolve Real-World GitHub Issues? | Oct 10, 2023 | Bug fixing, Code Generation | Code Available | 4 | 5 |
| SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development | May 22, 2025 | Bug fixing, Chatbot | Code Available | 2 | 5 |
| CodeR: Issue Resolving with Multi-Agent and Task Graphs | Jun 3, 2024 | Bug fixing | Code Available | 2 | 5 |
| From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging | Oct 2, 2024 | Auto Debugging, Bug fixing | Code Available | 2 | 5 |
| CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and Reranking | Dec 1, 2024 | Bug fixing, Code Generation | Code Available | 2 | 5 |
| Neural Transfer Learning for Repairing Security Vulnerabilities in C Code | Apr 16, 2021 | Bug fixing, C++ code | Code Available | 1 | 5 |
| MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMs | Nov 5, 2024 | Bug fixing, Code Generation | Code Available | 1 | 5 |
| Leveraging Large Language Models for Enhancing the Understandability of Generated Unit Tests | Aug 21, 2024 | Bug fixing, Descriptive | Code Available | 1 | 5 |
| Empirical Study of Transformers for Source Code | Oct 15, 2020 | Bug fixing, Code Completion | Code Available | 1 | 5 |
| D2A: A Dataset Built for AI-Based Vulnerability Detection Methods Using Differential Analysis | Feb 16, 2021 | Bug fixing, Vulnerability Detection | Code Available | 1 | 5 |
| A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source Code | Oct 23, 2020 | Bug fixing, Code Completion | Code Available | 1 | 5 |
| CoreCodeBench: A Configurable Multi-Scenario Repository-Level Benchmark | Jul 4, 2025 | Bug fixing, Code Generation | Code Available | 1 | 5 |
| RoPGen: Towards Robust Code Authorship Attribution via Automatic Coding Style Transformation | Feb 12, 2022 | Authorship Attribution, Bug fixing | Code Available | 1 | 5 |
| CoditT5: Pretraining for Source Code and Natural Language Editing | Aug 10, 2022 | Bug fixing, Language Modeling | Code Available | 1 | 5 |
| FixEval: Execution-based Evaluation of Program Fixes for Programming Problems | Jun 15, 2022 | Bug fixing | Code Available | 1 | 5 |
| Patched RTC: evaluating LLMs for diverse software development tasks | Jul 23, 2024 | Bug fixing, Model Selection | Code Available | 0 | 5 |
| On the Embeddings of Variables in Recurrent Neural Networks for Source Code | Oct 23, 2020 | Bug fixing, Code Completion | Code Available | 0 | 5 |
| Repository-level Code Search with Neural Retrieval Methods | Feb 10, 2025 | Bug fixing, Code Search | Code Available | 0 | 5 |
| Bug Characterization in Machine Learning-based Systems | Jul 26, 2023 | Bug fixing | Code Available | 0 | 5 |
| GREEN-CODE: Learning to Optimize Energy Efficiency in LLM-based Code Generation | Jan 19, 2025 | Bug fixing, Code Completion | Code Available | 0 | 5 |
| DABT: A Dependency-aware Bug Triaging Method | Apr 26, 2021 | Blocking, Bug fixing | Code Available | 0 | 5 |
| Learning the Relation between Code Features and Code Transforms with Structured Prediction | Jul 22, 2019 | Bug fixing, Machine Translation | Code Available | 0 | 5 |