| SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering | May 6, 2024 | Bug fixingLanguage Modeling | CodeCode Available | 11 |
| AutoCodeRover: Autonomous Program Improvement | Apr 8, 2024 | Bug fixingCode Search | CodeCode Available | 7 |
| GPT-4 Technical Report | Mar 15, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| SWE-bench: Can Language Models Resolve Real-World GitHub Issues? | Oct 10, 2023 | Bug fixingCode Generation | CodeCode Available | 4 |
| SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development | May 22, 2025 | Bug fixingChatbot | CodeCode Available | 2 |
| CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and Reranking | Dec 1, 2024 | Bug fixingCode Generation | CodeCode Available | 2 |
| CodeR: Issue Resolving with Multi-Agent and Task Graphs | Jun 3, 2024 | Bug fixing | CodeCode Available | 2 |
| From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging | Oct 2, 2024 | Auto DebuggingBug fixing | CodeCode Available | 2 |
| Empirical Study of Transformers for Source Code | Oct 15, 2020 | Bug fixingCode Completion | CodeCode Available | 1 |
| A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source Code | Oct 23, 2020 | Bug fixingCode Completion | CodeCode Available | 1 |