| PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback | Jul 27, 2023 | Code Generation, HumanEval | Unverified | 0 |
| Textbooks Are All You Need | Jun 20, 2023 | All, Code Generation | Unverified | 0 |
| Large Language Models of Code Fail at Completing Code with Potential Bugs | Jun 6, 2023 | Code Completion, HumanEval | Code Available | 0 |
| SelfEvolve: A Code Evolution Framework via Large Language Models | Jun 5, 2023 | Code Generation, HumanEval | Unverified | 0 |
| CodeT5+: Open Code Large Language Models for Code Understanding and Generation | May 13, 2023 | Arithmetic Reasoning, Code Completion | Code Available | 0 |
| Structured Chain-of-Thought Prompting for Code Generation | May 11, 2023 | Code Generation, HumanEval | Unverified | 0 |
| Self-Edit: Fault-Aware Code Editor for Code Generation | May 6, 2023 | Code Generation, HumanEval | Code Available | 0 |
| Using Large Language Models to Generate JUnit Tests: An Empirical Study | Apr 30, 2023 | Code Generation, HumanEval | Code Available | 0 |
| Stochastic Code Generation | Apr 14, 2023 | Code Generation, Decoder | Unverified | 0 |
| Large Language Models Meet NL2Code: A Survey | Dec 19, 2022 | HumanEval, Survey | Unverified | 0 |
| The Stack: 3 TB of permissively licensed source code | Nov 20, 2022 | HumanEval, MBPP | Unverified | 0 |
| Evaluating How Fine-tuning on Bimodal Data Effects Code Generation | Nov 15, 2022 | Code Generation, HumanEval | Code Available | 0 |
| Piloting Copilot, Codex, and StarCoder2: Hot Temperature, Cold Prompts, or Black Magic? | Oct 26, 2022 | HumanEval, Language Modelling | Unverified | 0 |
| Interactive Code Generation via Test-Driven User-Intent Formalization | Aug 11, 2022 | Code Generation, HumanEval | Unverified | 0 |