| LORD: Low Rank Decomposition Of Monolingual Code LLMs For One-Shot Compression | Sep 25, 2023 | Code GenerationHumanEval | —Unverified | 0 |
| Baichuan 2: Open Large-scale Language Models | Sep 19, 2023 | Feature EngineeringGSM8K | CodeCode Available | 4 |
| Can Programming Languages Boost Each Other via Instruction Tuning? | Aug 31, 2023 | HumanEval | CodeCode Available | 0 |
| Code Llama: Open Foundation Models for Code | Aug 24, 2023 | 16kCode Generation | CodeCode Available | 6 |
| CodeCoT: Tackling Code Syntax Errors in CoT Reasoning for Code Generation | Aug 17, 2023 | Code GenerationFew-Shot Learning | —Unverified | 0 |
| OctoPack: Instruction Tuning Code Large Language Models | Aug 14, 2023 | Code GenerationCode Repair | CodeCode Available | 3 |
| ClassEval: A Manually-Crafted Benchmark for Evaluating LLMs on Class-level Code Generation | Aug 3, 2023 | Class-level Code GenerationCode Generation | CodeCode Available | 1 |
| PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback | Jul 27, 2023 | Code GenerationHumanEval | —Unverified | 0 |
| Predicting Code Coverage without Execution | Jul 25, 2023 | HumanEval | CodeCode Available | 1 |
| Textbooks Are All You Need | Jun 20, 2023 | AllCode Generation | —Unverified | 0 |
| Is Self-Repair a Silver Bullet for Code Generation? | Jun 16, 2023 | Code GenerationHumanEval | CodeCode Available | 1 |
| WizardCoder: Empowering Code Large Language Models with Evol-Instruct | Jun 14, 2023 | Code GenerationHumanEval | CodeCode Available | 5 |
| Large Language Models of Code Fail at Completing Code with Potential Bugs | Jun 6, 2023 | Code CompletionHumanEval | CodeCode Available | 0 |
| SelfEvolve: A Code Evolution Framework via Large Language Models | Jun 5, 2023 | Code GenerationHumanEval | —Unverified | 0 |
| ANPL: Towards Natural Programming with Interactive Decomposition | May 29, 2023 | ARCCode Generation | CodeCode Available | 1 |
| LeTI: Learning to Generate from Textual Interactions | May 17, 2023 | Code GenerationEvent Argument Extraction | CodeCode Available | 1 |
| CodeT5+: Open Code Large Language Models for Code Understanding and Generation | May 13, 2023 | Arithmetic ReasoningCode Completion | CodeCode Available | 0 |
| Structured Chain-of-Thought Prompting for Code Generation | May 11, 2023 | Code GenerationHumanEval | —Unverified | 0 |
| StarCoder: may the source be with you! | May 9, 2023 | 8kCode Generation | CodeCode Available | 5 |
| Self-Edit: Fault-Aware Code Editor for Code Generation | May 6, 2023 | Code GenerationHumanEval | CodeCode Available | 0 |
| Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation | May 2, 2023 | Code GenerationHumanEval | CodeCode Available | 3 |
| Using Large Language Models to Generate JUnit Tests: An Empirical Study | Apr 30, 2023 | Code GenerationHumanEval | CodeCode Available | 0 |
| Stochastic Code Generation | Apr 14, 2023 | Code GenerationDecoder | —Unverified | 0 |
| CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X | Mar 30, 2023 | BenchmarkingCode Generation | CodeCode Available | 5 |
| Reflexion: Language Agents with Verbal Reinforcement Learning | Mar 20, 2023 | Decision MakingHumanEval | CodeCode Available | 4 |