| Planning In Natural Language Improves LLM Search For Code Generation | Sep 5, 2024 | Code GenerationDiversity | CodeCode Available | 1 | 5 |
| Policy Filtration in RLHF to Fine-Tune LLM for Code Generation | Sep 11, 2024 | Code GenerationHumanEval | CodeCode Available | 1 | 5 |
| Learning to Generate Unit Tests for Automated Debugging | Feb 3, 2025 | HumanEvalLarge Language Model | CodeCode Available | 1 | 5 |
| Improving Code Generation by Training with Natural Language Feedback | Mar 28, 2023 | Code GenerationImitation Learning | CodeCode Available | 1 | 5 |
| Unsupervised Evaluation of Code LLMs with Round-Trip Correctness | Feb 13, 2024 | HumanEvalmbpp | CodeCode Available | 1 | 5 |
| InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models | Mar 11, 2024 | Code GenerationHumanEval | CodeCode Available | 1 | 5 |
| RGD: Multi-LLM Based Agent Debugger via Refinement and Generation Guidance | Oct 2, 2024 | Code GenerationHumanEval | CodeCode Available | 0 | 5 |
| Instruction Fusion: Advancing Prompt Evolution through Hybridization | Dec 25, 2023 | Code GenerationHumanEval | CodeCode Available | 0 | 5 |
| Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective | Apr 11, 2024 | Code GenerationHumanEval | CodeCode Available | 0 | 5 |
| Inference Scaling fLaws: The Limits of LLM Resampling with Imperfect Verifiers | Nov 26, 2024 | HumanEvalmbpp | CodeCode Available | 0 | 5 |