
mbpp

Papers

Showing 51–60 of 129 papers

Title	Date	Tasks	Status	Hype	Score
Planning In Natural Language Improves LLM Search For Code Generation	Sep 5, 2024	Code Generation, Diversity	Code Available	1	5
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation	Sep 11, 2024	Code Generation, HumanEval	Code Available	1	5
Learning to Generate Unit Tests for Automated Debugging	Feb 3, 2025	HumanEval, Large Language Model	Code Available	1	5
Improving Code Generation by Training with Natural Language Feedback	Mar 28, 2023	Code Generation, Imitation Learning	Code Available	1	5
Unsupervised Evaluation of Code LLMs with Round-Trip Correctness	Feb 13, 2024	HumanEval, mbpp	Code Available	1	5
InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models	Mar 11, 2024	Code Generation, HumanEval	Code Available	1	5
RGD: Multi-LLM Based Agent Debugger via Refinement and Generation Guidance	Oct 2, 2024	Code Generation, HumanEval	Code Available	0	5
Instruction Fusion: Advancing Prompt Evolution through Hybridization	Dec 25, 2023	Code Generation, HumanEval	Code Available	0	5
Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective	Apr 11, 2024	Code Generation, HumanEval	Code Available	0	5
Inference Scaling fLaws: The Limits of LLM Resampling with Imperfect Verifiers	Nov 26, 2024	HumanEval, mbpp	Code Available	0	5


No leaderboard results yet.