
mbpp

Papers


Showing 61–70 of 129 papers

Title	Date	Tasks	Status	Hype	Score
Teaching Large Language Models to Self-Debug	Apr 11, 2023	Code Generation, Language Modeling	Code Available	0	5
Self-Correcting Code Generation Using Small Language Models	May 29, 2025	Code Generation, HumanEval	Code Available	0	5
Instruction Fusion: Advancing Prompt Evolution through Hybridization	Dec 25, 2023	Code Generation, HumanEval	Code Available	0	5
Underwater Object Tracker: UOSTrack for Marine Organism Grasping of Underwater Vehicles	Jan 4, 2023	Data Augmentation, mbpp	Code Available	0	5
Enhancing Large Language Models in Coding Through Multi-Perspective Self-Consistency	Sep 29, 2023	Code Generation, HumanEval	Code Available	0	5
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation	Oct 1, 2024	Code Generation, HumanEval	Code Available	0	5
Inference Scaling fLaws: The Limits of LLM Resampling with Imperfect Verifiers	Nov 26, 2024	HumanEval, mbpp	Code Available	0	5
Textbooks Are All You Need	Jun 20, 2023	All, Code Generation	Unverified	0	0
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code	Mar 12, 2024	Code Generation, HumanEval	Unverified	0	0
LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models	May 25, 2025	GSM8K, HumanEval	Unverified	0	0

