mbpp

Papers

Showing 11–20 of 129 papers

Title	Date	Tasks	Status	Hype
Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning	Apr 14, 2025	Mathematical Reasoning, mbpp	Code Available	2
MasRouter: Learning to Route LLMs for Multi-Agent Systems	Feb 16, 2025	HumanEval, mbpp	Code Available	2
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging	Feb 8, 2025	Code Generation, HumanEval	Code Available	2
A Survey on Large Language Models for Code Generation	Jun 1, 2024	Code Generation, HumanEval	Code Available	2
MapCoder: Multi-Agent Code Generation for Competitive Problem Solving	May 18, 2024	Code Generation, HumanEval	Code Available	2
NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts	May 7, 2024	HumanEval, mbpp	Code Available	2
AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation	Dec 20, 2023	Code Generation, HumanEval	Code Available	2
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback	Jun 26, 2023	Benchmarking, Code Generation	Code Available	2
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation	Aug 17, 2022	Benchmarking, Code Generation	Code Available	2
CodeT: Code Generation with Generated Tests	Jul 21, 2022	Code Generation, HumanEval	Code Available	2

No leaderboard results yet.