SOTAVerified

mbpp

Papers

Showing 110 of 129 papers

TitleStatusHype
CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph DatabasesCode7
EvoAgentX: An Automated Framework for Evolving Agentic WorkflowsCode7
Code Llama: Open Foundation Models for CodeCode6
WizardCoder: Empowering Code Large Language Models with Evol-InstructCode5
OpenCodeInterpreter: Integrating Code Generation with Execution and RefinementCode5
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-stepCode4
Web-Bench: A LLM Code Benchmark Based on Web Standards and FrameworksCode3
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for CodingCode3
DataDecide: How to Predict Best Pretraining Data with Small ExperimentsCode3
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code GenerationCode2
Show:102550
← PrevPage 1 of 13Next →

No leaderboard results yet.