SOTAVerified

mbpp

Papers

Showing 21-30 of 129 papers

Title | Status | Hype
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation | Code | 2
Fault-Aware Neural Code Rankers | Code | 1
Getting the most out of your tokenizer for pre-training and domain adaptation | Code | 1
InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct | Code | 1
HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation | Code | 1
Invisible Entropy: Towards Safe and Efficient Low-Entropy LLM Watermarking | Code | 1
DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning | Code | 1
Can Language Models Replace Programmers for Coding? REPOCOD Says 'Not Yet' | Code | 1
Clover: Closed-Loop Verifiable Code Generation | Code | 1
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion | Code | 1

No leaderboard results yet.