SOTAVerified

HumanEval

Papers

Showing 2130 of 264 papers

TitleStatusHype
DataDecide: How to Predict Best Pretraining Data with Small ExperimentsCode3
Automatic Instruction Evolving for Large Language ModelsCode3
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code GenerationCode3
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for CodingCode3
Evaluating Large Language Models Trained on CodeCode3
LayerSkip: Enabling Early Exit Inference and Self-Speculative DecodingCode3
OctoPack: Instruction Tuning Code Large Language ModelsCode3
SelfCodeAlign: Self-Alignment for Code GenerationCode3
MapCoder: Multi-Agent Code Generation for Competitive Problem SolvingCode2
MasRouter: Learning to Route LLMs for Multi-Agent SystemsCode2
Show:102550
← PrevPage 3 of 27Next →

No leaderboard results yet.