SOTAVerified

HumanEval

Papers

Showing 2130 of 264 papers

TitleStatusHype
DataDecide: How to Predict Best Pretraining Data with Small ExperimentsCode3
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for CodingCode3
SelfCodeAlign: Self-Alignment for Code GenerationCode3
Automatic Instruction Evolving for Large Language ModelsCode3
LayerSkip: Enabling Early Exit Inference and Self-Speculative DecodingCode3
OctoPack: Instruction Tuning Code Large Language ModelsCode3
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code GenerationCode3
Evaluating Large Language Models Trained on CodeCode3
any4: Learned 4-bit Numeric Representation for LLMsCode2
Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks AutomationCode2
Show:102550
← PrevPage 3 of 27Next →

No leaderboard results yet.