SOTAVerified

HellaSwag

Papers

Showing 1-10 of 39 papers

Title | Status | Hype
Training Compute-Optimal Large Language Models | Code | 6
DataDecide: How to Predict Best Pretraining Data with Small Experiments | Code | 3
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding | Code | 3
Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Code | 2
LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization | Code | 1
Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models | Code | 1
An Open Source Data Contamination Report for Large Language Models | Code | 1
When Chosen Wisely, More Data Is What You Need: A Universal Sample-Efficient Strategy For Data Augmentation | Code | 1
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark | Code | 1
Slimming Down LLMs Without Losing Their Minds | - | 0

No leaderboard results yet.