SOTAVerified

HellaSwag

Papers

Showing 1120 of 39 papers

TitleStatusHype
FinerWeb-10BT: Refining Web Data with LLM-Based Line-Level FilteringCode0
SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs0
LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA OptimizationCode1
Towards Multilingual LLM Evaluation for European Languages0
Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs0
GRIN: GRadient-INformed MoE0
You can remove GPT2's LayerNorm by fine-tuningCode0
metabench -- A Sparse Benchmark to Measure General Ability in Large Language ModelsCode0
Promises, Outlooks and Challenges of Diffusion Language Modeling0
LayerSkip: Enabling Early Exit Inference and Self-Speculative DecodingCode3
Show:102550
← PrevPage 2 of 4Next →

No leaderboard results yet.