SOTAVerified

HellaSwag

Papers

Showing 2130 of 39 papers

TitleStatusHype
What the HellaSwag? On the Validity of Common-Sense Reasoning BenchmarksCode0
Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs0
Promises, Outlooks and Challenges of Diffusion Language Modeling0
Comparing Test Sets with Item Response Theory0
English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too0
Self-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning Catalyst0
When Chosen Wisely, More Data Is What You Need: A Universal Sample-Efficient Strategy For Data Augmentation0
Slimming Down LLMs Without Losing Their Minds0
SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs0
Contrastive Decoding Improves Reasoning in Large Language Models0
Show:102550
← PrevPage 3 of 4Next →

No leaderboard results yet.