SOTAVerified

HellaSwag

Papers

Showing 21-30 of 39 papers

Title | Status | Hype
Contrastive Decoding Improves Reasoning in Large Language Models | | 0
Towards Multilingual LLM Evaluation for European Languages | | 0
GRIN: GRadient-INformed MoE | | 0
When Chosen Wisely, More Data Is What You Need: A Universal Sample-Efficient Strategy For Data Augmentation | | 0
Who's Harry Potter? Approximate Unlearning in LLMs | | 0
HellaSwag-Pro: A Large-Scale Bilingual Benchmark for Evaluating the Robustness of LLMs in Commonsense Reasoning | | 0
Domain-Adaptive Continued Pre-Training of Small Language Models | | 0
You can remove GPT2's LayerNorm by fine-tuning | Code | 0
Attacks on Node Attributes in Graph Neural Networks | Code | 0
FinerWeb-10BT: Refining Web Data with LLM-Based Line-Level Filtering | Code | 0

No leaderboard results yet.