SOTAVerified

HellaSwag

Papers

Showing 2130 of 39 papers

TitleStatusHype
Towards Multilingual LLM Evaluation for European Languages0
Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs0
GRIN: GRadient-INformed MoE0
You can remove GPT2's LayerNorm by fine-tuningCode0
metabench -- A Sparse Benchmark to Measure General Ability in Large Language ModelsCode0
Promises, Outlooks and Challenges of Diffusion Language Modeling0
SaGE: Evaluating Moral Consistency in Large Language ModelsCode0
Attacks on Node Attributes in Graph Neural NetworksCode0
Who's Harry Potter? Approximate Unlearning in LLMs0
Contrastive Decoding Improves Reasoning in Large Language Models0
Show:102550
← PrevPage 3 of 4Next →

No leaderboard results yet.