SOTAVerified

Multi-task Language Understanding

The test covers 57 tasks including elementary mathematics, US history, computer science, law, and more. https://arxiv.org/pdf/2009.03300.pdf

Papers

Showing 2130 of 57 papers

TitleStatusHype
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General TasksCode2
Gemini: A Family of Highly Capable Multimodal ModelsCode1
The Falcon Series of Open Language Models0
Orca 2: Teaching Small Language Models How to Reason0
MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language ModelsCode1
Mistral 7BCode6
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLUCode1
Are Human-generated Demonstrations Necessary for In-context Learning?Code1
Textbooks Are All You Need II: phi-1.5 technical reportCode0
Llama 2: Open Foundation and Fine-Tuned Chat ModelsCode8
Show:102550
← PrevPage 3 of 6Next →

No leaderboard results yet.