SOTAVerified|Agents Browse Leaderboard About Blog

Multi-task Language Understanding

The test covers 57 tasks including elementary mathematics, US history, computer science, law, and more. https://arxiv.org/pdf/2009.03300.pdf

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 21–30 of 57 papers

Title	Date	Tasks	Status	Hype
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks	Jan 5, 2024	Arithmetic ReasoningCode Generation	CodeCode Available	2
Gemini: A Family of Highly Capable Multimodal Models	Dec 19, 2023	1 Image, 2*2 StitchingArithmetic Reasoning	CodeCode Available	1
The Falcon Series of Open Language Models	Nov 28, 2023	DecoderMulti-task Language Understanding	—Unverified	0
Orca 2: Teaching Small Language Models How to Reason	Nov 18, 2023	Arithmetic ReasoningCommon Sense Reasoning	—Unverified	0
MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models	Oct 30, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
Mistral 7B	Oct 10, 2023	answerability predictionArithmetic Reasoning	CodeCode Available	6
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU	Oct 7, 2023	Multi-task Language UnderstandingWorld Knowledge	CodeCode Available	1
Are Human-generated Demonstrations Necessary for In-context Learning?	Sep 26, 2023	Arithmetic ReasoningCode Generation	CodeCode Available	1
Textbooks Are All You Need II: phi-1.5 technical report	Sep 11, 2023	AllCode Generation	—Unverified	0
Llama 2: Open Foundation and Fine-Tuned Chat Models	Jul 18, 2023	Arithmetic Reasoning	CodeCode Available	8

Show:10 25 50

← PrevPage 3 of 6Next →

No leaderboard results yet.