
Multi-task Language Understanding

The test covers 57 tasks, including elementary mathematics, US history, computer science, law, and more (https://arxiv.org/pdf/2009.03300.pdf).
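The benchmark is scored as plain accuracy over multiple-choice questions, reported per task and macro-averaged across the 57 tasks. A minimal scoring sketch, assuming hypothetical task names and illustrative (not real) question data:

```python
# Minimal sketch of MMLU-style scoring: each task is a set of
# four-choice questions; the benchmark reports per-task accuracy
# and a macro-average over tasks. All data below is illustrative.

def score(predictions, answers):
    """Fraction of questions where the predicted choice matches the key."""
    correct = sum(p == a for p, a in zip(predictions, answers))
    return correct / len(answers)

# Hypothetical (prediction, answer-key) pairs for two of the 57 tasks.
tasks = {
    "elementary_mathematics": (["A", "C", "B"], ["A", "C", "D"]),
    "us_history": (["B", "B"], ["B", "A"]),
}

per_task = {name: score(pred, gold) for name, (pred, gold) in tasks.items()}
overall = sum(per_task.values()) / len(per_task)  # macro-average over tasks
```

Note that the macro-average weights every task equally regardless of how many questions it contains, which is why per-task accuracy is computed first.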

Papers

Showing 31–40 of 57 papers

Title | Status | Hype
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU | Code | 1
Are Human-generated Demonstrations Necessary for In-context Learning? | Code | 1
UL2: Unifying Language Learning Paradigms | Code | 1
GPT-NeoX-20B: An Open-Source Autoregressive Language Model | Code | 1
Merging Models with Fisher-Weighted Averaging | Code | 1
UnifiedQA: Crossing Format Boundaries With a Single QA System | Code | 1
RoBERTa: A Robustly Optimized BERT Pretraining Approach | Code | 1
Language Models are Unsupervised Multitask Learners | Code | 1
Measuring Hong Kong Massive Multi-Task Language Understanding | — | 0
Effectiveness of Zero-shot-CoT in Japanese Prompts | — | 0
Page 4 of 6

No leaderboard results yet.