SOTAVerified

Multi-task Language Understanding

The test covers 57 tasks including elementary mathematics, US history, computer science, law, and more. https://arxiv.org/pdf/2009.03300.pdf

Papers

Showing 4150 of 57 papers

TitleStatusHype
Transcending Scaling Laws with 0.1% Extra Compute0
GLM-130B: An Open Bilingual Pre-trained ModelCode6
Atlas: Few-shot Learning with Retrieval Augmented Language ModelsCode2
Solving Quantitative Reasoning Problems with Language ModelsCode2
UL2: Unifying Language Learning ParadigmsCode1
GPT-NeoX-20B: An Open-Source Autoregressive Language ModelCode1
PaLM: Scaling Language Modeling with PathwaysCode2
Training Compute-Optimal Large Language ModelsCode6
Scaling Language Models: Methods, Analysis & Insights from Training GopherCode2
Merging Models with Fisher-Weighted AveragingCode1
Show:102550
← PrevPage 5 of 6Next →

No leaderboard results yet.