SOTAVerified

Multi-task Language Understanding

The test covers 57 tasks including elementary mathematics, US history, computer science, law, and more. https://arxiv.org/pdf/2009.03300.pdf

Papers

Showing 5157 of 57 papers

TitleStatusHype
Textbooks Are All You Need II: phi-1.5 technical reportCode0
Model Card and Evaluations for Claude Models0
Let's Do a Thought Experiment: Using Counterfactuals to Improve Moral Reasoning0
MERGE: Fast Private Text GenerationCode0
PaLM 2 Technical ReportCode0
BloombergGPT: A Large Language Model for FinanceCode0
Transcending Scaling Laws with 0.1% Extra Compute0
Show:102550
← PrevPage 2 of 2Next →

No leaderboard results yet.