Multi-task Language Understanding
The test covers 57 tasks including elementary mathematics, US history, computer science, law, and more. https://arxiv.org/pdf/2009.03300.pdf
Papers
Showing 1–1 of 1 papers
| Title | Status | Hype |
|---|---|---|
| DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning | Code | 15 |
No leaderboard results yet.