MMLU
Papers
Showing 1–1 of 1 papers
| Title | Status | Hype |
|---|---|---|
| ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools | Code | 14 |
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | go ahead, make my data | Final_score | 61.72 | — | Unverified |
| 2 | #GreedyCow | Final_score | 61.63 | — | Unverified |
| 3 | Don't Ask Us y | Final_score | 61.4 | — | Unverified |
| 4 | Data_and_Confused | Final_score | 60.96 | — | Unverified |
| 5 | Waffles | Final_score | 60.91 | — | Unverified |
| 6 | raaka | Final_score | 60.91 | — | Unverified |
| 7 | Team Procrustination | Final_score | 60.64 | — | Unverified |
| 8 | Axiom Consulting Partners | Final_score | 60.63 | — | Unverified |
| 9 | Lets_Be_Fair | Final_score | 60.23 | — | Unverified |
| 10 | gooners | Final_score | 60.22 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Orange-mini | 0-shot MRR | 99.19 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HybridBeam+ | SI-SDRi | 13.3 | — | Unverified |