GSM8K
Papers
| Title | Status | Hype |
|---|---|---|
| ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools | Code | 14 |

Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Xolver | Accuracy | 98.1 | — | Unverified |
| 2 | Orange-mini | 0-shot MRR | 98 | — | Unverified |