Math Word Problem Solving
A math word problem is a mathematical exercise (such as in a textbook, worksheet, or exam) where significant background information on the problem is presented in ordinary language rather than in mathematical notation. As most word problems involve a narrative of some sort, they are sometimes referred to as story problems and may vary in the amount of technical language used.
Papers
Showing 1–10 of 107 papers
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 DUP | Accuracy | 94.2 | — | Unverified |
| 2 | GPT-4 (Teaching-Inspired) | Execution Accuracy | 93.9 | — | Unverified |
| 3 | GPT-4 (Model Selection) | Execution Accuracy | 93.7 | — | Unverified |
| 4 | Qwen2(CoT + Code Interpreter) | Execution Accuracy | 92.3 | — | Unverified |
| 5 | GPT-4 (PHP) | Execution Accuracy | 91.9 | — | Unverified |
| 6 | OpenMath-CodeLlama-70B (w/ code) | Execution Accuracy | 87.8 | — | Unverified |
| 7 | MathCoder-L-70B | Execution Accuracy | 84.9 | — | Unverified |
| 8 | PoT_Eng (self-consistency @ 5) | Execution Accuracy | 83.7 | — | Unverified |
| 9 | CoT_Eng (self-consistency @ 5) | Execution Accuracy | 82.5 | — | Unverified |
| 10 | MMOS-CODE-34B(0-shot) | Execution Accuracy | 80.6 | — | Unverified |