| CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets | Sep 29, 2023 | Language ModellingMathematical Reasoning | CodeCode Available | 2 |
| ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving | Sep 29, 2023 | Arithmetic ReasoningComputational Efficiency | CodeCode Available | 3 |
| LPML: LLM-Prompting Markup Language for Mathematical Reasoning | Sep 21, 2023 | Mathematical Reasoning | —Unverified | 0 |
| Code Soliloquies for Accurate Calculations in Large Language Models | Sep 21, 2023 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models | Sep 21, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 2 |
| Auto-Regressive Next-Token Predictors are Universal Learners | Sep 13, 2023 | Mathematical ReasoningText Generation | CodeCode Available | 1 |
| MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning | Sep 11, 2023 | MathMathematical Reasoning | CodeCode Available | 2 |
| On the meaning of uncertainty for ethical AI: philosophy and practice | Sep 11, 2023 | Decision MakingMathematical Reasoning | —Unverified | 0 |
| No Train Still Gain. Unleash Mathematical Reasoning of Large Language Models with Monte Carlo Tree Search Guided by Energy Function | Sep 1, 2023 | GSM8KMathematical Reasoning | —Unverified | 0 |
| When Do Program-of-Thoughts Work for Reasoning? | Aug 29, 2023 | Code GenerationMathematical Reasoning | CodeCode Available | 2 |