| TheoremQA: A Theorem-driven Question Answering dataset | May 21, 2023 | Math, Question Answering | Code Available | 1 |
| Non-Autoregressive Math Word Problem Solver with Unified Tree Structure | May 8, 2023 | Math, valid | Code Available | 1 |
| Solving Math Word Problems by Combining Language Models With Symbolic Solvers | Apr 16, 2023 | GSM8K, Language Modeling | Code Available | 1 |
| From Zero to Hero: Convincing with Extremely Complicated Math | Apr 1, 2023 | Math | Code Available | 1 |
| How well do Large Language Models perform in Arithmetic tasks? | Mar 16, 2023 | Math | Code Available | 1 |
| SALSA PICANTE: a machine learning attack on LWE with binary secrets | Mar 7, 2023 | Math | Code Available | 1 |
| MathPrompter: Mathematical Reasoning using Large Language Models | Mar 4, 2023 | Arithmetic Reasoning, Math | Code Available | 1 |
| LEVER: Learning to Verify Language-to-Code Generation with Execution | Feb 16, 2023 | Arithmetic Reasoning, Code Generation | Code Available | 1 |
| Tree-Based Representation and Generation of Natural and Mathematical Language | Feb 15, 2023 | Math, Mathematical Reasoning | Code Available | 1 |
| A Categorical Archive of ChatGPT Failures | Feb 6, 2023 | Math | Code Available | 1 |