| A Rule-Based Computational Model of Cognitive Arithmetic | May 3, 2017 | Mathmodel | —Unverified | 0 | 0 |
| MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts | Feb 28, 2025 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions | May 26, 2025 | AttributeMath | —Unverified | 0 | 0 |
| MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| MWPRanker: An Expression Similarity Based Math Word Problem Retriever | Jul 3, 2023 | Logical SequenceMath | —Unverified | 0 | 0 |
| A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science | Mar 21, 2024 | Active LearningMath | —Unverified | 0 | 0 |
| TTT-Bench: A Benchmark for Evaluating Reasoning Ability with Simple and Novel Tic-Tac-Toe-style Games | Jun 11, 2025 | Logical ReasoningMath | —Unverified | 0 | 0 |
| NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions | Feb 18, 2025 | Knowledge DistillationMath | —Unverified | 0 | 0 |
| Natural- to formal-language generation using Tensor Product Representations | Sep 25, 2019 | DecoderMath | —Unverified | 0 | 0 |
| Navigating the Labyrinth: Evaluating and Enhancing LLMs' Ability to Reason About Search Problems | Jun 18, 2024 | In-Context LearningMath | —Unverified | 0 | 0 |