| Automating Mathematical Proof Generation Using Large Language Model Agents and Knowledge Graphs | Feb 4, 2025 | Formal Logic, Knowledge Graphs | —Unverified | 0 | 0 |
| STRIVE: Structured Reasoning for Self-Improvement in Claim Verification | Feb 17, 2025 | Claim Verification, Mathematical Problem-Solving | —Unverified | 0 | 0 |
| Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations | May 16, 2025 | Code Generation, Mathematical Problem-Solving | —Unverified | 0 | 0 |
| Automatic Detection of Reflective Thinking in Mathematical Problem Solving based on Unconstrained Bodily Exploration | Dec 18, 2018 | Mathematical Problem-Solving | —Unverified | 0 | 0 |
| Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving | Feb 17, 2025 | Math, Mathematical Problem-Solving | —Unverified | 0 | 0 |
| TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving | Jun 12, 2025 | Logical Reasoning, Mathematical Problem-Solving | —Unverified | 0 | 0 |
| The Consensus Game: Language Model Generation via Equilibrium Search | Oct 13, 2023 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Three Questions Concerning the Use of Large Language Models to Facilitate Mathematics Learning | Oct 20, 2023 | Mathematical Problem-Solving, Position | —Unverified | 0 | 0 |
| Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving | Jan 28, 2025 | Math, Mathematical Problem-Solving | —Unverified | 0 | 0 |
| Large Language Models for Mathematical Reasoning: Progresses and Challenges | Jan 31, 2024 | Diversity, Math | —Unverified | 0 | 0 |