| MATHion: Solving Math Word Problems with Logically Consistent Problems | Nov 16, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| Towards Tractable Mathematical Reasoning: Challenges, Strategies, and Opportunities for Solving Math Word Problems | Oct 29, 2021 | Answer Generation, Math | —Unverified | 0 |
| A Theme-Rewriting Approach for Generating Algebra Word Problems | Oct 19, 2016 | Math, Text Generation | —Unverified | 0 |
| Math Multiple Choice Question Generation via Human-Large Language Model Collaboration | May 1, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| Towards Trustworthy AutoGrading of Short, Multi-lingual, Multi-type Answers | Jan 2, 2022 | Math, Vocal Bursts Type Prediction | —Unverified | 0 |
| Atari games and Intel processors | May 19, 2017 | Atari Games, BIG-bench Machine Learning | —Unverified | 0 |
| Math Operation Embeddings for Open-ended Solution Analysis and Feedback | Apr 25, 2021 | Math | —Unverified | 0 |
| MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations | Feb 10, 2025 | Benchmarking, In-Context Learning | —Unverified | 0 |
| MathPhys-Guided Coarse-to-Fine Anomaly Synthesis with SQE-Driven Bi-Level Optimization for Anomaly Detection | Apr 17, 2025 | Anomaly Detection, Data Augmentation | —Unverified | 0 |
| Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition | Apr 29, 2025 | GSM8K, Knowledge Distillation | —Unverified | 0 |
| math-PVS: A Large Language Model Framework to Map Scientific Publications to PVS Theories | Oct 25, 2023 | Automated Theorem Proving, Language Modeling | —Unverified | 0 |
| MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms | May 30, 2019 | Math, Math Word Problem Solving | —Unverified | 0 |
| Math Search for the Masses: Multimodal Search Interfaces and Appearance-Based Retrieval | May 11, 2015 | Math, Retrieval | —Unverified | 0 |
| MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education | Apr 10, 2024 | Math | —Unverified | 0 |
| MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? | Mar 21, 2024 | Math, Mathematical Reasoning | —Unverified | 0 |
| A Tag-based English Math Word Problem Solver with Understanding, Reasoning and Explanation | Jun 1, 2016 | Math, TAG | —Unverified | 0 |
| When Dimensionality Reduction Meets Graph (Drawing) Theory: Introducing a Common Framework, Challenges and Opportunities | Dec 9, 2024 | Dimensionality Reduction, Math | —Unverified | 0 |
| When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs | Jun 25, 2025 | Math | —Unverified | 0 |
| Math Word Problem Generation with Mathematical Consistency and Problem Context Constraints | Sep 9, 2021 | Diversity, Math | —Unverified | 0 |
| Matryoshka Model Learning for Improved Elastic Student Models | May 29, 2025 | LAMBADA, Math | —Unverified | 0 |
| Asymptotic expression for the fixation probability of a mutant in star graphs | Mar 18, 2016 | Math | —Unverified | 0 |
| Maximizing Confidence Alone Improves Reasoning | May 28, 2025 | GSM8K, Math | —Unverified | 0 |
| MDIT: A Model-free Data Interpolation Method for Diverse Instruction Tuning | Apr 9, 2025 | Code Generation, Diversity | —Unverified | 0 |
| Training Large Language Models to Reason via EM Policy Gradient | Apr 24, 2025 | GSM8K, Math | —Unverified | 0 |
| Measurement to Meaning: A Validity-Centered Framework for AI Evaluation | May 13, 2025 | Math | —Unverified | 0 |