| MATHion: Solving Math Word Problems with Logically Consistent Problems | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Tractable Mathematical Reasoning: Challenges, Strategies, and Opportunities for Solving Math Word Problems | Oct 29, 2021 | Answer GenerationMath | —Unverified | 0 |
| A Theme-Rewriting Approach for Generating Algebra Word Problems | Oct 19, 2016 | MathText Generation | —Unverified | 0 |
| Math Multiple Choice Question Generation via Human-Large Language Model Collaboration | May 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Trustworthy AutoGrading of Short, Multi-lingual, Multi-type Answers | Jan 2, 2022 | MathVocal Bursts Type Prediction | —Unverified | 0 |
| Atari games and Intel processors | May 19, 2017 | Atari GamesBIG-bench Machine Learning | —Unverified | 0 |
| Math Operation Embeddings for Open-ended Solution Analysis and Feedback | Apr 25, 2021 | Math | —Unverified | 0 |
| MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations | Feb 10, 2025 | BenchmarkingIn-Context Learning | —Unverified | 0 |
| MathPhys-Guided Coarse-to-Fine Anomaly Synthesis with SQE-Driven Bi-Level Optimization for Anomaly Detection | Apr 17, 2025 | Anomaly DetectionData Augmentation | —Unverified | 0 |
| Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition | Apr 29, 2025 | GSM8KKnowledge Distillation | —Unverified | 0 |
| math-PVS: A Large Language Model Framework to Map Scientific Publications to PVS Theories | Oct 25, 2023 | Automated Theorem ProvingLanguage Modeling | —Unverified | 0 |
| MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms | May 30, 2019 | MathMath Word Problem Solving | —Unverified | 0 |
| Math Search for the Masses: Multimodal Search Interfaces and Appearance-Based Retrieval | May 11, 2015 | MathRetrieval | —Unverified | 0 |
| MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education | Apr 10, 2024 | Math | —Unverified | 0 |
| MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? | Mar 21, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| A Tag-based English Math Word Problem Solver with Understanding, Reasoning and Explanation | Jun 1, 2016 | MathTAG | —Unverified | 0 |
| When Dimensionality Reduction Meets Graph (Drawing) Theory: Introducing a Common Framework, Challenges and Opportunities | Dec 9, 2024 | Dimensionality ReductionMath | —Unverified | 0 |
| When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs | Jun 25, 2025 | Math | —Unverified | 0 |
| Math Word Problem Generation with Mathematical Consistency and Problem Context Constraints | Sep 9, 2021 | DiversityMath | —Unverified | 0 |
| Matryoshka Model Learning for Improved Elastic Student Models | May 29, 2025 | LAMBADAMath | —Unverified | 0 |
| Asymptotic expression for the fixation probability of a mutant in star graphs | Mar 18, 2016 | Math | —Unverified | 0 |
| Maximizing Confidence Alone Improves Reasoning | May 28, 2025 | GSM8KMath | —Unverified | 0 |
| MDIT: A Model-free Data Interpolation Method for Diverse Instruction Tuning | Apr 9, 2025 | Code GenerationDiversity | —Unverified | 0 |
| Training Large Language Models to Reason via EM Policy Gradient | Apr 24, 2025 | GSM8KMath | —Unverified | 0 |
| Measurement to Meaning: A Validity-Centered Framework for AI Evaluation | May 13, 2025 | Math | —Unverified | 0 |
| Measuring and Improving BERT's Mathematical Abilities by Predicting the Order of Reasoning | Jun 7, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Measuring and Improving BERT's Mathematical Abilities by Predicting the Order of Reasoning. | Aug 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| When Not to Answer: Evaluating Prompts on GPT Models for Effective Abstention in Unanswerable Math Word Problems | Oct 16, 2024 | HallucinationMath | —Unverified | 0 |
| Measuring Large Language Models Capacity to Annotate Journalistic Sourcing | Dec 30, 2024 | BenchmarkingEthics | —Unverified | 0 |
| Asymptotic behavior of mean fixation times in the Moran process with frequency-independent fitnesses | Dec 30, 2022 | Math | —Unverified | 0 |
| Mechanochemical models for calcium waves in embryonic epithelia | Nov 3, 2021 | Math | —Unverified | 0 |
| To Err is Machine: Vulnerability Detection Challenges LLM Reasoning | Mar 25, 2024 | Code GenerationIn-Context Learning | —Unverified | 0 |
| Med-RLVR: Emerging Medical Reasoning from a 3B base model via reinforcement Learning | Feb 27, 2025 | MathMedical Question Answering | —Unverified | 0 |
| A Survey on Multimodal Large Language Models | Jun 23, 2023 | HallucinationIn-Context Learning | —Unverified | 0 |
| A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics | Feb 20, 2025 | Math | —Unverified | 0 |
| Translating a Math Word Problem to a Expression Tree | Oct 1, 2018 | Machine TranslationMath | —Unverified | 0 |
| Mental Stress Detection: Development and Evaluation of a Wearable In-Ear Plethysmography | Apr 12, 2024 | MathMental Stress Detection | —Unverified | 0 |
| Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving | May 20, 2024 | GSM8KMath | —Unverified | 0 |
| A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law | May 5, 2025 | MathMedical Diagnosis | —Unverified | 0 |
| A Survey of Question Answering for Math and Science Problem | May 10, 2017 | MathQuestion Answering | —Unverified | 0 |
| INC-Math: Integrating Natural Language and Code for Enhanced Mathematical Reasoning in Large Language Models | Sep 28, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Study on Leveraging Search and Self-Feedback for Agent Reasoning | Feb 17, 2025 | Math | —Unverified | 0 |
| Metric-agnostic Ranking Optimization | Apr 17, 2023 | Information RetrievalLearning-To-Rank | —Unverified | 0 |
| MIaS: Math-Aware Retrieval in Digital Mathematical Libraries | Aug 28, 2018 | Information RetrievalMath | —Unverified | 0 |
| MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning | Oct 23, 2024 | MathMixture-of-Experts | —Unverified | 0 |
| A Study of PHOC Spatial Region Configurations for Math Formula Retrieval | Aug 17, 2024 | MathRetrieval | —Unverified | 0 |
| MIND: Math Informed syNthetic Dialogues for Pretraining LLMs | Oct 15, 2024 | GSM8KMath | —Unverified | 0 |
| Mind meets machine: Unravelling GPT-4's cognitive psychology | Mar 20, 2023 | Common Sense ReasoningDecision Making | —Unverified | 0 |
| MindStar: Enhancing Math Reasoning in Pre-trained LLMs at Inference Time | May 25, 2024 | GSM8KMath | —Unverified | 0 |