| Mining Commonsense and Domain Knowledge from Math Word Problems | Oct 1, 2021 | Math | —Unverified | 0 |
| Mining Math Conjectures from LLMs: A Pruning Approach | Dec 9, 2024 | Math | —Unverified | 0 |
| Assignment Flows for Data Labeling on Graphs: Convergence and Stability | Feb 26, 2020 | General ClassificationMath | —Unverified | 0 |
| Assessing the impact of social activity permissiveness on the COVID-19 infection curve of several countries | Jun 8, 2021 | Math | —Unverified | 0 |
| Mixture of Parrots: Experts improve memorization more than reasoning | Oct 24, 2024 | MathMemorization | —Unverified | 0 |
| ML2SC: Deploying Machine Learning Models as Smart Contracts on the Blockchain | Mar 28, 2024 | Math | —Unverified | 0 |
| MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency | Feb 13, 2025 | BenchmarkingMath | —Unverified | 0 |
| Assessing the Impact of Prompting Methods on ChatGPT's Mathematical Capabilities | Dec 22, 2023 | ChatbotGSM8K | —Unverified | 0 |
| MMTM: Multi-Tasking Multi-Decoder Transformer for Math Word Problems | Jun 2, 2022 | DecoderMath | —Unverified | 0 |
| MNIST-Fraction: Enhancing Math Education with AI-Driven Fraction Detection and Analysis | Dec 11, 2024 | Math | —Unverified | 0 |
| Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions | Jun 1, 2023 | Math | —Unverified | 0 |
| Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test | Jun 26, 2025 | Code GenerationLarge Language Model | —Unverified | 0 |
| Modeling Student Response Times: Towards Efficient One-on-one Tutoring Dialogues | Nov 1, 2018 | Math | —Unverified | 0 |
| Modelling silicosis: dynamics of a model with piecewise constant rate coefficients | Sep 2, 2021 | Math | —Unverified | 0 |
| Models Can and Should Embrace the Communicative Nature of Human-Generated Math | Sep 25, 2024 | Math | —Unverified | 0 |
| MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models | Feb 20, 2024 | Common Sense ReasoningContrastive Learning | —Unverified | 0 |
| MoL for LLMs: Dual-Loss Optimization to Enhance Domain Expertise While Preserving General Capabilities | May 17, 2025 | Math | —Unverified | 0 |
| Assessing and Verifying Task Utility in LLM-Powered Applications | May 3, 2024 | Math | —Unverified | 0 |
| More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models | May 23, 2025 | DiagnosticHallucination | —Unverified | 0 |
| MSA at BEA 2025 Shared Task: Disagreement-Aware Instruction Tuning for Multi-Dimensional Evaluation of LLMs as Math Tutors | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multi-lingual Functional Evaluation for Large Language Models | Jun 25, 2025 | BelebeleInstruction Following | —Unverified | 0 |
| Multimodal Assessment of Classroom Discourse Quality: A Text-Centered Attention-Based Multi-Task Learning Approach | May 12, 2025 | MathMulti-Task Learning | —Unverified | 0 |
| A Chinese Math Word Problem Solving System Based on Linguistic Theory and Non-statistical Approach | Sep 1, 2020 | MathMath Word Problem Solving | —Unverified | 0 |
| Multi-Stage Pre-Training for Math-Understanding: ^2(AL)BERT | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees | Feb 18, 2025 | Math | —Unverified | 0 |
| Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision | Feb 5, 2024 | GSM8KMath | —Unverified | 0 |
| Multi-tool Integration Application for Math Reasoning Using Large Language Model | Aug 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Ask-Before-Detection: Identifying and Mitigating Conformity Bias in LLM-Powered Error Detector for Math Word Problem Solutions | Dec 22, 2024 | GSM8KMath | —Unverified | 0 |
| Tree-structured Decoding for Solving Math Word Problems | Nov 1, 2019 | Math | —Unverified | 0 |
| A Rule-Based Computational Model of Cognitive Arithmetic | May 3, 2017 | Mathmodel | —Unverified | 0 |
| MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts | Feb 28, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions | May 26, 2025 | AttributeMath | —Unverified | 0 |
| MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MWPRanker: An Expression Similarity Based Math Word Problem Retriever | Jul 3, 2023 | Logical SequenceMath | —Unverified | 0 |
| A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science | Mar 21, 2024 | Active LearningMath | —Unverified | 0 |
| TTT-Bench: A Benchmark for Evaluating Reasoning Ability with Simple and Novel Tic-Tac-Toe-style Games | Jun 11, 2025 | Logical ReasoningMath | —Unverified | 0 |
| NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions | Feb 18, 2025 | Knowledge DistillationMath | —Unverified | 0 |
| Natural- to formal-language generation using Tensor Product Representations | Sep 25, 2019 | DecoderMath | —Unverified | 0 |
| Navigating the Labyrinth: Evaluating and Enhancing LLMs' Ability to Reason About Search Problems | Jun 18, 2024 | In-Context LearningMath | —Unverified | 0 |
| AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning | May 22, 2025 | Mathreinforcement-learning | —Unverified | 0 |
| NEOLAF, an LLM-powered neural-symbolic cognitive architecture | Aug 8, 2023 | Incremental LearningMath | —Unverified | 0 |
| "Turing Tests" For An AI Scientist | May 22, 2024 | AI AgentData Compression | —Unverified | 0 |
| Network psychometrics and cognitive network science open new ways for detecting, understanding and tackling the complexity of math anxiety: A review | Aug 31, 2021 | Math | —Unverified | 0 |
| Neural Math Word Problem Solver with Reinforcement Learning | Aug 1, 2018 | Feature EngineeringMath | —Unverified | 0 |
| Neural Symbolic Reader: Scalable Integration of Distributed and Symbolic Representations for Reading Comprehension | May 1, 2020 | Data AugmentationMath | —Unverified | 0 |
| Neuro-Symbolic Data Generation for Math Reasoning | Dec 6, 2024 | DiversityMath | —Unverified | 0 |
| NLU for Game-based Learning in Real: Initial Evaluations | May 27, 2022 | Intent RecognitionMath | —Unverified | 0 |
| No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Jun 20, 2025 | Mathreinforcement-learning | —Unverified | 0 |
| Arithmetic Reasoning with LLM: Prolog Generation & Permutation | May 28, 2024 | Arithmetic ReasoningData Augmentation | —Unverified | 0 |
| Noisy Deductive Reasoning: How Humans Construct Math, and How Math Constructs Universes | Oct 28, 2020 | MathMathematical Reasoning | —Unverified | 0 |