| Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models | Feb 27, 2025 | Mathematical ReasoningMulti-Armed Bandits | —Unverified | 0 | 0 |
| MetaRuleGPT: Recursive Numerical Reasoning of Language Models Trained with Simple Rules | Dec 18, 2024 | Mathematical ReasoningMeta-Learning | —Unverified | 0 | 0 |
| MIND: Math Informed syNthetic Dialogues for Pretraining LLMs | Oct 15, 2024 | GSM8KMath | —Unverified | 0 | 0 |
| Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning | May 20, 2025 | Logical ReasoningMathematical Reasoning | —Unverified | 0 | 0 |
| MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning | Jul 16, 2023 | Knowledge DistillationMathematical Reasoning | —Unverified | 0 | 0 |
| Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning | Mar 17, 2025 | Mathematical ReasoningMultimodal Reasoning | —Unverified | 0 | 0 |
| MMTM: Multi-Tasking Multi-Decoder Transformer for Math Word Problems | Jun 2, 2022 | DecoderMath | —Unverified | 0 | 0 |
| Modeling Intelligent Decision Making Command And Control Agents: An Application to Air Defense | Mar 20, 2019 | Decision MakingMathematical Reasoning | —Unverified | 0 | 0 |
| Multi2: Multi-Agent Test-Time Scalable Framework for Multi-Document Processing | Feb 27, 2025 | Document SummarizationLarge Language Model | —Unverified | 0 | 0 |
| Multi-Layer GRPO: Enhancing Reasoning and Self-Correction in Large Language Models | Jun 5, 2025 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models | Mar 1, 2024 | BenchmarkingMathematical Reasoning | —Unverified | 0 | 0 |
| Multi-tool Integration Application for Math Reasoning Using Large Language Model | Aug 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts | Feb 28, 2025 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| MWPRanker: An Expression Similarity Based Math Word Problem Retriever | Jul 3, 2023 | Logical SequenceMath | —Unverified | 0 | 0 |
| Neuro-Symbolic Data Generation for Math Reasoning | Dec 6, 2024 | DiversityMath | —Unverified | 0 | 0 |
| Noisy Deductive Reasoning: How Humans Construct Math, and How Math Constructs Universes | Oct 28, 2020 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| None of the Above, Less of the Right: Parallel Patterns between Humans and LLMs on Multi-Choice Questions Answering | Mar 3, 2025 | Business EthicsEthics | —Unverified | 0 | 0 |
| Notes on a Path to AI Assistance in Mathematical Reasoning | Oct 4, 2023 | Mathematical Reasoning | —Unverified | 0 | 0 |
| No Train Still Gain. Unleash Mathematical Reasoning of Large Language Models with Monte Carlo Tree Search Guided by Energy Function | Sep 1, 2023 | GSM8KMathematical Reasoning | —Unverified | 0 | 0 |
| Novice Learner and Expert Tutor: Evaluating Math Reasoning Abilities of Large Language Models with Misconceptions | Oct 3, 2023 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks | Apr 12, 2022 | Arithmetic ReasoningMathematical Reasoning | —Unverified | 0 | 0 |
| Olapa-MCoT: Enhancing the Chinese Mathematical Reasoning Capability of LLMs | Dec 29, 2023 | Mathematical Reasoning | —Unverified | 0 | 0 |
| One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs | Feb 12, 2025 | Mathematical Reasoning | —Unverified | 0 | 0 |
| On-Policy RL with Optimal Reward Baseline | May 29, 2025 | Large Language ModelMathematical Reasoning | —Unverified | 0 | 0 |
| On the meaning of uncertainty for ethical AI: philosophy and practice | Sep 11, 2023 | Decision MakingMathematical Reasoning | —Unverified | 0 | 0 |
| OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety | Mar 18, 2024 | BenchmarkingMathematical Reasoning | —Unverified | 0 | 0 |
| Optimizing Alignment with Less: Leveraging Data Augmentation for Personalized Evaluation | Dec 10, 2024 | Data AugmentationMathematical Reasoning | —Unverified | 0 | 0 |
| Optimizing Numerical Estimation and Operational Efficiency in the Legal Domain through Large Language Models | Jul 26, 2024 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Orca 2: Teaching Small Language Models How to Reason | Nov 18, 2023 | Arithmetic ReasoningCommon Sense Reasoning | —Unverified | 0 | 0 |
| OSoRA: Output-Dimension and Singular-Value Initialized Low-Rank Adaptation | May 20, 2025 | Common Sense ReasoningMathematical Reasoning | —Unverified | 0 | 0 |
| PARAMANU-GANITA: Language Model with Mathematical Capabilities | Apr 22, 2024 | Domain AdaptationGSM8K | —Unverified | 0 | 0 |
| Parameter-Efficient Checkpoint Merging via Metrics-Weighted Averaging | Apr 23, 2025 | Mathematical Reasoningparameter-efficient fine-tuning | —Unverified | 0 | 0 |
| Path-Consistency: Prefix Enhancement for Efficient Inference in LLM | Aug 25, 2024 | Code GenerationCommon Sense Reasoning | —Unverified | 0 | 0 |
| Path Planning for Masked Diffusion Model Sampling | Feb 5, 2025 | Code GenerationIn-Context Learning | —Unverified | 0 | 0 |
| Pensez: Less Data, Better Reasoning -- Rethinking French LLM | Mar 17, 2025 | Large Language ModelMath | —Unverified | 0 | 0 |
| PhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal Models | Jun 21, 2025 | Mathematical ReasoningMultiple-choice | —Unverified | 0 | 0 |
| Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information | Mar 7, 2025 | Geometry Problem SolvingMathematical Reasoning | —Unverified | 0 | 0 |
| Plug-and-Play Training Framework for Preference Optimization | Dec 30, 2024 | Mathematical ReasoningQuestion Answering | —Unverified | 0 | 0 |
| Policy Guided Tree Search for Enhanced LLM Reasoning | Feb 4, 2025 | Mathematical ReasoningNavigate | —Unverified | 0 | 0 |
| PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts | Apr 25, 2025 | DiversityMathematical Reasoning | —Unverified | 0 | 0 |
| PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness | Oct 9, 2024 | Mathematical Reasoning | —Unverified | 0 | 0 |
| PPT: A Process-based Preference Learning Framework for Self Improving Table Question Answering Models | May 23, 2025 | Code GenerationMathematical Reasoning | —Unverified | 0 | 0 |
| Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs | Feb 4, 2025 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| PREMISE: Scalable and Strategic Prompt Optimization for Efficient Mathematical Reasoning in Large Models | Jun 12, 2025 | GSM8KMathematical Reasoning | —Unverified | 0 | 0 |
| Pre-trained Large Language Models Use Fourier Features to Compute Addition | Jun 5, 2024 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Probabilistic Results on the Architecture of Mathematical Reasoning Aligned by Cognitive Alternation | Aug 17, 2023 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models | Nov 19, 2024 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Process or Result? Manipulated Ending Tokens Can Mislead Reasoning LLMs to Ignore the Correct Reasoning Steps | Mar 25, 2025 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Progress or Regress? Self-Improvement Reversal in Post-training | Jul 6, 2024 | DiversityMathematical Reasoning | —Unverified | 0 | 0 |
| Prompt Selection and Augmentation for Few Examples Code Generation in Large Language Model and its Application in Robotics Control | Mar 11, 2024 | Code GenerationDiversity | —Unverified | 0 | 0 |