| Building Math Agents with Multi-Turn Iterative Preference Learning | Sep 4, 2024 | GSM8KMath | —Unverified | 0 |
| A Bayesian model for recognizing handwritten mathematical expressions | Sep 18, 2014 | Mathmodel | —Unverified | 0 |
| Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning | Dec 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles | May 26, 2025 | ARCLogical Reasoning | —Unverified | 0 |
| A Perspective on Large Language Models, Intelligent Machines, and Knowledge Acquisition | Aug 13, 2024 | Common Sense ReasoningMath | —Unverified | 0 |
| APE-Bench I: Towards File-level Automated Proof Engineering of Formal Math Libraries | Apr 27, 2025 | Automated Theorem ProvingBug fixing | —Unverified | 0 |
| AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling | Dec 19, 2024 | Math | —Unverified | 0 |
| Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search | Jun 10, 2025 | GSM8KMath | —Unverified | 0 |
| GraphReason: Enhancing Reasoning Capabilities of Large Language Models through A Graph-Based Verification Approach | Aug 18, 2023 | Math | —Unverified | 0 |
| BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Introduction to Coresets: Accurate Coresets | Oct 19, 2019 | Math | —Unverified | 0 |
| Enhancing Math Learning in an LMS Using AI-Driven Question Recommendations | Apr 18, 2025 | ManagementMath | —Unverified | 0 |
| Enhancing Mathematical Reasoning in LLMs with Background Operators | Dec 5, 2024 | Data AugmentationMath | —Unverified | 0 |
| Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens | Oct 18, 2024 | MathQuestion Answering | —Unverified | 0 |
| Enhancing LLM Intelligence with ARM-RAG: Auxiliary Rationale Memory for Retrieval Augmented Generation | Nov 7, 2023 | MathRAG | —Unverified | 0 |
| Bridging Offline and Online Reinforcement Learning for LLMs | Jun 26, 2025 | Instruction FollowingMath | —Unverified | 0 |
| Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption | Feb 17, 2025 | BenchmarkingCode Summarization | —Unverified | 0 |
| End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics | Nov 7, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach | Jan 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Breaking Ties: Regression Discontinuity Design Meets Market Design | Dec 31, 2020 | Mathregression | —Unverified | 0 |
| 構建一個中文國小數學文字問題語料庫(Building a Corpus for Developing the Chinese Elementary School Math Word Problem Solver)[In Chinese] | Oct 1, 2016 | Math | —Unverified | 0 |
| AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy | Jun 16, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 |
| Investigating Large Language Models in Diagnosing Students' Cognitive Skills in Math Problem-solving | Apr 1, 2025 | Math | —Unverified | 0 |
| Enabling Massive Deep Neural Networks with the GraphBLAS | Aug 9, 2017 | Math | —Unverified | 0 |
| Empowering Bengali Education with AI: Solving Bengali Math Word Problems through Transformer Models | Jan 5, 2025 | Math | —Unverified | 0 |
| Empirical entropy, minimax regret and minimax risk | Aug 6, 2013 | Mathregression | —Unverified | 0 |
| Emergent inabilities? Inverse scaling over the course of pretraining | May 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | Mar 12, 2024 | Arithmetic ReasoningCode Generation | —Unverified | 0 |
| An upper bound of the mutation probability in the genetic algorithm for general 0-1 knapsack problem | Mar 17, 2024 | DiversityEvolutionary Algorithms | —Unverified | 0 |
| Embracing AI in Education: Understanding the Surge in Large Language Model Use by Secondary Students | Nov 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Embedding Self-Correction as an Inherent Ability in Large Language Models for Enhanced Mathematical Reasoning | Oct 14, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| Brains vs. Bytes: Evaluating LLM Proficiency in Olympiad Mathematics | Apr 1, 2025 | MathMathematical Problem-Solving | —Unverified | 0 |
| Embedded Phase Shifting: Robust Phase Shifting With Embedded Signals | Jun 1, 2015 | MathQuantization | —Unverified | 0 |
| A novel variational model for image registration using Gaussian curvature | Apr 28, 2015 | Image RegistrationMath | —Unverified | 0 |
| 1bit-Merging: Dynamic Quantized Merging for Large Language Models | Feb 15, 2025 | Code GenerationMath | —Unverified | 0 |
| Efficient Tool Use with Chain-of-Abstraction Reasoning | Jan 30, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| A note on the option price and 'Mass at zero in the uncorrelated SABR model and implied volatility asymptotics' | Nov 1, 2020 | MathNumerical Integration | —Unverified | 0 |
| An Optimal Transport approach to arbitrage correction: Application to volatility Stress-Tests | Jan 21, 2025 | Math | —Unverified | 0 |
| Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning | Dec 14, 2023 | Arithmetic ReasoningFew-Shot Learning | —Unverified | 0 |
| Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards | Jun 13, 2025 | MathNavigate | —Unverified | 0 |
| Intriguing Properties of Large Language and Vision Models | Oct 7, 2024 | cross-modal alignmentLarge Language Model | —Unverified | 0 |
| Introducing the Mathematics Meme Repository | Oct 19, 2021 | Math | —Unverified | 0 |
| Investigating Math Word Problems using Pretrained Multilingual Language Models | Jan 16, 2022 | Machine TranslationMath | —Unverified | 0 |
| Kappa Learning: A New Method for Measuring Similarity Between Educational Items Using Performance Data | Dec 20, 2018 | ClusteringMath | —Unverified | 0 |
| Effects of context, complexity, and clustering on evaluation for math formula retrieval | Nov 20, 2021 | ClusteringMath | —Unverified | 0 |
| Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving | Feb 12, 2025 | Mathmultimodal interaction | —Unverified | 0 |
| BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation | Feb 6, 2025 | In-Context LearningKnowledge Distillation | —Unverified | 0 |
| An Optimal Likelihood Free Method for Biological Model Selection | Aug 3, 2022 | Drug DiscoveryMath | —Unverified | 0 |
| Interleaved Reasoning for Large Language Models via Reinforcement Learning | May 26, 2025 | Logical ReasoningMath | —Unverified | 0 |
| EasyMath: A 0-shot Math Benchmark for SLMs | May 20, 2025 | Math | —Unverified | 0 |