| Stem-ming the Tide: Predicting STEM attrition using student transcript data | Aug 28, 2017 | BIG-bench Machine LearningMath | —Unverified | 0 | 0 |
| STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing | Nov 1, 2024 | 2kIn-Context Learning | —Unverified | 0 | 0 |
| Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo | Oct 2, 2024 | Math | —Unverified | 0 | 0 |
| xGen-small Technical Report | May 10, 2025 | DecoderMath | —Unverified | 0 | 0 |
| VideoGameBench: Can Vision-Language Models complete popular video games? | May 23, 2025 | Math | —Unverified | 0 | 0 |
| Step Guided Reasoning: Improving Mathematical Reasoning using Guidance Generation and Step Reasoning | Oct 18, 2024 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback | Jan 18, 2025 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| A case study : Influence of Dimension Reduction on regression trees-based Algorithms -Predicting Aeronautics Loads of a Derivative Aircraft | Nov 16, 2018 | Dimensionality ReductionMath | —Unverified | 0 | 0 |
| Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation | May 22, 2023 | Knowledge TracingMath | —Unverified | 0 | 0 |
| Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards | Jun 13, 2025 | MathNavigate | —Unverified | 0 | 0 |
| A Careful Examination of Large Language Model Performance on Grade School Arithmetic | May 1, 2024 | GSM8KLanguage Modeling | —Unverified | 0 | 0 |
| Strictly monotone mean-variance preferences with applications to portfolio selection | Dec 18, 2024 | ManagementMath | —Unverified | 0 | 0 |
| StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs | Dec 23, 2024 | BenchmarkingLogical Reasoning | —Unverified | 0 | 0 |
| A Bayesian model for recognizing handwritten mathematical expressions | Sep 18, 2014 | Mathmodel | —Unverified | 0 | 0 |
| Students' Perceived Roles, Opportunities, and Challenges of a Generative AI-powered Teachable Agent: A Case of Middle School Math Class | Aug 26, 2024 | Math | —Unverified | 0 | 0 |
| VISTA: Visual Integrated System for Tailored Automation in Math Problem Generation Using LLM | Nov 8, 2024 | Math | —Unverified | 0 | 0 |
| Subtle Errors Matter: Preference Learning via Error-injected Self-editing | Oct 9, 2024 | GSM8KMath | —Unverified | 0 | 0 |
| A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications | Jan 9, 2025 | MathRAG | —Unverified | 0 | 0 |
| Supervised Optimism Correction: Be Confident When LLMs Are Sure | Apr 10, 2025 | GSM8KMath | —Unverified | 0 | 0 |
| Sustainable Border Control Policy in the COVID-19 Pandemic: A Math Modeling Study | Aug 28, 2020 | Math | —Unverified | 0 | 0 |
| SVM-based Deep Stacking Networks | Feb 15, 2019 | Math | —Unverified | 0 | 0 |
| SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution | Feb 25, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Visual Analytics of Student Learning Behaviors on K-12 Mathematics E-learning Platforms | Sep 7, 2019 | Math | —Unverified | 0 | 0 |
| Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning | Mar 7, 2025 | GPUMath | —Unverified | 0 | 0 |
| Advancing Process Verification for Large Language Models via Tree-Based Preference Learning | Jun 29, 2024 | Binary ClassificationGSM8K | —Unverified | 0 | 0 |