| Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing | Jul 15, 2025 | Knowledge TracingMath | CodeCode Available | 0 | 5 |
| PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning | May 14, 2025 | MathMathematical Problem-Solving | CodeCode Available | 0 | 5 |
| Decomposing Elements of Problem Solving: What "Math" Does RL Teach? | May 28, 2025 | MathMathematical Problem-Solving | CodeCode Available | 0 | 5 |
| Analysis of Optimization Algorithms via Sum-of-Squares | Jun 11, 2019 | Math | CodeCode Available | 0 | 5 |
| OntoMath^PRO Ontology: A Linked Data Hub for Mathematics | Jul 17, 2014 | Math | CodeCode Available | 0 | 5 |
| Automatic Generation of Headlines for Online Math Questions | Nov 27, 2019 | Math | CodeCode Available | 0 | 5 |
| Analogical Math Word Problems Solving with Enhanced Problem-Solution Association | Dec 1, 2022 | MathQuestion Answering | CodeCode Available | 0 | 5 |
| HAPO: Training Language Models to Reason Concisely via History-Aware Policy Optimization | May 16, 2025 | Math | CodeCode Available | 0 | 5 |
| NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models | Jun 5, 2024 | MathMathematical Reasoning | CodeCode Available | 0 | 5 |
| Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning | Oct 16, 2024 | AllGSM8K | CodeCode Available | 0 | 5 |
| One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks | Oct 14, 2024 | FairnessGSM8K | CodeCode Available | 0 | 5 |
| Neural Machine Translation and Sequence-to-sequence Models: A Tutorial | Mar 5, 2017 | Machine TranslationMath | CodeCode Available | 0 | 5 |
| A mixed policy to improve performance of language models on math problems | Jul 17, 2023 | GSM8KMath | CodeCode Available | 0 | 5 |
| Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying | Dec 19, 2024 | MathMathematical Reasoning | CodeCode Available | 0 | 5 |
| Automated Knowledge Concept Annotation and Question Representation Learning for Knowledge Tracing | Oct 2, 2024 | Contrastive LearningKnowledge Tracing | CodeCode Available | 0 | 5 |
| A Meaning-based Statistical English Math Word Problem Solver | Mar 16, 2018 | Math | CodeCode Available | 0 | 5 |
| Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression Recognition | Jan 5, 2018 | DecoderHandwritten Mathmatical Expression Recognition | CodeCode Available | 0 | 5 |
| Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math Textbooks | Jul 30, 2023 | MathOptical Character Recognition | CodeCode Available | 0 | 5 |
| Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions | Jul 1, 2019 | Deep LearningMath | CodeCode Available | 0 | 5 |
| Continual Pre-training of Language Models for Math Problem Understanding with Syntax-Aware Memory Network | May 1, 2022 | Math | CodeCode Available | 0 | 5 |
| MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs | Nov 14, 2024 | General KnowledgeMath | CodeCode Available | 0 | 5 |
| Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models | May 30, 2025 | MathMultiple-choice | CodeCode Available | 0 | 5 |
| MMATH: A Multilingual Benchmark for Mathematical Reasoning | May 25, 2025 | MathMathematical Reasoning | CodeCode Available | 0 | 5 |
| More is More: Addition Bias in Large Language Models | Sep 4, 2024 | MathText Summarization | CodeCode Available | 0 | 5 |
| ATHENA: Mathematical Reasoning with Thought Expansion | Nov 2, 2023 | MathMathematical Reasoning | CodeCode Available | 0 | 5 |
| Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia | Oct 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| MIRB: Mathematical Information Retrieval Benchmark | May 21, 2025 | Automated Theorem ProvingInformation Retrieval | CodeCode Available | 0 | 5 |
| Meta-Reasoning Improves Tool Use in Large Language Models | Nov 7, 2024 | Math | CodeCode Available | 0 | 5 |
| How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study | May 21, 2025 | Math | CodeCode Available | 0 | 5 |
| How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark | May 24, 2025 | Math | CodeCode Available | 0 | 5 |
| metboost: Exploratory regression analysis with hierarchically clustered data | Feb 13, 2017 | MathMissing Values | CodeCode Available | 0 | 5 |
| How Do Humans Write Code? Large Models Do It the Same Way Too | Feb 24, 2024 | Code GenerationMath | CodeCode Available | 0 | 5 |
| ConciseRL: Conciseness-Guided Reinforcement Learning for Efficient Reasoning Models | May 22, 2025 | Large Language ModelMath | CodeCode Available | 0 | 5 |
| Misplaced Trust: Measuring the Interference of Machine Learning in Human Decision-Making | May 22, 2020 | BIG-bench Machine LearningDecision Making | CodeCode Available | 0 | 5 |
| mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models | Jun 4, 2024 | Math | CodeCode Available | 0 | 5 |
| MAWPS: A Math Word Problem Repository | Jun 1, 2016 | MathMath Word Problem Solving | CodeCode Available | 0 | 5 |
| Heteroclinic cycling and extinction in May-Leonard models with demographic stochasticity | Nov 10, 2021 | MathUnity | CodeCode Available | 0 | 5 |
| ComSearch: Equation Searching with Combinatorial Strategy for Solving Math Word Problems with Weak Supervision | Oct 13, 2022 | Math | CodeCode Available | 0 | 5 |
| Math Word Problem Solving by Generating Linguistic Variants of Problem Statements | Jun 24, 2023 | DecoderIngenuity | CodeCode Available | 0 | 5 |
| Algebra Error Classification with Large Language Models | May 8, 2023 | ClassificationMath | CodeCode Available | 0 | 5 |
| Helpful assistant or fruitful facilitator? Investigating how personas affect language model behavior | Jul 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark | May 28, 2025 | Math | CodeCode Available | 0 | 5 |
| MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning | Feb 27, 2024 | 8kLanguage Modeling | CodeCode Available | 0 | 5 |
| Computationally Identifying Funneling and Focusing Questions in Classroom Discourse | Jul 8, 2022 | Math | CodeCode Available | 0 | 5 |
| MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark | Aug 14, 2024 | MathMathematical Reasoning | CodeCode Available | 0 | 5 |
| Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models | May 26, 2025 | Contrastive LearningMath | CodeCode Available | 0 | 5 |
| Compositional Processing Emerges in Neural Networks Solving Math Problems | May 19, 2021 | MathMathematical Reasoning | CodeCode Available | 0 | 5 |
| MathScale: Scaling Instruction Tuning for Mathematical Reasoning | Mar 5, 2024 | GSM8KMath | CodeCode Available | 0 | 5 |
| HARDMath2: A Benchmark for Applied Mathematics Built by Students as Part of a Graduate Class | May 17, 2025 | MathMathematical Problem-Solving | CodeCode Available | 0 | 5 |
| Complex Mathematical Symbol Definition Structures: A Dataset and Model for Coordination Resolution in Definition Extraction | May 24, 2023 | Definition ExtractionMath | CodeCode Available | 0 | 5 |