| Translating Math Formula Images to LaTeX Sequences Using Deep Neural Networks with Sequence-level Training | Aug 29, 2019 | DecoderLanguage Modelling | CodeCode Available | 0 |
| DIVE: Diversified Iterative Self-Improvement | Jan 1, 2025 | DiversityGSM8K | CodeCode Available | 0 |
| Scheherazade: Evaluating Chain-of-Thought Math Reasoning in LLMs with Chain-of-Problems | Sep 30, 2024 | GSM8KMath | CodeCode Available | 0 |
| Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions | Jul 1, 2019 | Deep LearningMath | CodeCode Available | 0 |
| Learning Non-linguistic Skills without Sacrificing Linguistic Proficiency | May 14, 2023 | Arithmetic ReasoningMath | CodeCode Available | 0 |
| EquivPruner: Boosting Efficiency and Quality in LLM-Based Search via Action Pruning | May 22, 2025 | GSM8KMath | CodeCode Available | 0 |
| ComSearch: Equation Searching with Combinatorial Strategy for Solving Math Word Problems with Weak Supervision | Oct 13, 2022 | Math | CodeCode Available | 0 |
| Seeking Diverse Reasoning Logic: Controlled Equation Expression Generation for Solving Math Word Problems | Sep 21, 2022 | Math | CodeCode Available | 0 |
| Towards Infinite-Long Prefix in Transformer | Jun 20, 2024 | Mathparameter-efficient fine-tuning | CodeCode Available | 0 |
| An Independent Evaluation of ChatGPT on Mathematical Word Problems (MWP) | Feb 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Faithful Chain-of-Thought Reasoning | Jan 31, 2023 | MathMulti-hop Question Answering | CodeCode Available | 0 |
| Techniques to Improve Neural Math Word Problem Solvers | Feb 6, 2023 | DecoderLanguage Modelling | CodeCode Available | 0 |
| DyRRen: A Dynamic Retriever-Reranker-Generator Model for Numerical Reasoning over Tabular and Textual Data | Nov 23, 2022 | MathReranking | CodeCode Available | 0 |
| Learning to Solve Geometry Problems via Simulating Human Dual-Reasoning Process | May 10, 2024 | Geometry Problem SolvingMachine Translation | CodeCode Available | 0 |
| More is More: Addition Bias in Large Language Models | Sep 4, 2024 | MathText Summarization | CodeCode Available | 0 |
| SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving | Oct 19, 2023 | GSM8KMath | CodeCode Available | 0 |
| Decomposing Elements of Problem Solving: What "Math" Does RL Teach? | May 28, 2025 | MathMathematical Problem-Solving | CodeCode Available | 0 |
| A Goal-Driven Tree-Structured Neural Model for Math Word Problems | Aug 10, 2019 | MathMath Word Problem Solving | CodeCode Available | 0 |
| Fill in the Blank: Exploring and Enhancing LLM Capabilities for Backward Reasoning in Math Word Problems | Oct 3, 2023 | GSM8KMath | CodeCode Available | 0 |
| TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation | Feb 19, 2025 | Dataset GenerationGSM8K | CodeCode Available | 0 |
| Prover-Verifier Games improve legibility of LLM outputs | Jul 18, 2024 | Math | CodeCode Available | 0 |
| Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models | May 15, 2025 | Large Language ModelMath | CodeCode Available | 0 |
| ConciseRL: Conciseness-Guided Reinforcement Learning for Efficient Reasoning Models | May 22, 2025 | Large Language ModelMath | CodeCode Available | 0 |
| FINNger -- Applying artificial intelligence to ease math learning for children | May 26, 2021 | Hand Pose EstimationMath | CodeCode Available | 0 |
| Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression Recognition | Jan 5, 2018 | DecoderHandwritten Mathmatical Expression Recognition | CodeCode Available | 0 |
| Leveraging Label Semantics and Meta-Label Refinement for Multi-Label Question Classification | Nov 4, 2024 | MathReranking | CodeCode Available | 0 |
| PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning | May 14, 2025 | MathMathematical Problem-Solving | CodeCode Available | 0 |
| An algorithm to represent inbreeding trees | Sep 21, 2020 | Math | CodeCode Available | 0 |
| What Makes Math Word Problems Challenging for LLMs? | Mar 17, 2024 | Math | CodeCode Available | 0 |
| Leveraging Training Data in Few-Shot Prompting for Numerical Reasoning | May 29, 2023 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search | Jun 6, 2025 | Large Language ModelMath | CodeCode Available | 0 |
| Leveraging Web-Crawled Data for High-Quality Fine-Tuning | Aug 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error | Mar 13, 2025 | Math | CodeCode Available | 0 |
| Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models | Apr 1, 2024 | In-Context LearningMath | CodeCode Available | 0 |
| Library Learning Doesn't: The Curious Case of the Single-Use "Library" | Oct 26, 2024 | MathMathematical Reasoning | CodeCode Available | 0 |
| AutoMSC: Automatic Assignment of Mathematics Subject Classification Labels | May 25, 2020 | ArticlesClassification | CodeCode Available | 0 |
| From Euler to AI: Unifying Formulas for Mathematical Constants | Feb 24, 2025 | Math | CodeCode Available | 0 |
| A safety realignment framework via subspace-oriented model fusion for large language models | May 15, 2024 | Instruction FollowingMath | CodeCode Available | 0 |
| TreeRPO: Tree Relative Policy Optimization | Jun 5, 2025 | Math | CodeCode Available | 0 |
| A large language model-assisted education tool to provide feedback on open-ended responses | Jul 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning | Feb 24, 2025 | MathMathematical Reasoning | CodeCode Available | 0 |
| DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions | Jun 27, 2024 | Distractor GenerationMath | CodeCode Available | 0 |
| Automatic Short Math Answer Grading via In-context Meta-learning | May 30, 2022 | automatic short answer gradingIn-Context Learning | CodeCode Available | 0 |
| The Matrix Calculus You Need For Deep Learning | Feb 5, 2018 | ArticlesDeep Learning | CodeCode Available | 0 |
| An extrapolated and provably convergent algorithm for nonlinear matrix decomposition with the ReLU function | Mar 31, 2025 | Data CompressionMath | CodeCode Available | 0 |
| Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors | Jul 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection | Oct 3, 2024 | Mathparameter-efficient fine-tuning | CodeCode Available | 0 |
| Taxonomy of Mathematical Plagiarism | Jan 30, 2024 | MathQuestion Answering | CodeCode Available | 0 |
| LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation | Dec 10, 2024 | Math | CodeCode Available | 0 |
| GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks | Feb 20, 2025 | Code GenerationMath | CodeCode Available | 0 |