| StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error | Mar 13, 2025 | Math | CodeCode Available | 0 | 5 |
| Introducing MathQA -- A Math-Aware Question Answering System | Jun 28, 2019 | MathQuestion Answering | CodeCode Available | 0 | 5 |
| Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval | Mar 21, 2022 | Information RetrievalMath | CodeCode Available | 0 | 5 |
| Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions? | May 10, 2024 | Mathtext similarity | CodeCode Available | 0 | 5 |
| Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors | Jul 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning | Oct 16, 2024 | AllGSM8K | CodeCode Available | 0 | 5 |
| GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation | Jun 17, 2024 | Image GenerationMath | CodeCode Available | 0 | 5 |
| AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need | Jun 18, 2025 | GSM8KHumanEval | CodeCode Available | 0 | 5 |
| X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs | May 22, 2025 | ChatbotMath | CodeCode Available | 0 | 5 |
| Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem | Mar 6, 2024 | BenchmarkingHallucination | CodeCode Available | 0 | 5 |
| Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction | Jan 9, 2025 | MathSentence | CodeCode Available | 0 | 5 |
| LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks | Feb 18, 2024 | Math | —Unverified | 0 | 0 |
| Towards Generating Math Word Problems from Equations and Topics | Oct 1, 2019 | Math | —Unverified | 0 | 0 |
| LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades | May 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Automatic Description Construction for Math Expression via Topic Relation Graph | Apr 24, 2021 | MathRelation | —Unverified | 0 | 0 |
| Towards Interpretable Math Word Problem Solving with Grounded Linguistic Logic Reasoning | Nov 16, 2021 | MathMath Word Problem Solving | —Unverified | 0 | 0 |
| "Love is as Complex as Math": Metaphor Generation System for Social Chatbot | Jan 3, 2020 | ChatbotMath | —Unverified | 0 | 0 |
| AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database | May 19, 2025 | Data AugmentationIn-Context Learning | —Unverified | 0 | 0 |
| Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning | Dec 23, 2024 | Arithmetic ReasoningGSM8K | —Unverified | 0 | 0 |
| Machine Translation of Mathematical Text | Oct 11, 2020 | Machine TranslationMath | —Unverified | 0 | 0 |
| Automate Knowledge Concept Tagging on Math Questions with LLMs | Mar 26, 2024 | Few-Shot LearningMath | —Unverified | 0 | 0 |
| Towards Language Agnostic Universal Representations | Sep 23, 2018 | Math | —Unverified | 0 | 0 |
| Towards Math-Aware Automated Classification and Similarity Search of Scientific Publications: Methods of Mathematical Content Representations | Oct 8, 2021 | BIG-bench Machine LearningClassification | —Unverified | 0 | 0 |
| MALT: Improving Reasoning with Multi-Agent LLM Training | Dec 2, 2024 | Common Sense ReasoningGSM8K | —Unverified | 0 | 0 |
| MAmmoTH2: Scaling Instructions from the Web | May 6, 2024 | ChatbotGSM8K | —Unverified | 0 | 0 |
| Automated Systems For Diagnosis of Dysgraphia in Children: A Survey and Novel Framework | Jun 27, 2022 | Math | —Unverified | 0 | 0 |
| Mapping probability word problems to executable representations | Nov 1, 2021 | Contextualised Word RepresentationsMath | —Unverified | 0 | 0 |
| MAPS: A Multilingual Benchmark for Global Agent Performance and Security | May 21, 2025 | Code GenerationMath | —Unverified | 0 | 0 |
| Automated LaTeX Code Generation from Handwritten Math Expressions Using Vision Transformer | Dec 5, 2024 | Code GenerationDecoder | —Unverified | 0 | 0 |
| Mars-PO: Multi-Agent Reasoning System Preference Optimization | Nov 28, 2024 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| Zero-sum repeated games: Counterexamples to the existence of the asymptotic value and the conjecture maxmin=limv_n | May 21, 2013 | Math | —Unverified | 0 | 0 |
| MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching | Jun 3, 2025 | Data AugmentationInstruction Following | —Unverified | 0 | 0 |
| Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models | Mar 13, 2024 | Math | —Unverified | 0 | 0 |
| Mastery Guided Non-parametric Clustering to Scale-up Strategy Prediction | Jan 4, 2024 | ClusteringFairness | —Unverified | 0 | 0 |
| Automated Feedback in Math Education: A Comparative Analysis of LLMs for Open-Ended Responses | Oct 29, 2024 | MathZero-Shot Learning | —Unverified | 0 | 0 |
| MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering | Dec 19, 2022 | Chart Question AnsweringData Summarization | —Unverified | 0 | 0 |
| MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection | Mar 23, 2025 | MathMathematical Problem-Solving | —Unverified | 0 | 0 |
| Math Agents: Computational Infrastructure, Mathematical Embedding, and Genomics | Jul 4, 2023 | Automated Theorem ProvingMath | —Unverified | 0 | 0 |
| AutoBERT-Zero: Evolving BERT Backbone from Scratch | Jul 15, 2021 | Inductive BiasLanguage Modelling | —Unverified | 0 | 0 |
| MathAttack: Attacking Large Language Models Towards Math Solving Ability | Sep 4, 2023 | Adversarial AttackGSM8K | —Unverified | 0 | 0 |
| MathBERT: A Pre-Trained Model for Mathematical Formula Understanding | May 2, 2021 | Headline GenerationInformation Retrieval | —Unverified | 0 | 0 |
| Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective | May 24, 2023 | Decision MakingMath | —Unverified | 0 | 0 |
| A TWO-STAGE FRAMEWORK FOR MATHEMATICAL EXPRESSION RECOGNITION | Sep 25, 2019 | Mathobject-detection | —Unverified | 0 | 0 |
| A Transformer-based Math Language Model for Handwritten Math Expression Recognition | Aug 11, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Towards Spoken Mathematical Reasoning: Benchmarking Speech-based Models over Multi-faceted Math Problems | May 21, 2025 | BenchmarkingMath | —Unverified | 0 | 0 |
| MathDivide: Improved mathematical reasoning by large language models | May 12, 2024 | GSM8KLogical Reasoning | —Unverified | 0 | 0 |
| Mathematical Information Retrieval: Search and Question Answering | Aug 21, 2024 | Information RetrievalMath | —Unverified | 0 | 0 |
| Mathematical Opportunities in Digital Twins (MATH-DT) | Feb 15, 2024 | Math | —Unverified | 0 | 0 |
| MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task | Feb 17, 2025 | Code CompletionGSM8K | —Unverified | 0 | 0 |
| MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs | Feb 26, 2024 | GSM8KMath | —Unverified | 0 | 0 |