| A quantitative study of NLP approaches to question difficulty estimation | May 17, 2023 | Math, Multiple-choice | Code Available | 0 |
| A Probabilistic Model for Node Classification in Directed Graphs | Jan 3, 2025 | Math, Node Classification | Code Available | 0 |
| Inference-time Alignment in Continuous Space | May 26, 2025 | Math | Code Available | 0 |
| WARM: A Weakly (+Semi) Supervised Math Word Problem Solver | Oct 1, 2022 | Math | Code Available | 0 |
| Unlocking Temporal Question Answering for Large Language Models with Tailor-Made Reasoning Logic | May 24, 2023 | Logical Reasoning, Math | Code Available | 0 |
| RESOLVE: Relational Reasoning with Symbolic and Object-Level Features Using Vector Symbolic Processing | Nov 13, 2024 | Decoder, Math | Code Available | 0 |
| Can LLMs Solve longer Math Word Problems Better? | May 23, 2024 | Data Augmentation, Math | Code Available | 0 |
| Tracing and Manipulating Intermediate Values in Neural Math Problem Solvers | Jan 17, 2023 | Math | Code Available | 0 |
| Evaluating and Optimizing Educational Content with Large Language Model Judgments | Mar 5, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL | Sep 21, 2024 | Math, Text to SQL | Code Available | 0 |
| Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning | Feb 11, 2025 | Code Generation, Math | Code Available | 0 |
| MathScale: Scaling Instruction Tuning for Mathematical Reasoning | Mar 5, 2024 | GSM8K, Math | Code Available | 0 |
| MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark | Aug 14, 2024 | Math, Mathematical Reasoning | Code Available | 0 |
| Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance | Oct 3, 2023 | Code Generation, Logical Reasoning | Code Available | 0 |
| Instructing Large Language Models to Identify and Ignore Irrelevant Conditions | Mar 19, 2024 | Math, Mathematical Reasoning | Code Available | 0 |
| Rethinking floating point for deep learning | Nov 1, 2018 | Deep Learning, Math | Code Available | 0 |
| MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning | Feb 27, 2024 | 8k, Language Modeling | Code Available | 0 |
| Integer Quantization for Deep Learning Inference: Principles and Empirical Evaluation | Apr 20, 2020 | Deep Learning, Math | Code Available | 0 |
| Intel nGraph: An Intermediate Representation, Compiler, and Executor for Deep Learning | Jan 24, 2018 | CPU, Deep Learning | Code Available | 0 |
| Public Attitudes Toward ChatGPT on Twitter: Sentiments, Topics, and Occupations | Jun 22, 2023 | Chatbot, Language Modelling | Code Available | 0 |
| Coarse-grained Stochastic Model of Myosin-Driven Vesicles into Dendritic Spines | Jul 15, 2021 | Math | Code Available | 0 |
| Solving Arithmetic Word Problems Automatically Using Transformer and Unambiguous Representations | Dec 2, 2019 | Math | Code Available | 0 |
| Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS | Nov 27, 2024 | In-Context Learning, Math | Code Available | 0 |
| Can LLMs Reason in the Wild with Programs? | Jun 19, 2024 | GSM8K, Math | Code Available | 0 |
| Unbiased Math Word Problems Benchmark for Mitigating Solving Bias | May 17, 2022 | Math | Code Available | 0 |
| Can We Use Small Models to Investigate Multimodal Fusion Methods? | Sep 1, 2022 | Math | Code Available | 0 |
| Introducing MathQA -- A Math-Aware Question Answering System | Jun 28, 2019 | Math, Question Answering | Code Available | 0 |
| Mathematical Formalized Problem Solving and Theorem Proving in Different Fields in Lean 4 | Sep 9, 2024 | Abstract Algebra, Automated Theorem Proving | Code Available | 0 |
| Math Word Problem Solving by Generating Linguistic Variants of Problem Statements | Jun 24, 2023 | Decoder, Ingenuity | Code Available | 0 |
| Introduction To The Monogenic Signal | Mar 27, 2017 | Math | Code Available | 0 |
| Invariance Makes LLM Unlearning Resilient Even to Unanticipated Downstream Fine-Tuning | Jun 2, 2025 | Machine Unlearning, Math | Code Available | 0 |
| A Context-Enhanced Framework for Sequential Graph Reasoning | Dec 12, 2024 | Math | Code Available | 0 |
| Investigating Math Word Problems using Pretrained Multilingual Language Models | May 19, 2021 | Machine Translation, Math | Code Available | 0 |
| Solving for X and Beyond: Can Large Language Models Solve Complex Math Problems with More-Than-Two Unknowns? | Jul 6, 2024 | Math | Code Available | 0 |
| Reverse Operation based Data Augmentation for Solving Math Word Problems | Oct 4, 2020 | Data Augmentation, Math | Code Available | 0 |
| MAWPS: A Math Word Problem Repository | Jun 1, 2016 | Math, Math Word Problem Solving | Code Available | 0 |
| EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbers | May 1, 2022 | Math, Math Word Problem Solving | Code Available | 0 |
| Cutting Through the Noise: Boosting LLM Performance on Math Word Problems | May 30, 2024 | 8k, Math | Code Available | 0 |
| mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models | Jun 4, 2024 | Math | Code Available | 0 |
| CodeT5+: Open Code Large Language Models for Code Understanding and Generation | May 13, 2023 | Arithmetic Reasoning, Code Completion | Code Available | 0 |
| MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained Classification | Apr 7, 2024 | Image Comprehension, Math | Code Available | 0 |
| Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models | Mar 27, 2025 | Data Visualization, Math | Code Available | 0 |
| Unsupervised learning-based calibration scheme for Rough Bergomi model | Dec 3, 2024 | Math | Code Available | 0 |
| Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision | Jan 14, 2025 | Instruction Following, Math | Code Available | 0 |
| A mixed policy to improve performance of language models on math problems | Jul 17, 2023 | GSM8K, Math | Code Available | 0 |
| Teaching Machines to Code: Neural Markup Generation with Visual Attention | Feb 15, 2018 | Math, Optical Character Recognition (OCR) | Code Available | 0 |
| Solving Math Word Problems with Multi-Encoders and Multi-Decoders | Dec 1, 2020 | Decoder, Math | Code Available | 0 |
| CoinMath: Harnessing the Power of Coding Instruction for Math LLMs | Dec 16, 2024 | Descriptive, Math | Code Available | 0 |
| ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark | May 28, 2025 | Math | Code Available | 0 |
| Solving Math Word Problems with Reexamination | Oct 14, 2023 | Descriptive, Math | Code Available | 0 |