| TheoremQA: A Theorem-driven Question Answering dataset | May 21, 2023 | Math, Question Answering | Code Available | 1 |
| Non-Autoregressive Math Word Problem Solver with Unified Tree Structure | May 8, 2023 | Math, valid | Code Available | 1 |
| Solving Math Word Problems by Combining Language Models With Symbolic Solvers | Apr 16, 2023 | GSM8K, Language Modeling | Code Available | 1 |
| From Zero to Hero: Convincing with Extremely Complicated Math | Apr 1, 2023 | Math | Code Available | 1 |
| How well do Large Language Models perform in Arithmetic tasks? | Mar 16, 2023 | Math | Code Available | 1 |
| SALSA PICANTE: a machine learning attack on LWE with binary secrets | Mar 7, 2023 | Math | Code Available | 1 |
| MathPrompter: Mathematical Reasoning using Large Language Models | Mar 4, 2023 | Arithmetic Reasoning, Math | Code Available | 1 |
| LEVER: Learning to Verify Language-to-Code Generation with Execution | Feb 16, 2023 | Arithmetic Reasoning, Code Generation | Code Available | 1 |
| Tree-Based Representation and Generation of Natural and Mathematical Language | Feb 15, 2023 | Math, Mathematical Reasoning | Code Available | 1 |
| A Categorical Archive of ChatGPT Failures | Feb 6, 2023 | Math | Code Available | 1 |
| Large Language Models Can Be Easily Distracted by Irrelevant Context | Jan 31, 2023 | Arithmetic Reasoning, Language Modeling | Code Available | 1 |
| Mathematical Capabilities of ChatGPT | Jan 31, 2023 | Elementary Mathematics, Math | Code Available | 1 |
| Can an AI Win Ghana's National Science and Maths Quiz? An AI Grand Challenge for Education | Jan 30, 2023 | Math, Position | Code Available | 1 |
| Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning | Jan 27, 2023 | Few-Shot Learning, GSM8K | Code Available | 1 |
| UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression | Dec 6, 2022 | Geometry Problem Solving, Logical Reasoning | Code Available | 1 |
| Automatic Generation of Socratic Subquestions for Teaching Math Word Problems | Nov 23, 2022 | Math, Math Word Problem Solving | Code Available | 1 |
| The NCTE Transcripts: A Dataset of Elementary Math Classroom Transcripts | Nov 21, 2022 | Elementary Mathematics, Math | Code Available | 1 |
| Mining Mathematical Documents for Question Answering via Unsupervised Formula Labeling | Nov 12, 2022 | Entity Linking, Knowledge Graphs | Code Available | 1 |
| What is my math transformer doing? -- Three results on interpretability and generalization | Oct 31, 2022 | Math | Code Available | 1 |
| Solving Math Word Problems via Cooperative Reasoning induced Language Models | Oct 28, 2022 | Arithmetic Reasoning, Math | Code Available | 1 |
| Broken Neural Scaling Laws | Oct 26, 2022 | Adversarial Robustness, Continual Learning | Code Available | 1 |
| A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models | Oct 21, 2022 | Math, Mathematical Reasoning | Code Available | 1 |
| Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning | Sep 29, 2022 | Logical Reasoning, Math | Code Available | 1 |
| FormulaNet: A Benchmark Dataset for Mathematical Formula Detection | Aug 29, 2022 | Math | Code Available | 1 |
| CLEVR-Math: A Dataset for Compositional Language, Visual and Mathematical Reasoning | Aug 10, 2022 | Math, Mathematical Reasoning | Code Available | 1 |
| JiuZhang: A Chinese Pre-trained Language Model for Mathematical Problem Understanding | Jun 13, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Building Dataset for Grounding of Formulae — Annotating Coreference Relations Among Math Identifiers | Jun 1, 2022 | Math | Code Available | 1 |
| ArMATH: a Dataset for Solving Arabic Math Word Problems | Jun 1, 2022 | Deep Learning, Math | Code Available | 1 |
| Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions | May 28, 2022 | Arithmetic Reasoning, Efficient Exploration | Code Available | 1 |
| Math-KG: Construction and Applications of Mathematical Knowledge Graph | May 8, 2022 | Math | Code Available | 1 |
| The TalkMoves Dataset: K-12 Mathematics Lesson Transcripts Annotated for Teacher and Student Discursive Moves | Apr 6, 2022 | Math, Sentence | Code Available | 1 |
| Self-Consistency Improves Chain of Thought Reasoning in Language Models | Mar 21, 2022 | ARC, Arithmetic Reasoning | Code Available | 1 |
| Learning to Reason Deductively: Math Word Problem Solving as Complex Relation Extraction | Mar 19, 2022 | Math, Math Word Problem Solving | Code Available | 1 |
| Training and Evaluating a Jupyter Notebook Data Science Assistant | Jan 30, 2022 | Math | Code Available | 1 |
| A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot Learning at Human Level | Dec 31, 2021 | Few-Shot Learning, Language Modelling | Code Available | 1 |
| Seeking Patterns, Not just Memorizing Procedures: Contrastive Learning for Solving Math Word Problems | Oct 16, 2021 | Contrastive Learning, Math | Code Available | 1 |
| Pretrained Language Models are Symbolic Mathematics Solvers too! | Oct 7, 2021 | Ingenuity, Language Modelling | Code Available | 1 |
| Recall and Learn: A Memory-augmented Solver for Math Word Problems | Sep 27, 2021 | Math, Math Word Problem Solving | Code Available | 1 |
| MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers | Sep 2, 2021 | Math, Math Word Problem Solving | Code Available | 1 |
| Math Word Problem Solving with Explicit Numerical Values | Aug 1, 2021 | Math, Math Word Problem Solving | Code Available | 1 |
| MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving | Jul 28, 2021 | Common Sense Reasoning, Language Modeling | Code Available | 1 |
| Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks | Jul 3, 2021 | Decoder, Math | Code Available | 1 |
| A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers | Jun 30, 2021 | Diversity, Math | Code Available | 1 |
| Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions | Jun 7, 2021 | Math, Question Answering | Code Available | 1 |
| MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics Education | Jun 2, 2021 | Knowledge Tracing, Language Modeling | Code Available | 1 |
| GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning | May 30, 2021 | Math, Mathematical Reasoning | Code Available | 1 |
| Design and implementation of an environment for Learning to Run a Power Network (L2RPN) | Apr 6, 2021 | Math, reinforcement-learning | Code Available | 1 |
| Are NLP Models really able to Solve Simple Math Word Problems? | Mar 12, 2021 | Math, Math Word Problem Solving | Code Available | 1 |
| Learning by Fixing: Solving Math Word Problems with Weak Supervision | Dec 19, 2020 | Math, Weakly-supervised Learning | Code Available | 1 |
| Semantically-Aligned Universal Tree-Structured Solver for Math Word Problems | Oct 14, 2020 | Decoder, Math | Code Available | 1 |