| Evaluating and Optimizing Educational Content with Large Language Model Judgments | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions? | May 10, 2024 | Mathtext similarity | CodeCode Available | 0 | 5 |
| Prover-Verifier Games improve legibility of LLM outputs | Jul 18, 2024 | Math | CodeCode Available | 0 | 5 |
| A Goal-Driven Tree-Structured Neural Model for Math Word Problems | Aug 10, 2019 | MathMath Word Problem Solving | CodeCode Available | 0 | 5 |
| Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision | May 26, 2025 | HallucinationMath | CodeCode Available | 0 | 5 |
| EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbers | May 1, 2022 | MathMath Word Problem Solving | CodeCode Available | 0 | 5 |
| EquivPruner: Boosting Efficiency and Quality in LLM-Based Search via Action Pruning | May 22, 2025 | GSM8KMath | CodeCode Available | 0 | 5 |
| Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers | Apr 30, 2022 | Data AugmentationDiversity | CodeCode Available | 0 | 5 |
| Enumerate-Conjecture-Prove: Formally Solving Answer-Construction Problems in Math Competitions | May 24, 2025 | Automated Theorem ProvingMath | CodeCode Available | 0 | 5 |
| AALC: Large Language Model Efficient Reasoning via Adaptive Accuracy-Length Control | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |