| Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning | Feb 25, 2025 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| MATHion: Solving Math Word Problems with Logically Consistent Problems | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Towards Tractable Mathematical Reasoning: Challenges, Strategies, and Opportunities for Solving Math Word Problems | Oct 29, 2021 | Answer GenerationMath | —Unverified | 0 | 0 |
| A Theme-Rewriting Approach for Generating Algebra Word Problems | Oct 19, 2016 | MathText Generation | —Unverified | 0 | 0 |
| Math Multiple Choice Question Generation via Human-Large Language Model Collaboration | May 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Towards Trustworthy AutoGrading of Short, Multi-lingual, Multi-type Answers | Jan 2, 2022 | MathVocal Bursts Type Prediction | —Unverified | 0 | 0 |
| Atari games and Intel processors | May 19, 2017 | Atari GamesBIG-bench Machine Learning | —Unverified | 0 | 0 |
| Math Operation Embeddings for Open-ended Solution Analysis and Feedback | Apr 25, 2021 | Math | —Unverified | 0 | 0 |
| MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations | Feb 10, 2025 | BenchmarkingIn-Context Learning | —Unverified | 0 | 0 |
| MathPhys-Guided Coarse-to-Fine Anomaly Synthesis with SQE-Driven Bi-Level Optimization for Anomaly Detection | Apr 17, 2025 | Anomaly DetectionData Augmentation | —Unverified | 0 | 0 |