| A Meaning-based Statistical English Math Word Problem Solver | Mar 16, 2018 | Math | Code Available | 0 |
| VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency | Nov 13, 2023 | Math, Mathematical Reasoning | Code Available | 0 |
| Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning | Sep 20, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Solving Math Word Problem with Problem Type Classification | Aug 26, 2023 | Answer Selection, Classification | Code Available | 0 |
| Exploring the Reliability of Large Language Models as Customized Evaluators for Diverse NLP Tasks | Oct 30, 2023 | Fairness, Math | Code Available | 0 |
| Discriminative Policy Optimization for Token-Level Reward Models | May 29, 2025 | GSM8K, Language Modeling | Code Available | 0 |
| AIFB-WebScience at SemEval-2022 Task 12: Relation Extraction First - Using Relation Extraction to Identify Entities | Jul 1, 2022 | Joint Entity and Relation Extraction, Math | Code Available | 0 |
| Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing | Jul 15, 2025 | Knowledge Tracing, Math | Code Available | 0 |
| Spectral Derivatives | Jun 6, 2025 | Math | Code Available | 0 |
| A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving | Jan 7, 2025 | Diversity, Knowledge Distillation | Code Available | 0 |
| Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Boostrapping | Jan 31, 2025 | Denoising, Image Denoising | Code Available | 0 |
| Combining Large Language Models with Tutoring System Intelligence: A Case Study in Caregiver Homework Support | Dec 16, 2024 | Large Language Model, Math | Code Available | 0 |
| Benchmarking Large Language Models for Math Reasoning Tasks | Aug 20, 2024 | Benchmarking, In-Context Learning | Code Available | 0 |
| Meta-Reasoning Improves Tool Use in Large Language Models | Nov 7, 2024 | Math | Code Available | 0 |
| metboost: Exploratory regression analysis with hierarchically clustered data | Feb 13, 2017 | Math, Missing Values | Code Available | 0 |
| AIFB-WebScience at SemEval-2022 Task 12: Relation Extraction First -- Using Relation Extraction to Identify Entities | Mar 10, 2022 | Joint Entity and Relation Extraction, Math | Code Available | 0 |
| Language Representation Favored Zero-Shot Cross-Domain Cognitive Diagnosis | Jan 18, 2025 | cognitive diagnosis, Math | Code Available | 0 |
| ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions | Dec 4, 2023 | Arithmetic Reasoning, Math | Code Available | 0 |
| AI-Assisted Generation of Difficult Math Questions | Jul 30, 2024 | Math, Mathematical Reasoning | Code Available | 0 |
| Upweighting Easy Samples in Fine-Tuning Mitigates Forgetting | Feb 5, 2025 | GSM8K, Math | Code Available | 0 |
| SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning | May 22, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation | Oct 17, 2024 | GSM8K, Language Modeling | Code Available | 0 |
| Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation | Dec 20, 2024 | Math, Mathematical Reasoning | Code Available | 0 |
| Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia | Oct 2, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Scalable and Equitable Math Problem Solving Strategy Prediction in Big Educational Data | Aug 7, 2023 | Math, Misconceptions | Code Available | 0 |