| Automatized Evaluation of Formalization Exercises in Mathematics | Jun 2, 2020 | MathSentence | —Unverified | 0 |
| LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks | Dec 17, 2024 | Math | —Unverified | 0 |
| Automatic tagging of knowledge points for K12 math problems | Aug 21, 2022 | ClassificationMath | —Unverified | 0 |
| Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers | Jun 5, 2025 | GSM8KMath | —Unverified | 0 |
| LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ | Sep 25, 2024 | ChatbotGSM8K | —Unverified | 0 |
| Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology | Nov 5, 2024 | MathMisconceptions | —Unverified | 0 |
| Meaning-Typed Programming: Language Abstraction and Runtime for Model-Integrated Applications | May 14, 2024 | GSM8KMath | —Unverified | 0 |
| LLMs as Potential Brainstorming Partners for Math and Science Problems | Oct 10, 2023 | Math | —Unverified | 0 |
| Automatic Generation of High Quality CCGbanks for Parser Domain Adaptation | Jun 5, 2019 | Domain AdaptationMath | —Unverified | 0 |
| LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought | May 9, 2024 | HallucinationMath | —Unverified | 0 |