| Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions | Jun 1, 2023 | Math | —Unverified | 0 |
| Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test | Jun 26, 2025 | Code GenerationLarge Language Model | —Unverified | 0 |
| Modeling Student Response Times: Towards Efficient One-on-one Tutoring Dialogues | Nov 1, 2018 | Math | —Unverified | 0 |
| Modelling silicosis: dynamics of a model with piecewise constant rate coefficients | Sep 2, 2021 | Math | —Unverified | 0 |
| Models Can and Should Embrace the Communicative Nature of Human-Generated Math | Sep 25, 2024 | Math | —Unverified | 0 |
| MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models | Feb 20, 2024 | Common Sense ReasoningContrastive Learning | —Unverified | 0 |
| MoL for LLMs: Dual-Loss Optimization to Enhance Domain Expertise While Preserving General Capabilities | May 17, 2025 | Math | —Unverified | 0 |
| Assessing and Verifying Task Utility in LLM-Powered Applications | May 3, 2024 | Math | —Unverified | 0 |
| More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models | May 23, 2025 | DiagnosticHallucination | —Unverified | 0 |
| MSA at BEA 2025 Shared Task: Disagreement-Aware Instruction Tuning for Multi-Dimensional Evaluation of LLMs as Math Tutors | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |