| Limits of an AI program for solving college math problems | Aug 14, 2022 | Few-Shot LearningMath | —Unverified | 0 | 0 |
| Automatized Evaluation of Formalization Exercises in Mathematics | Jun 2, 2020 | MathSentence | —Unverified | 0 | 0 |
| LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks | Dec 17, 2024 | Math | —Unverified | 0 | 0 |
| Automatic tagging of knowledge points for K12 math problems | Aug 21, 2022 | ClassificationMath | —Unverified | 0 | 0 |
| Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers | Jun 5, 2025 | GSM8KMath | —Unverified | 0 | 0 |
| LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ | Sep 25, 2024 | ChatbotGSM8K | —Unverified | 0 | 0 |
| Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology | Nov 5, 2024 | MathMisconceptions | —Unverified | 0 | 0 |
| Meaning-Typed Programming: Language Abstraction and Runtime for Model-Integrated Applications | May 14, 2024 | GSM8KMath | —Unverified | 0 | 0 |
| LLMs as Potential Brainstorming Partners for Math and Science Problems | Oct 10, 2023 | Math | —Unverified | 0 | 0 |
| Automatic Generation of High Quality CCGbanks for Parser Domain Adaptation | Jun 5, 2019 | Domain AdaptationMath | —Unverified | 0 | 0 |
| LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought | May 9, 2024 | HallucinationMath | —Unverified | 0 | 0 |
| LLMs Do Not Have Human-Like Working Memory | Apr 30, 2025 | Math | —Unverified | 0 | 0 |
| LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems | Oct 18, 2024 | In-Context LearningMath | —Unverified | 0 | 0 |
| Local and global asymptotic inference in smoothing spline models | Dec 30, 2012 | Mathvalid | —Unverified | 0 | 0 |
| Local Prompt Optimization | Apr 29, 2025 | GSM8KMath | —Unverified | 0 | 0 |
| Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems | Aug 29, 2024 | GSM8KLanguage Modeling | —Unverified | 0 | 0 |
| What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret | Mar 3, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Long Is More Important Than Difficult for Training Reasoning Models | Mar 23, 2025 | Math | —Unverified | 0 | 0 |
| LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception | Apr 21, 2025 | MathMMLU | —Unverified | 0 | 0 |
| Long-range Sequence Modeling with Predictable Sparse Attention | May 1, 2022 | Math | —Unverified | 0 | 0 |
| LookAlike: Consistent Distractor Generation in Math MCQs | May 3, 2025 | Distractor GenerationMath | —Unverified | 0 | 0 |