| IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models | Jun 5, 2024 | Mathematical Reasoning, Natural Language Inference | —Unverified | 0 | 0 |
| Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist | Jul 11, 2024 | GSM8K, Math | —Unverified | 0 | 0 |
| iTBLS: A Dataset of Interactive Conversations Over Tabular Information | Apr 19, 2024 | Articles, Mathematical Reasoning | —Unverified | 0 | 0 |
| JiuZhang 2.0: A Unified Chinese Pre-trained Language Model for Multi-task Mathematical Problem Solving | Jun 19, 2023 | In-Context Learning, Language Modeling | —Unverified | 0 | 0 |
| Keep Guessing? When Considering Inference Scaling, Mind the Baselines | Oct 20, 2024 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning | Mar 4, 2024 | GSM8K, Math | —Unverified | 0 | 0 |
| Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model | Jul 14, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning? | Jul 15, 2025 | GSM8K, Language Modeling | —Unverified | 0 | 0 |
| Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey | May 6, 2025 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Knowledge Distillation of LLM for Automatic Scoring of Science Education Assessments | Dec 26, 2023 | Knowledge Distillation, Mathematical Reasoning | —Unverified | 0 | 0 |
| Kwai-STaR: Transform LLMs into State-Transition Reasoners | Nov 7, 2024 | GSM8K, Mathematical Problem-Solving | —Unverified | 0 | 0 |
| KwaiYiiMath: Technical Report | Oct 11, 2023 | Arithmetic Reasoning, GSM8K | —Unverified | 0 | 0 |
| Mathematical Reasoning via Self-supervised Skip-tree Training | Jun 8, 2020 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Language Models Use Trigonometry to Do Addition | Feb 2, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| LANS: A Layout-Aware Neural Solver for Plane Geometry Problem | Nov 25, 2023 | Geometry Problem Solving, Language Modelling | —Unverified | 0 | 0 |
| Large Language Models and Mathematical Reasoning Failures | Feb 17, 2025 | Mathematical Reasoning, Physical Intuition | —Unverified | 0 | 0 |
| Large Language Models Don't Make Sense of Word Problems. A Scoping Review from a Mathematics Education Perspective | Jun 30, 2025 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Large Language Models for Combinatorial Optimization of Design Structure Matrix | Nov 19, 2024 | Combinatorial Optimization, Mathematical Reasoning | —Unverified | 0 | 0 |
| Large Language Models for Design Structure Matrix Optimization | Jun 11, 2025 | Combinatorial Optimization, Mathematical Reasoning | —Unverified | 0 | 0 |
| Large Language Models for Mathematical Reasoning: Progresses and Challenges | Jan 31, 2024 | Diversity, Math | —Unverified | 0 | 0 |
| Large Language Models Have Intrinsic Meta-Cognition, but Need a Good Lens | Jun 10, 2025 | Benchmarking, Mathematical Reasoning | —Unverified | 0 | 0 |
| Large Multi-Modal Models (LMMs) as Universal Foundation Models for AI-Native Wireless Systems | Jan 30, 2024 | Mathematical Reasoning, RAG | —Unverified | 0 | 0 |
| Layer Importance for Mathematical Reasoning is Forged in Pre-Training and Invariant after Post-Training | Jun 27, 2025 | Knowledge Distillation, Mathematical Reasoning | —Unverified | 0 | 0 |
| Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models | Oct 2, 2024 | Cross-Lingual Transfer, Math | —Unverified | 0 | 0 |
| LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction | Feb 25, 2025 | Automated Theorem Proving, Mathematical Reasoning | —Unverified | 0 | 0 |