| Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents | May 19, 2025 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Herald: A Natural Language Annotated Lean 4 Dataset | Oct 9, 2024 | Math, Mathematical Reasoning | —Unverified | 0 | 0 |
| HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models | Sep 27, 2024 | Code Generation, Mathematical Reasoning | —Unverified | 0 | 0 |
| HOFT: Householder Orthogonal Fine-tuning | May 22, 2025 | Machine Translation, Mathematical Reasoning | —Unverified | 0 | 0 |
| How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs' Reasoning Capabilities: A Preliminary Experimental Study | Apr 1, 2025 | Code Generation, Math | —Unverified | 0 | 0 |
| How Does Quantization Affect Multilingual LLMs? | Jul 3, 2024 | Mathematical Reasoning, Quantization | —Unverified | 0 | 0 |
| How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs | Oct 17, 2024 | Mathematical Reasoning | —Unverified | 0 | 0 |
| HS-STAR: Hierarchical Sampling for Self-Taught Reasoners via Difficulty Estimation and Budget Reallocation | May 26, 2025 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Improve Mathematical Reasoning in Language Models by Automated Process Supervision | Jun 5, 2024 | GSM8K, Math | —Unverified | 0 | 0 |
| Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation | Nov 22, 2024 | Knowledge Distillation, Mathematical Reasoning | —Unverified | 0 | 0 |
| Improving Multilingual Math Reasoning for African Languages | May 26, 2025 | Math, Mathematical Reasoning | —Unverified | 0 | 0 |
| Improving Physics Reasoning in Large Language Models Using Mixture of Refinement Agents | Dec 1, 2024 | Mathematical Reasoning, MMLU | —Unverified | 0 | 0 |
| Improving RL Exploration for LLM Reasoning through Retrospective Replay | Apr 19, 2025 | Code Generation, Mathematical Reasoning | —Unverified | 0 | 0 |
| Improving Rule-based Reasoning in LLMs via Neurosymbolic Representations | Jan 31, 2025 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Distilling Mathematical Reasoning Capabilities into Small Language Models | Jan 22, 2024 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks | Oct 24, 2024 | Logical Reasoning, Mathematical Problem-Solving | —Unverified | 0 | 0 |
| InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning | Sep 19, 2024 | Math, Mathematical Reasoning | —Unverified | 0 | 0 |
| Innate Reasoning is Not Enough: In-Context Learning Enhances Reasoning Large Language Models with Less Overthinking | Mar 25, 2025 | In-Context Learning, Mathematical Reasoning | —Unverified | 0 | 0 |
| Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs | Jun 25, 2025 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models | Feb 18, 2025 | Data Augmentation, GSM8K | —Unverified | 0 | 0 |
| Integrating External Tools with Large Language Models to Improve Accuracy | Jul 9, 2025 | Mathematical Reasoning, MMLU | —Unverified | 0 | 0 |
| InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model | Jan 21, 2025 | Instruction Following, Mathematical Reasoning | —Unverified | 0 | 0 |
| Investigating the Effectiveness of ChatGPT in Mathematical Reasoning and Problem Solving: Evidence from the Vietnamese National High School Graduation Examination | Jun 10, 2023 | Math, Mathematical Reasoning | —Unverified | 0 | 0 |
| Investigating the interaction of linguistic and mathematical reasoning in language models using multilingual number puzzles | Jun 16, 2025 | Diversity, Mathematical Reasoning | —Unverified | 0 | 0 |
| Investigating the Potential of Large Language Model-Based Router Multi-Agent Architectures for Foundation Design Automation: A Task Classification and Expert Selection Study | Jun 13, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models | Jun 5, 2024 | Mathematical Reasoning, Natural Language Inference | —Unverified | 0 | 0 |
| Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist | Jul 11, 2024 | GSM8K, Math | —Unverified | 0 | 0 |
| iTBLS: A Dataset of Interactive Conversations Over Tabular Information | Apr 19, 2024 | Articles, Mathematical Reasoning | —Unverified | 0 | 0 |
| JiuZhang 2.0: A Unified Chinese Pre-trained Language Model for Multi-task Mathematical Problem Solving | Jun 19, 2023 | In-Context Learning, Language Modeling | —Unverified | 0 | 0 |
| Keep Guessing? When Considering Inference Scaling, Mind the Baselines | Oct 20, 2024 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning | Mar 4, 2024 | GSM8K, Math | —Unverified | 0 | 0 |
| Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model | Jul 14, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning? | Jul 15, 2025 | GSM8K, Language Modeling | —Unverified | 0 | 0 |
| Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey | May 6, 2025 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Knowledge Distillation of LLM for Automatic Scoring of Science Education Assessments | Dec 26, 2023 | Knowledge Distillation, Mathematical Reasoning | —Unverified | 0 | 0 |
| Kwai-STaR: Transform LLMs into State-Transition Reasoners | Nov 7, 2024 | GSM8K, Mathematical Problem-Solving | —Unverified | 0 | 0 |
| KwaiYiiMath: Technical Report | Oct 11, 2023 | Arithmetic Reasoning, GSM8K | —Unverified | 0 | 0 |
| Mathematical Reasoning via Self-supervised Skip-tree Training | Jun 8, 2020 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Language Models Use Trigonometry to Do Addition | Feb 2, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| LANS: A Layout-Aware Neural Solver for Plane Geometry Problem | Nov 25, 2023 | Geometry Problem Solving, Language Modelling | —Unverified | 0 | 0 |
| Large Language Models and Mathematical Reasoning Failures | Feb 17, 2025 | Mathematical Reasoning, Physical Intuition | —Unverified | 0 | 0 |
| Large Language Models Don't Make Sense of Word Problems. A Scoping Review from a Mathematics Education Perspective | Jun 30, 2025 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Large Language Models for Combinatorial Optimization of Design Structure Matrix | Nov 19, 2024 | Combinatorial Optimization, Mathematical Reasoning | —Unverified | 0 | 0 |
| Large Language Models for Design Structure Matrix Optimization | Jun 11, 2025 | Combinatorial Optimization, Mathematical Reasoning | —Unverified | 0 | 0 |
| Large Language Models for Mathematical Reasoning: Progresses and Challenges | Jan 31, 2024 | Diversity, Math | —Unverified | 0 | 0 |
| Large Language Models Have Intrinsic Meta-Cognition, but Need a Good Lens | Jun 10, 2025 | Benchmarking, Mathematical Reasoning | —Unverified | 0 | 0 |
| Large Multi-Modal Models (LMMs) as Universal Foundation Models for AI-Native Wireless Systems | Jan 30, 2024 | Mathematical Reasoning, RAG | —Unverified | 0 | 0 |
| Layer Importance for Mathematical Reasoning is Forged in Pre-Training and Invariant after Post-Training | Jun 27, 2025 | Knowledge Distillation, Mathematical Reasoning | —Unverified | 0 | 0 |
| Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models | Oct 2, 2024 | Cross-Lingual Transfer, Math | —Unverified | 0 | 0 |
| LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction | Feb 25, 2025 | Automated Theorem Proving, Mathematical Reasoning | —Unverified | 0 | 0 |