| Fourier Circuits in Neural Networks and Transformers: A Case Study of Modular Arithmetic with Multiple Inputs | Feb 12, 2024 | 2kMathematical Reasoning | —Unverified | 0 |
| From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks | Sep 6, 2024 | Machine TranslationMathematical Reasoning | —Unverified | 0 |
| From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in Education | Feb 19, 2025 | DiagnosticGSM8K | —Unverified | 0 |
| From Good to Great: Improving Math Reasoning with Tool-Augmented Interleaf Prompting | Dec 18, 2023 | DiversityGSM8K | —Unverified | 0 |
| From Informal to Formal -- Incorporating and Evaluating LLMs on Natural Language Requirements to Verifiable Formal Proofs | Jan 27, 2025 | 4kMathematical Reasoning | —Unverified | 0 |
| FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI | Nov 7, 2024 | Mathematical Reasoning | —Unverified | 0 |
| Full-Step-DPO: Self-Supervised Preference Optimization with Step-wise Rewards for Mathematical Reasoning | Feb 20, 2025 | Mathematical Reasoning | —Unverified | 0 |
| GAPS: Geometry-Aware Problem Solver | Jan 29, 2024 | Geometry Problem SolvingMath | —Unverified | 0 |
| GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning | Apr 1, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| GeomVerse: A Systematic Evaluation of Large Models for Geometric Reasoning | Dec 19, 2023 | Mathematical Reasoning | —Unverified | 0 |
| GFlowNet Fine-tuning for Diverse Correct Solutions in Mathematical Reasoning Tasks | Oct 26, 2024 | DiversityMathematical Reasoning | —Unverified | 0 |
| GoRA: Gradient-driven Adaptive Low Rank Adaptation | Feb 13, 2025 | Computational EfficiencyMathematical Reasoning | —Unverified | 0 |
| GraphIC: A Graph-Based In-Context Example Retrieval Model for Multi-Step Reasoning | Oct 3, 2024 | Code GenerationIn-Context Learning | —Unverified | 0 |
| GraphMR: Graph Neural Network for Mathematical Reasoning | Nov 1, 2021 | Graph Neural NetworkGraph-to-Sequence | —Unverified | 0 |
| Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence | May 23, 2025 | GPULarge Language Model | —Unverified | 0 |
| Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents | May 19, 2025 | Mathematical Reasoning | —Unverified | 0 |
| Herald: A Natural Language Annotated Lean 4 Dataset | Oct 9, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models | Sep 27, 2024 | Code GenerationMathematical Reasoning | —Unverified | 0 |
| HOFT: Householder Orthogonal Fine-tuning | May 22, 2025 | Machine TranslationMathematical Reasoning | —Unverified | 0 |
| How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs' Reasoning Capabilities: A Preliminary Experimental Study | Apr 1, 2025 | Code GenerationMath | —Unverified | 0 |
| How Does Quantization Affect Multilingual LLMs? | Jul 3, 2024 | Mathematical ReasoningQuantization | —Unverified | 0 |
| How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs | Oct 17, 2024 | Mathematical Reasoning | —Unverified | 0 |
| HS-STAR: Hierarchical Sampling for Self-Taught Reasoners via Difficulty Estimation and Budget Reallocation | May 26, 2025 | Mathematical Reasoning | —Unverified | 0 |
| Improve Mathematical Reasoning in Language Models by Automated Process Supervision | Jun 5, 2024 | GSM8KMath | —Unverified | 0 |
| Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation | Nov 22, 2024 | Knowledge DistillationMathematical Reasoning | —Unverified | 0 |