| Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models | Feb 27, 2025 | Mathematical ReasoningMulti-Armed Bandits | —Unverified | 0 |
| Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation | Feb 27, 2025 | DiversityMathematical Reasoning | —Unverified | 0 |
| OmniRouter: Budget and Performance Controllable Multi-LLM Routing | Feb 27, 2025 | AI AgentMathematical Reasoning | CodeCode Available | 0 |
| Multi2: Multi-Agent Test-Time Scalable Framework for Multi-Document Processing | Feb 27, 2025 | Document SummarizationLarge Language Model | —Unverified | 0 |
| Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners | Feb 27, 2025 | MambaMathematical Reasoning | —Unverified | 0 |
| Weaker LLMs' Opinions Also Matter: Mixture of Opinions Enhances LLM's Mathematical Reasoning | Feb 26, 2025 | GSM8KMathematical Reasoning | —Unverified | 0 |
| LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction | Feb 25, 2025 | Automated Theorem ProvingMathematical Reasoning | —Unverified | 0 |
| Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning | Feb 25, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning | Feb 24, 2025 | MathMathematical Reasoning | CodeCode Available | 0 |
| The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer | Feb 21, 2025 | MathMathematical Reasoning | CodeCode Available | 0 |
| Retrieval-Augmented Process Reward Model for Generalizable Mathematical Reasoning | Feb 20, 2025 | Mathematical ReasoningRetrieval | —Unverified | 0 |
| Full-Step-DPO: Self-Supervised Preference Optimization with Step-wise Rewards for Mathematical Reasoning | Feb 20, 2025 | Mathematical Reasoning | —Unverified | 0 |
| CER: Confidence Enhanced Reasoning in LLMs | Feb 20, 2025 | MathMathematical Reasoning | CodeCode Available | 0 |
| From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in Education | Feb 19, 2025 | DiagnosticGSM8K | —Unverified | 0 |
| Theorem Prover as a Judge for Synthetic Data Generation | Feb 18, 2025 | Mathematical ProofsMathematical Reasoning | —Unverified | 0 |
| Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models | Feb 18, 2025 | Data AugmentationGSM8K | —Unverified | 0 |
| Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language Models | Feb 18, 2025 | Code GenerationGeneral Knowledge | —Unverified | 0 |
| Large Language Models and Mathematical Reasoning Failures | Feb 17, 2025 | Mathematical ReasoningPhysical Intuition | —Unverified | 0 |
| Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving | Feb 17, 2025 | MathMathematical Problem-Solving | —Unverified | 0 |
| MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task | Feb 17, 2025 | Code CompletionGSM8K | —Unverified | 0 |
| Uncertainty-Aware Step-wise Verification with Generative Reward Models | Feb 16, 2025 | Mathematical ReasoningUncertainty Quantification | —Unverified | 0 |
| Leveraging Constrained Monte Carlo Tree Search to Generate Reliable Long Chain-of-Thought for Mathematical Reasoning | Feb 16, 2025 | Mathematical Reasoning | —Unverified | 0 |
| 1bit-Merging: Dynamic Quantized Merging for Large Language Models | Feb 15, 2025 | Code GenerationMath | —Unverified | 0 |
| Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering | Feb 14, 2025 | Mathematical ReasoningObject | —Unverified | 0 |
| GoRA: Gradient-driven Adaptive Low Rank Adaptation | Feb 13, 2025 | Computational EfficiencyMathematical Reasoning | —Unverified | 0 |