| OmniRouter: Budget and Performance Controllable Multi-LLM Routing | Feb 27, 2025 | AI AgentMathematical Reasoning | CodeCode Available | 0 |
| Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners | Feb 27, 2025 | MambaMathematical Reasoning | —Unverified | 0 |
| Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models | Feb 27, 2025 | Mathematical ReasoningMulti-Armed Bandits | —Unverified | 0 |
| Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation | Feb 27, 2025 | DiversityMathematical Reasoning | —Unverified | 0 |
| Self-rewarding correction for mathematical reasoning | Feb 26, 2025 | Mathematical Reasoning | CodeCode Available | 3 |
| Weaker LLMs' Opinions Also Matter: Mixture of Opinions Enhances LLM's Mathematical Reasoning | Feb 26, 2025 | GSM8KMathematical Reasoning | —Unverified | 0 |
| LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction | Feb 25, 2025 | Automated Theorem ProvingMathematical Reasoning | —Unverified | 0 |
| Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning | Feb 25, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought | Feb 24, 2025 | Mathematical ReasoningMisinformation | CodeCode Available | 1 |
| Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning | Feb 24, 2025 | MathMathematical Reasoning | CodeCode Available | 0 |