| Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models | Feb 27, 2025 | Mathematical ReasoningMulti-Armed Bandits | —Unverified | 0 |
| Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation | Feb 27, 2025 | DiversityMathematical Reasoning | —Unverified | 0 |
| OmniRouter: Budget and Performance Controllable Multi-LLM Routing | Feb 27, 2025 | AI AgentMathematical Reasoning | CodeCode Available | 0 |
| Multi2: Multi-Agent Test-Time Scalable Framework for Multi-Document Processing | Feb 27, 2025 | Document SummarizationLarge Language Model | —Unverified | 0 |
| Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners | Feb 27, 2025 | MambaMathematical Reasoning | —Unverified | 0 |
| Weaker LLMs' Opinions Also Matter: Mixture of Opinions Enhances LLM's Mathematical Reasoning | Feb 26, 2025 | GSM8KMathematical Reasoning | —Unverified | 0 |
| LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction | Feb 25, 2025 | Automated Theorem ProvingMathematical Reasoning | —Unverified | 0 |
| Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning | Feb 25, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning | Feb 24, 2025 | MathMathematical Reasoning | CodeCode Available | 0 |
| The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer | Feb 21, 2025 | MathMathematical Reasoning | CodeCode Available | 0 |