| Uncertainty-Aware Step-wise Verification with Generative Reward Models | Feb 16, 2025 | Mathematical ReasoningUncertainty Quantification | —Unverified | 0 |
| Leveraging Constrained Monte Carlo Tree Search to Generate Reliable Long Chain-of-Thought for Mathematical Reasoning | Feb 16, 2025 | Mathematical Reasoning | —Unverified | 0 |
| 1bit-Merging: Dynamic Quantized Merging for Large Language Models | Feb 15, 2025 | Code GenerationMath | —Unverified | 0 |
| Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering | Feb 14, 2025 | Mathematical ReasoningObject | —Unverified | 0 |
| GoRA: Gradient-driven Adaptive Low Rank Adaptation | Feb 13, 2025 | Computational EfficiencyMathematical Reasoning | —Unverified | 0 |
| Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models | Feb 12, 2025 | Mathematical ReasoningMMLU | —Unverified | 0 |
| Mathematical Reasoning in Large Language Models: Assessing Logical and Arithmetic Errors across Wide Numerical Ranges | Feb 12, 2025 | GSM8KMath | CodeCode Available | 0 |
| One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs | Feb 12, 2025 | Mathematical Reasoning | —Unverified | 0 |
| LLMs can implicitly learn from mistakes in-context | Feb 12, 2025 | Mathematical Reasoning | —Unverified | 0 |
| Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning | Feb 11, 2025 | Code GenerationMath | CodeCode Available | 0 |