| Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use | Apr 7, 2025 | GSM8KMath | —Unverified | 0 |
| Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models | Apr 7, 2025 | MathQuantization | CodeCode Available | 2 |
| Efficient Reinforcement Finetuning via Adaptive Curriculum Learning | Apr 7, 2025 | MathMathematical Reasoning | CodeCode Available | 2 |
| Reasoning Models Know When They're Right: Probing Hidden States for Self-Verification | Apr 7, 2025 | Logical ReasoningMath | —Unverified | 0 |
| Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning | Apr 6, 2025 | Math | —Unverified | 0 |
| oneDAL Optimization for ARM Scalable Vector Extension: Maximizing Efficiency for High-Performance Data Science | Apr 5, 2025 | Math | —Unverified | 0 |
| Explain with Visual Keypoints Like a Real Mentor! A Benchmark for Multimodal Solution Explanation | Apr 4, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning | Apr 4, 2025 | Mathreinforcement-learning | —Unverified | 0 |
| Large (Vision) Language Models are Unsupervised In-Context Learners | Apr 3, 2025 | GSM8KIn-Context Learning | CodeCode Available | 1 |
| MegaMath: Pushing the Limits of Open Math Corpora | Apr 3, 2025 | DiversityMath | CodeCode Available | 2 |