| Unraveling Arithmetic in Large Language Models: The Role of Algebraic Structures | Nov 25, 2024 | GSM8KMath | —Unverified | 0 | 0 |
| Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning | Apr 6, 2025 | Math | —Unverified | 0 | 0 |
| Reverse Thinking Makes LLMs Stronger Reasoners | Nov 29, 2024 | Data AugmentationKnowledge Distillation | —Unverified | 0 | 0 |
| Revisiting Priority k-Center: Fairness and Outliers | Mar 4, 2021 | ClusteringFairness | —Unverified | 0 | 0 |
| Revisiting the Superficial Alignment Hypothesis | Sep 27, 2024 | Instruction FollowingMath | —Unverified | 0 | 0 |
| RevOrder: A Novel Method for Enhanced Arithmetic in Language Models | Feb 6, 2024 | GSM8KMath | —Unverified | 0 | 0 |
| A Method to Support Difficult Re-finding Tasks | Jan 27, 2016 | Math | —Unverified | 0 | 0 |
| Rip van Winkle's Razor: A Simple Estimate of Overfit to Test Data | Feb 25, 2021 | Math | —Unverified | 0 | 0 |
| RL of Thoughts: Navigating LLM Reasoning with Inference-time Reinforcement Learning | May 20, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets | Apr 28, 2025 | Data AugmentationDiversity | —Unverified | 0 | 0 |