| RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems? | Jan 20, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 | 0 |
| RedStone: Curating General, Code, Math, and QA Data for Large Language Models | Dec 4, 2024 | Domain AdaptationMath | —Unverified | 0 | 0 |
| Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning | May 30, 2025 | Mathreinforcement-learning | —Unverified | 0 | 0 |
| Analytic solution of the SEIR epidemic model via asymptotic approximant | Jun 30, 2020 | FormMath | —Unverified | 0 | 0 |
| Analysis of the Reasoning with Redundant Information Provided Ability of Large Language Models | Oct 6, 2023 | 8kMath | —Unverified | 0 | 0 |
| Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem | Jun 3, 2025 | GPUMath | —Unverified | 0 | 0 |
| Reinforced optimal control | Nov 24, 2020 | Mathregression | —Unverified | 0 | 0 |
| Reinforce LLM Reasoning through Multi-Agent Reflection | Jun 10, 2025 | MathOut-of-Distribution Generalization | —Unverified | 0 | 0 |
| Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task | Apr 11, 2023 | Deep Reinforcement LearningExplainable artificial intelligence | —Unverified | 0 | 0 |
| A multi-core periphery perspective: Ranking via relative centrality | Jun 6, 2024 | Math | —Unverified | 0 | 0 |