| Assessing Robustness to Spurious Correlations in Post-Training Language Models | May 9, 2025 | Instruction Following, Mathematical Reasoning | Unverified | 0 |
| Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey | May 6, 2025 | Mathematical Reasoning | Unverified | 0 |
| RV-Syn: Rational and Verifiable Mathematical Reasoning Data Synthesis based on Structured Function Library | Apr 29, 2025 | Data Augmentation, Mathematical Reasoning | Unverified | 0 |
| Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think | Apr 29, 2025 | Mathematical Reasoning | Code Available | 0 |
| Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets | Apr 28, 2025 | Data Augmentation, Diversity | Unverified | 0 |
| Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning | Apr 28, 2025 | Mathematical Reasoning | Unverified | 0 |
| Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models | Apr 28, 2025 | Mathematical Reasoning, Meta-Learning | Code Available | 0 |
| SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning | Apr 27, 2025 | Large Language Model, Mathematical Reasoning | Unverified | 0 |
| Hierarchical Attention Generates Better Proofs | Apr 27, 2025 | Automated Theorem Proving, Mathematical Proofs | Code Available | 0 |
| PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts | Apr 25, 2025 | Diversity, Mathematical Reasoning | Unverified | 0 |