| LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems | Apr 3, 2025 | Mathematical Reasoning | Unverified | 0 |
| LexPam: Legal Procedure Awareness-Guided Mathematical Reasoning | Apr 3, 2025 | Mathematical Reasoning, Question Answering | Unverified | 0 |
| How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs' Reasoning Capabilities: A Preliminary Experimental Study | Apr 1, 2025 | Code Generation, Math | Unverified | 0 |
| GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning | Apr 1, 2025 | Math, Mathematical Reasoning | Unverified | 0 |
| Brains vs. Bytes: Evaluating LLM Proficiency in Olympiad Mathematics | Apr 1, 2025 | Math, Mathematical Problem-Solving | Unverified | 0 |
| VerifiAgent: a Unified Verification Agent in Language Model Reasoning | Apr 1, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs | Apr 1, 2025 | Knowledge Graphs, Mathematical Reasoning | Code Available | 3 |
| The Axiom-Based Atlas: A Structural Mapping of Theorems via Foundational Proof Vectors | Mar 31, 2025 | Mathematical Reasoning | Unverified | 0 |
| Crossing the Reward Bridge: Expanding RL with Verifiable Rewards Across Diverse Domains | Mar 31, 2025 | Mathematical Reasoning, reinforcement-learning | Unverified | 0 |
| Boosting MLLM Reasoning with Text-Debiased Hint-GRPO | Mar 31, 2025 | Mathematical Reasoning, Multimodal Reasoning | Code Available | 1 |