| Mathematical Reasoning for Unmanned Aerial Vehicles: A RAG-Based Approach for Complex Arithmetic Reasoning | Jun 5, 2025 | Arithmetic ReasoningMath | CodeCode Available | 0 |
| Guided Speculative Inference for Efficient Test-Time Alignment of LLMs | Jun 4, 2025 | Math | CodeCode Available | 0 |
| Rectified Sparse Attention | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem | Jun 3, 2025 | GPUMath | —Unverified | 0 |
| MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching | Jun 3, 2025 | Data AugmentationInstruction Following | —Unverified | 0 |
| SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis | Jun 2, 2025 | 8kMath | —Unverified | 0 |
| Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains | Jun 2, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 |
| Invariance Makes LLM Unlearning Resilient Even to Unanticipated Downstream Fine-Tuning | Jun 2, 2025 | Machine UnlearningMath | CodeCode Available | 0 |
| GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking | Jun 1, 2025 | 4kMath | CodeCode Available | 0 |
| Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning | May 30, 2025 | Mathreinforcement-learning | —Unverified | 0 |