| The Role of Diversity in In-Context Learning for Large Language Models | May 26, 2025 | Diversity, In-Context Learning | Unverified | 0 |
| Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles | May 26, 2025 | ARC, Logical Reasoning | Unverified | 0 |
| Interleaved Reasoning for Large Language Models via Reinforcement Learning | May 26, 2025 | Logical Reasoning, Math | Unverified | 0 |
| Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning | May 26, 2025 | Diversity, Math | Unverified | 0 |
| Improving Multilingual Math Reasoning for African Languages | May 26, 2025 | Math, Mathematical Reasoning | Unverified | 0 |
| Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models | May 26, 2025 | Contrastive Learning, Math | Code Available | 0 |
| Inference-time Alignment in Continuous Space | May 26, 2025 | Math | Code Available | 0 |
| AI4Math: A Native Spanish Benchmark for University-Level Mathematical Reasoning in Large Language Models | May 25, 2025 | Math, Mathematical Reasoning | Unverified | 0 |
| MMATH: A Multilingual Benchmark for Mathematical Reasoning | May 25, 2025 | Math, Mathematical Reasoning | Code Available | 0 |
| Enumerate-Conjecture-Prove: Formally Solving Answer-Construction Problems in Math Competitions | May 24, 2025 | Automated Theorem Proving, Math | Code Available | 0 |