| Subtle Errors Matter: Preference Learning via Error-injected Self-editing | Oct 9, 2024 | GSM8KMath | —Unverified | 0 |
| Supervised Optimism Correction: Be Confident When LLMs Are Sure | Apr 10, 2025 | GSM8KMath | —Unverified | 0 |
| Sustainability of Collusion and Market Transparency in a Sequential Search Market: a Generalization | May 5, 2021 | Mathematical Reasoning | —Unverified | 0 |
| Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models | Feb 20, 2024 | Instruction FollowingLogical Reasoning | —Unverified | 0 |
| Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use | Apr 7, 2025 | GSM8KMath | —Unverified | 0 |
| System-2 Mathematical Reasoning via Enriched Instruction Tuning | Dec 22, 2024 | ERPGSM8K | —Unverified | 0 |
| Table as Thought: Exploring Structured Thoughts in LLM Reasoning | Jan 4, 2025 | Mathematical Reasoning | —Unverified | 0 |
| Taming Generative Diffusion Prior for Universal Blind Image Restoration | Aug 21, 2024 | Image RestorationMathematical Reasoning | —Unverified | 0 |
| Tangram: Benchmark for Evaluating Geometric Element Recognition in Large Multimodal Models | Aug 25, 2024 | Mathematical Reasoning | —Unverified | 0 |
| Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving | Feb 17, 2025 | MathMathematical Problem-Solving | —Unverified | 0 |