| Thinking Preference Optimization | Feb 17, 2025 | Math | CodeCode Available | 1 |
| MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task | Feb 17, 2025 | Code CompletionGSM8K | —Unverified | 0 |
| Scaling Test-Time Compute Without Verification or RL is Suboptimal | Feb 17, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 |
| Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving | Feb 17, 2025 | MathMathematical Problem-Solving | —Unverified | 0 |
| Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption | Feb 17, 2025 | BenchmarkingCode Summarization | —Unverified | 0 |
| Why Vision Language Models Struggle with Visual Arithmetic? Towards Enhanced Chart and Geometry Understanding | Feb 17, 2025 | Arithmetic ReasoningChart Understanding | —Unverified | 0 |
| A Study on Leveraging Search and Self-Feedback for Agent Reasoning | Feb 17, 2025 | Math | —Unverified | 0 |
| Warmup-Distill: Bridge the Distribution Mismatch between Teacher and Student before Knowledge Distillation | Feb 17, 2025 | Knowledge DistillationMath | CodeCode Available | 0 |
| Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models | Feb 17, 2025 | Math | —Unverified | 0 |
| Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL | Feb 17, 2025 | Code GenerationMath | CodeCode Available | 1 |