| Speculative Decoding for Multi-Sample Inference | Mar 7, 2025 | Mathematical Reasoning | Unverified | 0 |
| Better Process Supervision with Bi-directional Rewarding Signals | Mar 6, 2025 | Language Modeling | Unverified | 0 |
| Towards Understanding Multi-Round Large Language Model Reasoning: Approximability, Learnability and Generalizability | Mar 5, 2025 | Language Modeling | Unverified | 0 |
| Process-based Self-Rewarding Language Models | Mar 5, 2025 | Mathematical Reasoning | Code Available | 0 |
| An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning | Mar 4, 2025 | Mathematical Reasoning | Code Available | 0 |
| Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models | Mar 4, 2025 | GSM8K, Math | Unverified | 0 |
| PromptCoT: Synthesizing Olympiad-level Problems for Mathematical Reasoning in Large Language Models | Mar 4, 2025 | GSM8K, Math | Code Available | 1 |
| None of the Above, Less of the Right: Parallel Patterns between Humans and LLMs on Multi-Choice Questions Answering | Mar 3, 2025 | Business Ethics, Ethics | Unverified | 0 |
| MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts | Feb 28, 2025 | Math, Mathematical Reasoning | Unverified | 0 |
| Multi2: Multi-Agent Test-Time Scalable Framework for Multi-Document Processing | Feb 27, 2025 | Document Summarization, Large Language Model | Unverified | 0 |