| Entropy-Aware Branching for Improved Mathematical Reasoning | Mar 27, 2025 | Mathematical Reasoning | —Unverified | 0 |
| MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams | Mar 26, 2025 | Mathematical ReasoningObject Counting | —Unverified | 0 |
| Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence | Mar 26, 2025 | Mathematical Reasoning | CodeCode Available | 0 |
| RL-finetuning LLMs from on- and off-policy data with a single algorithm | Mar 25, 2025 | Mathematical Reasoning | —Unverified | 0 |
| Learning to chain-of-thought with Jensen's evidence lower bound | Mar 25, 2025 | Mathematical Reasoningreinforcement-learning | —Unverified | 0 |
| Innate Reasoning is Not Enough: In-Context Learning Enhances Reasoning Large Language Models with Less Overthinking | Mar 25, 2025 | In-Context LearningMathematical Reasoning | —Unverified | 0 |
| Process or Result? Manipulated Ending Tokens Can Mislead Reasoning LLMs to Ignore the Correct Reasoning Steps | Mar 25, 2025 | Mathematical Reasoning | —Unverified | 0 |
| CLEAR: Contrasting Textual Feedback with Experts and Amateurs for Reasoning | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Lost in Cultural Translation: Do LLMs Struggle with Math Across Cultural Contexts? | Mar 23, 2025 | GSM8KMath | CodeCode Available | 0 |
| A Survey on Mathematical Reasoning and Optimization with Large Language Models | Mar 22, 2025 | Automated Theorem ProvingHeuristic Search | CodeCode Available | 0 |
| Temporal Consistency for LLM Reasoning Process Error Identification | Mar 18, 2025 | Mathematical Reasoning | CodeCode Available | 0 |
| Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning | Mar 17, 2025 | Mathematical ReasoningMultimodal Reasoning | —Unverified | 0 |
| Pensez: Less Data, Better Reasoning -- Rethinking French LLM | Mar 17, 2025 | Large Language ModelMath | —Unverified | 0 |
| Reliable and Efficient Amortized Model-based Evaluation | Mar 17, 2025 | DiagnosticMathematical Reasoning | —Unverified | 0 |
| Evaluating Mathematical Reasoning Across Large Language Models: A Fine-Grained Approach | Mar 13, 2025 | Formal LogicMathematical Reasoning | —Unverified | 0 |
| VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models | Mar 10, 2025 | Binary ClassificationHallucination | CodeCode Available | 0 |
| Speculative Decoding for Multi-Sample Inference | Mar 7, 2025 | Mathematical Reasoning | —Unverified | 0 |
| Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information | Mar 7, 2025 | Geometry Problem SolvingMathematical Reasoning | —Unverified | 0 |
| Better Process Supervision with Bi-directional Rewarding Signals | Mar 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Understanding Multi-Round Large Language Model Reasoning: Approximability, Learnability and Generalizability | Mar 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Process-based Self-Rewarding Language Models | Mar 5, 2025 | Mathematical Reasoning | CodeCode Available | 0 |
| An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning | Mar 4, 2025 | Mathematical Reasoning | CodeCode Available | 0 |
| Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models | Mar 4, 2025 | GSM8KMath | —Unverified | 0 |
| None of the Above, Less of the Right: Parallel Patterns between Humans and LLMs on Multi-Choice Questions Answering | Mar 3, 2025 | Business EthicsEthics | —Unverified | 0 |
| MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts | Feb 28, 2025 | MathMathematical Reasoning | —Unverified | 0 |