| Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks | Mar 27, 2025 | Imitation LearningMathematical Reasoning | CodeCode Available | 2 |
| MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams | Mar 26, 2025 | Mathematical ReasoningObject Counting | —Unverified | 0 |
| Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence | Mar 26, 2025 | Mathematical Reasoning | CodeCode Available | 0 |
| RL-finetuning LLMs from on- and off-policy data with a single algorithm | Mar 25, 2025 | Mathematical Reasoning | —Unverified | 0 |
| Learning to chain-of-thought with Jensen's evidence lower bound | Mar 25, 2025 | Mathematical Reasoningreinforcement-learning | —Unverified | 0 |
| Innate Reasoning is Not Enough: In-Context Learning Enhances Reasoning Large Language Models with Less Overthinking | Mar 25, 2025 | In-Context LearningMathematical Reasoning | —Unverified | 0 |
| Process or Result? Manipulated Ending Tokens Can Mislead Reasoning LLMs to Ignore the Correct Reasoning Steps | Mar 25, 2025 | Mathematical Reasoning | —Unverified | 0 |
| CLEAR: Contrasting Textual Feedback with Experts and Amateurs for Reasoning | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training | Mar 24, 2025 | DiversityLarge Language Model | CodeCode Available | 1 |
| Lost in Cultural Translation: Do LLMs Struggle with Math Across Cultural Contexts? | Mar 23, 2025 | GSM8KMath | CodeCode Available | 0 |
| A Survey on Mathematical Reasoning and Optimization with Large Language Models | Mar 22, 2025 | Automated Theorem ProvingHeuristic Search | CodeCode Available | 0 |
| Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't | Mar 20, 2025 | Mathematical ReasoningReinforcement Learning (RL) | CodeCode Available | 3 |
| MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion | Mar 20, 2025 | Data AugmentationMathematical Problem-Solving | CodeCode Available | 1 |
| MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer | Mar 19, 2025 | Answer GenerationMathematical Reasoning | CodeCode Available | 1 |
| Temporal Consistency for LLM Reasoning Process Error Identification | Mar 18, 2025 | Mathematical Reasoning | CodeCode Available | 0 |
| Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation | Mar 17, 2025 | Mathematical ReasoningReinforcement Learning (RL) | CodeCode Available | 1 |
| Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning | Mar 17, 2025 | Mathematical ReasoningMultimodal Reasoning | —Unverified | 0 |
| Pensez: Less Data, Better Reasoning -- Rethinking French LLM | Mar 17, 2025 | Large Language ModelMath | —Unverified | 0 |
| Reliable and Efficient Amortized Model-based Evaluation | Mar 17, 2025 | DiagnosticMathematical Reasoning | —Unverified | 0 |
| Evaluating Mathematical Reasoning Across Large Language Models: A Fine-Grained Approach | Mar 13, 2025 | Formal LogicMathematical Reasoning | —Unverified | 0 |
| VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models | Mar 10, 2025 | Binary ClassificationHallucination | CodeCode Available | 0 |
| Implicit Reasoning in Transformers is Reasoning through Shortcuts | Mar 10, 2025 | Mathematical Reasoning | CodeCode Available | 1 |
| RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs | Mar 8, 2025 | Instruction FollowingMathematical Reasoning | CodeCode Available | 2 |
| Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models? | Mar 8, 2025 | Mathematical ReasoningMultimodal Reasoning | CodeCode Available | 2 |
| Speculative Decoding for Multi-Sample Inference | Mar 7, 2025 | Mathematical Reasoning | —Unverified | 0 |