| O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning | Jan 22, 2025 | Mathematical Reasoning | Code Available | 2 |
| DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning | Jan 22, 2025 | Mathematical Reasoning, Multi-task Language Understanding | Code Available | 15 |
| CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning | Jan 21, 2025 | Clustering, Mathematical Reasoning | Unverified | 0 |
| InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model | Jan 21, 2025 | Instruction Following, Mathematical Reasoning | Unverified | 0 |
| Benchmarking Large Language Models via Random Variables | Jan 20, 2025 | Benchmarking, Mathematical Reasoning | Unverified | 0 |
| Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective | Jan 19, 2025 | Automated Theorem Proving, Math | Unverified | 0 |
| Control LLM: Controlled Evolution for Intelligence Retention in LLM | Jan 19, 2025 | Math, Mathematical Reasoning | Code Available | 1 |
| Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback | Jan 18, 2025 | Math, Mathematical Reasoning | Unverified | 0 |
| The Lessons of Developing Process Reward Models in Mathematical Reasoning | Jan 13, 2025 | Mathematical Reasoning | Unverified | 0 |
| Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs | Jan 11, 2025 | Math, Mathematical Problem-Solving | Code Available | 1 |
| Search-o1: Agentic Search-Enhanced Large Reasoning Models | Jan 9, 2025 | Code Generation | Code Available | 5 |
| VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models | Jan 9, 2025 | Benchmarking, Mathematical Problem-Solving | Code Available | 1 |
| URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics | Jan 8, 2025 | Math, Mathematical Reasoning | Code Available | 2 |
| Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning | Jan 6, 2025 | Math, Mathematical Reasoning | Unverified | 0 |
| Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap | Jan 5, 2025 | Math, Mathematical Reasoning | Unverified | 0 |
| Table as Thought: Exploring Structured Thoughts in LLM Reasoning | Jan 4, 2025 | Mathematical Reasoning | Unverified | 0 |
| Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search | Jan 2, 2025 | Mathematical Reasoning | Unverified | 0 |
| Plug-and-Play Training Framework for Preference Optimization | Dec 30, 2024 | Mathematical Reasoning, Question Answering | Unverified | 0 |
| LLM2: Let Large Language Models Harness System 2 Reasoning | Dec 29, 2024 | GSM8K, Mathematical Reasoning | Code Available | 0 |
| Large Language Models for Mathematical Analysis | Dec 28, 2024 | Mathematical Problem-Solving, Mathematical Reasoning | Code Available | 0 |
| LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning | Dec 28, 2024 | Mathematical Reasoning | Unverified | 0 |
| Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English | Dec 24, 2024 | Mathematical Reasoning | Code Available | 0 |
| B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners | Dec 23, 2024 | Mathematical Reasoning | Code Available | 2 |
| Multi-Agent Sampling: Scaling Inference Compute for Data Synthesis with Tree Search-Based Agentic Collaboration | Dec 22, 2024 | Decision Making, Machine Translation | Code Available | 0 |
| System-2 Mathematical Reasoning via Enriched Instruction Tuning | Dec 22, 2024 | ERP, GSM8K | Unverified | 0 |