| A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can Language Models Rival Mathematics Students? Evaluating Mathematical Reasoning through Textual Manipulation and Human Experiments | Dec 16, 2024 | Mathematical Reasoning | —Unverified | 0 |
| CoinMath: Harnessing the Power of Coding Instruction for Math LLMs | Dec 16, 2024 | DescriptiveMath | CodeCode Available | 0 |
| Entropy-Regularized Process Reward Model | Dec 15, 2024 | GSM8KMath | CodeCode Available | 1 |
| Low-Rank Adaptation with Task-Relevant Feature Enhancement for Fine-tuning Language Models | Dec 13, 2024 | Mathematical Reasoning | —Unverified | 0 |
| A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions | Dec 12, 2024 | GSM8KKnowledge Graphs | —Unverified | 0 |
| Sail into the Headwind: Alignment via Robust Rewards and Dynamic Labels against Reward Hacking | Dec 12, 2024 | Mathematical Reasoning | —Unverified | 0 |
| SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs | Dec 11, 2024 | ARCGSM8K | —Unverified | 0 |
| Optimizing Alignment with Less: Leveraging Data Augmentation for Personalized Evaluation | Dec 10, 2024 | Data AugmentationMathematical Reasoning | —Unverified | 0 |
| Applications of Positive Unlabeled (PU) and Negative Unlabeled (NU) Learning in Cybersecurity | Dec 9, 2024 | Intrusion DetectionMalware Detection | —Unverified | 0 |