| Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning | Sep 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ControlMath: Controllable Data Generation Promotes Math Generalist Models | Sep 20, 2024 | Data AugmentationDiversity | —Unverified | 0 |
| InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning | Sep 19, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| GRIN: GRadient-INformed MoE | Sep 18, 2024 | HellaSwagHumanEval | —Unverified | 0 |
| Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement | Sep 18, 2024 | GSM8KMath | —Unverified | 0 |
| Reasoning Graph Enhanced Exemplars Retrieval for In-Context Learning | Sep 17, 2024 | Few-Shot LearningIn-Context Learning | CodeCode Available | 0 |
| NVLM: Open Frontier-Class Multimodal LLMs | Sep 17, 2024 | MathMultimodal Reasoning | —Unverified | 0 |
| GPT takes the SAT: Tracing changes in Test Difficulty and Math Performance of Students | Sep 16, 2024 | Math | —Unverified | 0 |
| Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia | Sep 13, 2024 | MathMultiple-choice | —Unverified | 0 |
| CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks | Sep 13, 2024 | ARCCode Generation | —Unverified | 0 |