| LLM Performance for Code Generation on Noisy Tasks | May 29, 2025 | BenchmarkingCode Generation | CodeCode Available | 0 |
| Adversarial Math Word Problem Generation | Feb 27, 2024 | Math | CodeCode Available | 0 |
| Generalizing Math Word Problem Solvers via Solution Diversification | Dec 1, 2022 | Math | CodeCode Available | 0 |
| Can LLMs Master Math? Investigating Large Language Models on Math Stack Exchange | Mar 30, 2024 | MathMathematical Problem-Solving | CodeCode Available | 0 |
| An Exploration of Self-Supervised Mutual Information Alignment for Multi-Task Settings | Oct 2, 2024 | 8kMath | CodeCode Available | 0 |
| AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need | Jun 18, 2025 | GSM8KHumanEval | CodeCode Available | 0 |
| X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs | May 22, 2025 | ChatbotMath | CodeCode Available | 0 |
| Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective | Feb 20, 2025 | GSM8KMath | CodeCode Available | 0 |
| An Edge-Enhanced Hierarchical Graph-to-Tree Network for Math Word Problem Solving | Nov 1, 2021 | DecoderMath | CodeCode Available | 0 |
| Towards Effective and Efficient Continual Pre-training of Large Language Models | Jul 26, 2024 | Math | CodeCode Available | 0 |