| Assessing Robustness to Spurious Correlations in Post-Training Language Models | May 9, 2025 | Instruction FollowingMathematical Reasoning | —Unverified | 0 | 0 |
| Assessing the Emergent Symbolic Reasoning Abilities of Llama Large Language Models | Jun 5, 2024 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Assessing the Impact of Prompting Methods on ChatGPT's Mathematical Capabilities | Dec 22, 2023 | ChatbotGSM8K | —Unverified | 0 | 0 |
| A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| A Survey on Large Language Models for Mathematical Reasoning | Jun 10, 2025 | Answer GenerationMathematical Reasoning | —Unverified | 0 | 0 |
| A Symbolic Framework for Evaluating Mathematical Reasoning and Generalisation with Transformers | May 21, 2023 | Mathematical Reasoning | —Unverified | 0 | 0 |
| A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks | May 16, 2024 | Code GenerationDialogue Generation | —Unverified | 0 | 0 |
| A Systematic Survey on Large Language Models for Algorithm Design | Oct 11, 2024 | Mathematical Reasoningscientific discovery | —Unverified | 0 | 0 |
| A Technical Study into Small Reasoning Language Models | Jun 16, 2025 | Code GenerationComputational Efficiency | —Unverified | 0 | 0 |
| Augmenting In-Context-Learning in LLMs via Automatic Data Labeling and Refinement | Oct 14, 2024 | In-Context LearningMathematical Reasoning | —Unverified | 0 | 0 |