| Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models | Jun 6, 2023 | Arithmetic Reasoning, In-Context Learning | Code Available | 0 |
| DiaBlo: Diagonal Blocks Are Sufficient For Finetuning | Jun 3, 2025 | Arithmetic Reasoning, Code Generation | Code Available | 0 |
| PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning | May 23, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 0 |
| DCR: Quantifying Data Contamination in LLMs Evaluation | Jul 15, 2025 | Arithmetic Reasoning, Benchmarking | Code Available | 0 |
| Overcoming Barriers to Skill Injection in Language Modeling: Case Study in Arithmetic | Nov 3, 2022 | Arithmetic Reasoning, Language Modeling | Code Available | 0 |
| SBoRA: Low-Rank Adaptation with Regional Weight Updates | Jul 7, 2024 | Arithmetic Reasoning, parameter-efficient fine-tuning | Code Available | 0 |
| OpenChat: Advancing Open-source Language Models with Mixed-Quality Data | Sep 20, 2023 | Arithmetic Reasoning, Code Generation | Code Available | 0 |
| OMAC: A Broad Optimization Framework for LLM-Based Multi-Agent Collaboration | May 17, 2025 | Arithmetic Reasoning, Code Generation | Code Available | 0 |
| Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems | May 24, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 0 |
| Self-training Language Models for Arithmetic Reasoning | Jul 11, 2024 | Arithmetic Reasoning | Code Available | 0 |
| 3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability | Aug 28, 2024 | Arithmetic Reasoning, GPU | Code Available | 0 |
| Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning | Nov 4, 2024 | Arithmetic Reasoning, Decoder | Code Available | 0 |
| CodeT5+: Open Code Large Language Models for Code Understanding and Generation | May 13, 2023 | Arithmetic Reasoning, Code Completion | Code Available | 0 |
| ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions | Dec 4, 2023 | Arithmetic Reasoning, Math | Code Available | 0 |
| Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models | Jun 6, 2024 | Arithmetic Reasoning, Code Generation | Code Available | 0 |
| Mathematical Reasoning for Unmanned Aerial Vehicles: A RAG-Based Approach for Complex Arithmetic Reasoning | Jun 5, 2025 | Arithmetic Reasoning, Math | Code Available | 0 |
| Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | Mar 12, 2024 | Arithmetic Reasoning, Code Generation | Code Available | 0 |
| LLM Augmented LLMs: Expanding Capabilities through Composition | Jan 4, 2024 | Arithmetic Reasoning, Code Generation | Code Available | 0 |
| Least-to-Most Prompting Enables Complex Reasoning in Large Language Models | May 21, 2022 | Arithmetic Reasoning, Math | Code Available | 0 |
| Learning Non-linguistic Skills without Sacrificing Linguistic Proficiency | May 14, 2023 | Arithmetic Reasoning, Math | Code Available | 0 |
| Improving Arithmetic Reasoning Ability of Large Language Models through Relation Tuples, Verification and Dynamic Feedback | Jun 25, 2024 | Arithmetic Reasoning, Relation | Code Available | 0 |
| Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning | Dec 9, 2023 | Arithmetic Reasoning, Mathematical Reasoning | Code Available | 0 |
| DS@GT at CheckThat! 2025: Evaluating Context and Tokenization Strategies for Numerical Fact Verification | Jul 8, 2025 | ARC, Arithmetic Reasoning | Code Available | 0 |
| Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models | Oct 10, 2024 | Arithmetic Reasoning, Math | Code Available | 0 |
| Do Deep Neural Networks Capture Compositionality in Arithmetic Reasoning? | Feb 15, 2023 | Arithmetic Reasoning | Code Available | 0 |