| Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions | May 28, 2022 | Arithmetic Reasoning, Efficient Exploration | Code Available | 1 | 5 |
| Learning to Reason for Text Generation from Scientific Tables | Apr 16, 2021 | Arithmetic Reasoning, Articles | Code Available | 1 | 5 |
| LEVER: Learning to Verify Language-to-Code Generation with Execution | Feb 16, 2023 | Arithmetic Reasoning, Code Generation | Code Available | 1 | 5 |
| MathPrompter: Mathematical Reasoning using Large Language Models | Mar 4, 2023 | Arithmetic Reasoning, Math | Code Available | 1 | 5 |
| Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations | Dec 14, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 1 | 5 |
| MoT: Memory-of-Thought Enables ChatGPT to Self-Improve | May 9, 2023 | Arithmetic Reasoning, Natural Language Inference | Code Available | 1 | 5 |
| Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks | Apr 4, 2023 | Arithmetic Reasoning, Language Modelling | Code Available | 1 | 5 |
| Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs | Nov 16, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 1 | 5 |
| Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL | Sep 13, 2023 | Arithmetic Reasoning, Navigate | Code Available | 1 | 5 |
| OpenCQA: Open-ended Question Answering with Charts | Oct 12, 2022 | Arithmetic Reasoning, Descriptive | Code Available | 1 | 5 |
| DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models | Oct 8, 2023 | Arithmetic Reasoning | Code Available | 1 | 5 |
| OVM, Outcome-supervised Value Models for Planning in Mathematical Reasoning | Nov 16, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 1 | 5 |
| Mathematical Reasoning for Unmanned Aerial Vehicles: A RAG-Based Approach for Complex Arithmetic Reasoning | Jun 5, 2025 | Arithmetic Reasoning, Math | Code Available | 0 | 5 |
| OpenChat: Advancing Open-source Language Models with Mixed-Quality Data | Sep 20, 2023 | Arithmetic Reasoning, Code Generation | Code Available | 0 | 5 |
| PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning | May 23, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 0 | 5 |
| Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning | Nov 4, 2024 | Arithmetic Reasoning, Decoder | Code Available | 0 | 5 |
| Self-training Language Models for Arithmetic Reasoning | Jul 11, 2024 | Arithmetic Reasoning | Code Available | 0 | 5 |
| CodeT5+: Open Code Large Language Models for Code Understanding and Generation | May 13, 2023 | Arithmetic Reasoning, Code Completion | Code Available | 0 | 5 |
| Do Deep Neural Networks Capture Compositionality in Arithmetic Reasoning? | Feb 15, 2023 | Arithmetic Reasoning | Code Available | 0 | 5 |
| Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models | Jun 6, 2023 | Arithmetic Reasoning, In-Context Learning | Code Available | 0 | 5 |
| Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems | May 24, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 0 | 5 |
| DiaBlo: Diagonal Blocks Are Sufficient For Finetuning | Jun 3, 2025 | Arithmetic Reasoning, Code Generation | Code Available | 0 | 5 |
| DS@GT at CheckThat! 2025: Evaluating Context and Tokenization Strategies for Numerical Fact Verification | Jul 8, 2025 | ARC, Arithmetic Reasoning | Code Available | 0 | 5 |
| ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions | Dec 4, 2023 | Arithmetic Reasoning, Math | Code Available | 0 | 5 |
| SBoRA: Low-Rank Adaptation with Regional Weight Updates | Jul 7, 2024 | Arithmetic Reasoning, parameter-efficient fine-tuning | Code Available | 0 | 5 |