| Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning | Feb 21, 2025 | Arithmetic Reasoning | Code Available | 1 | 5 |
| Solving Math Word Problems via Cooperative Reasoning induced Language Models | Oct 28, 2022 | Arithmetic Reasoning; Math | Code Available | 1 | 5 |
| Evaluating LLMs' Mathematical and Coding Competency through Ontology-guided Interventions | Jan 17, 2024 | Arithmetic Reasoning; Code Generation | Code Available | 1 | 5 |
| Bridging the Gap between Different Vocabularies for LLM Ensemble | Apr 15, 2024 | Arithmetic Reasoning; Data-to-Text Generation | Code Available | 1 | 5 |
| An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs | Jun 18, 2024 | Arithmetic Reasoning | Code Available | 1 | 5 |
| Gemini: A Family of Highly Capable Multimodal Models | Dec 19, 2023 | 1 Image, 2*2 Stitching; Arithmetic Reasoning | Code Available | 1 | 5 |
| Generative Parameter-Efficient Fine-Tuning | Dec 1, 2023 | Arithmetic Reasoning; Fine-Grained Image Classification | Code Available | 1 | 5 |
| Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs | Jun 22, 2023 | Arithmetic Reasoning; Benchmarking | Code Available | 1 | 5 |
| HALO: Hierarchical Autonomous Logic-Oriented Orchestration for Multi-Agent LLM Systems | May 17, 2025 | Arithmetic Reasoning; Code Generation | Code Available | 1 | 5 |
| Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles | Jun 18, 2024 | Arithmetic Reasoning; Code Generation | Code Available | 1 | 5 |
| Token-Scaled Logit Distillation for Ternary Weight Generative Language Models | Aug 13, 2023 | Arithmetic Reasoning; Common Sense Reasoning | Code Available | 1 | 5 |
| Toward Adaptive Reasoning in Large Language Models with Thought Rollback | Jul 21, 2024 | Arithmetic Reasoning; Math | Code Available | 1 | 5 |
| Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing | Apr 18, 2024 | Arithmetic Reasoning; GSM8K | Code Available | 1 | 5 |
| Are Human-generated Demonstrations Necessary for In-context Learning? | Sep 26, 2023 | Arithmetic Reasoning; Code Generation | Code Available | 1 | 5 |
| IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning | Oct 25, 2021 | Arithmetic Reasoning; Mathematical Question Answering | Code Available | 1 | 5 |
| Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data | Dec 20, 2023 | Arithmetic Reasoning | Code Available | 1 | 5 |
| A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis | May 24, 2023 | Arithmetic Reasoning; Mathematical Reasoning | Code Available | 1 | 5 |
| UL2: Unifying Language Learning Paradigms | May 10, 2022 | Arithmetic Reasoning; Common Sense Reasoning | Code Available | 1 | 5 |
| Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning | May 10, 2021 | Arithmetic Reasoning; Geometry Problem Solving | Code Available | 1 | 5 |
| Is the Reversal Curse a Binding Problem? Uncovering Limitations of Transformers from a Basic Generalization Failure | Apr 2, 2025 | Arithmetic Reasoning; Data Augmentation | Code Available | 1 | 5 |
| Language Imbalance Driven Rewarding for Multilingual Self-improving | Oct 11, 2024 | Arithmetic Reasoning; Instruction Following | Code Available | 1 | 5 |
| Large Language Models Can Be Easily Distracted by Irrelevant Context | Jan 31, 2023 | Arithmetic Reasoning; Language Modeling | Code Available | 1 | 5 |
| Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics | Oct 28, 2024 | Arithmetic Reasoning; Math | Code Available | 1 | 5 |
| Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions | May 28, 2022 | Arithmetic Reasoning; Efficient Exploration | Code Available | 1 | 5 |
| Learning to Reason for Text Generation from Scientific Tables | Apr 16, 2021 | Arithmetic Reasoning; Articles | Code Available | 1 | 5 |
| LEVER: Learning to Verify Language-to-Code Generation with Execution | Feb 16, 2023 | Arithmetic Reasoning; Code Generation | Code Available | 1 | 5 |
| MathPrompter: Mathematical Reasoning using Large Language Models | Mar 4, 2023 | Arithmetic Reasoning; Math | Code Available | 1 | 5 |
| Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations | Dec 14, 2023 | Arithmetic Reasoning; GSM8K | Code Available | 1 | 5 |
| MoT: Memory-of-Thought Enables ChatGPT to Self-Improve | May 9, 2023 | Arithmetic Reasoning; Natural Language Inference | Code Available | 1 | 5 |
| Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks | Apr 4, 2023 | Arithmetic Reasoning; Language Modelling | Code Available | 1 | 5 |
| Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs | Nov 16, 2023 | Arithmetic Reasoning; GSM8K | Code Available | 1 | 5 |
| Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting | May 11, 2023 | All; Arithmetic Reasoning | Code Available | 1 | 5 |
| Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL | Sep 13, 2023 | Arithmetic Reasoning; Navigate | Code Available | 1 | 5 |
| DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models | Oct 8, 2023 | Arithmetic Reasoning | Code Available | 1 | 5 |
| Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation | Feb 21, 2024 | Arithmetic Reasoning; GSM8K | Code Available | 1 | 5 |
| Automatic Model Selection with Large Language Models for Reasoning | May 23, 2023 | Arithmetic Reasoning; GSM8K | Code Available | 1 | 5 |
| OpenCQA: Open-ended Question Answering with Charts | Oct 12, 2022 | Arithmetic Reasoning; Descriptive | Code Available | 1 | 5 |
| Mathematical Reasoning for Unmanned Aerial Vehicles: A RAG-Based Approach for Complex Arithmetic Reasoning | Jun 5, 2025 | Arithmetic Reasoning; Math | Code Available | 0 | 5 |
| PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning | May 23, 2023 | Arithmetic Reasoning; GSM8K | Code Available | 0 | 5 |
| DiaBlo: Diagonal Blocks Are Sufficient For Finetuning | Jun 3, 2025 | Arithmetic Reasoning; Code Generation | Code Available | 0 | 5 |
| SBoRA: Low-Rank Adaptation with Regional Weight Updates | Jul 7, 2024 | Arithmetic Reasoning; parameter-efficient fine-tuning | Code Available | 0 | 5 |
| Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models | Jun 6, 2023 | Arithmetic Reasoning; In-Context Learning | Code Available | 0 | 5 |
| Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems | May 24, 2023 | Arithmetic Reasoning; GSM8K | Code Available | 0 | 5 |
| DCR: Quantifying Data Contamination in LLMs Evaluation | Jul 15, 2025 | Arithmetic Reasoning; Benchmarking | Code Available | 0 | 5 |
| Overcoming Barriers to Skill Injection in Language Modeling: Case Study in Arithmetic | Nov 3, 2022 | Arithmetic Reasoning; Language Modeling | Code Available | 0 | 5 |
| CodeT5+: Open Code Large Language Models for Code Understanding and Generation | May 13, 2023 | Arithmetic Reasoning; Code Completion | Code Available | 0 | 5 |
| 3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability | Aug 28, 2024 | Arithmetic Reasoning; GPU | Code Available | 0 | 5 |
| OMAC: A Broad Optimization Framework for LLM-Based Multi-Agent Collaboration | May 17, 2025 | Arithmetic Reasoning; Code Generation | Code Available | 0 | 5 |
| Learning Non-linguistic Skills without Sacrificing Linguistic Proficiency | May 14, 2023 | Arithmetic Reasoning; Math | Code Available | 0 | 5 |
| DS@GT at CheckThat! 2025: Evaluating Context and Tokenization Strategies for Numerical Fact Verification | Jul 8, 2025 | ARC; Arithmetic Reasoning | Code Available | 0 | 5 |