| Overcoming Barriers to Skill Injection in Language Modeling: Case Study in Arithmetic | Nov 3, 2022 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 0 | 5 |
| OMAC: A Broad Optimization Framework for LLM-Based Multi-Agent Collaboration | May 17, 2025 | Arithmetic ReasoningCode Generation | CodeCode Available | 0 | 5 |
| Learning Non-linguistic Skills without Sacrificing Linguistic Proficiency | May 14, 2023 | Arithmetic ReasoningMath | CodeCode Available | 0 | 5 |
| DCR: Quantifying Data Contamination in LLMs Evaluation | Jul 15, 2025 | Arithmetic ReasoningBenchmarking | CodeCode Available | 0 | 5 |
| Least-to-Most Prompting Enables Complex Reasoning in Large Language Models | May 21, 2022 | Arithmetic ReasoningMath | CodeCode Available | 0 | 5 |
| 3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability | Aug 28, 2024 | Arithmetic ReasoningGPU | CodeCode Available | 0 | 5 |
| Improving Arithmetic Reasoning Ability of Large Language Models through Relation Tuples, Verification and Dynamic Feedback | Jun 25, 2024 | Arithmetic ReasoningRelation | CodeCode Available | 0 | 5 |
| Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models | Jun 6, 2024 | Arithmetic ReasoningCode Generation | CodeCode Available | 0 | 5 |
| Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning | Dec 9, 2023 | Arithmetic ReasoningMathematical Reasoning | CodeCode Available | 0 | 5 |
| Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | Mar 12, 2024 | Arithmetic ReasoningCode Generation | CodeCode Available | 0 | 5 |
| LLM Augmented LLMs: Expanding Capabilities through Composition | Jan 4, 2024 | Arithmetic ReasoningCode Generation | CodeCode Available | 0 | 5 |
| Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models | Oct 10, 2024 | Arithmetic ReasoningMath | CodeCode Available | 0 | 5 |
| Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training | Feb 25, 2025 | Arithmetic ReasoningData Augmentation | —Unverified | 0 | 0 |
| Leveraging LLM Reasoning Enhances Personalized Recommender Systems | Jul 22, 2024 | Arithmetic ReasoningRecommendation Systems | —Unverified | 0 | 0 |
| Arithmetic Reasoning with LLM: Prolog Generation & Permutation | May 28, 2024 | Arithmetic ReasoningData Augmentation | —Unverified | 0 | 0 |
| Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering | Feb 17, 2024 | Arithmetic ReasoningMathematical Reasoning | —Unverified | 0 | 0 |
| Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning | Dec 14, 2023 | Arithmetic ReasoningFew-Shot Learning | —Unverified | 0 | 0 |
| Can LLMs Maintain Fundamental Abilities under KV Cache Compression? | Feb 4, 2025 | Arithmetic ReasoningCode Generation | —Unverified | 0 | 0 |
| CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization | Jan 30, 2025 | Arithmetic ReasoningText Generation | —Unverified | 0 | 0 |
| Code Prompting: a Neural Symbolic Method for Complex Reasoning in Large Language Models | May 29, 2023 | Arithmetic Reasoning | —Unverified | 0 | 0 |
| Composing Ensembles of Pre-trained Models via Iterative Consensus | Oct 20, 2022 | Arithmetic ReasoningImage Generation | —Unverified | 0 | 0 |
| DiversiGATE: A Comprehensive Framework for Reliable Large Language Models | Jun 22, 2023 | Arithmetic ReasoningGSM8K | —Unverified | 0 | 0 |
| DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models | Dec 30, 2024 | Arithmetic ReasoningQuantization | —Unverified | 0 | 0 |
| Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment | May 6, 2024 | Arithmetic ReasoningCode Generation | —Unverified | 0 | 0 |
| Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting | Jan 28, 2024 | Arithmetic ReasoningFact Checking | —Unverified | 0 | 0 |