| Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data | Feb 24, 2023 | Arithmetic ReasoningLanguage Modelling | CodeCode Available | 1 | 5 |
| Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL | Sep 13, 2023 | Arithmetic ReasoningNavigate | CodeCode Available | 1 | 5 |
| FedEx-LoRA: Exact Aggregation for Federated and Efficient Fine-Tuning of Foundation Models | Oct 12, 2024 | Arithmetic ReasoningFederated Learning | CodeCode Available | 1 | 5 |
| Generative Parameter-Efficient Fine-Tuning | Dec 1, 2023 | Arithmetic ReasoningFine-Grained Image Classification | CodeCode Available | 1 | 5 |
| Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations | Dec 14, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 1 | 5 |
| Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems | Apr 23, 2024 | Arithmetic ReasoningGSM8K | CodeCode Available | 1 | 5 |
| HALO: Hierarchical Autonomous Logic-Oriented Orchestration for Multi-Agent LLM Systems | May 17, 2025 | Arithmetic ReasoningCode Generation | CodeCode Available | 1 | 5 |
| Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles | Jun 18, 2024 | Arithmetic ReasoningCode Generation | CodeCode Available | 1 | 5 |
| MathPrompter: Mathematical Reasoning using Large Language Models | Mar 4, 2023 | Arithmetic ReasoningMath | CodeCode Available | 1 | 5 |
| Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs | Jun 22, 2023 | Arithmetic ReasoningBenchmarking | CodeCode Available | 1 | 5 |
| Boosting Language Models Reasoning with Chain-of-Knowledge Prompting | Jun 10, 2023 | Arithmetic Reasoning | CodeCode Available | 1 | 5 |
| Empirical Study of Zero-Shot NER with ChatGPT | Oct 16, 2023 | Arithmetic Reasoningnamed-entity-recognition | CodeCode Available | 1 | 5 |
| A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration | Oct 3, 2023 | Arithmetic ReasoningCode Generation | CodeCode Available | 1 | 5 |
| Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics | Oct 28, 2024 | Arithmetic ReasoningMath | CodeCode Available | 1 | 5 |
| Bridging the Gap between Different Vocabularies for LLM Ensemble | Apr 15, 2024 | Arithmetic ReasoningData-to-Text Generation | CodeCode Available | 1 | 5 |
| DOMINO: A Dual-System for Multi-step Visual Language Reasoning | Oct 4, 2023 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 | 5 |
| Large Language Models Can Be Easily Distracted by Irrelevant Context | Jan 31, 2023 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 | 5 |
| Prompt Optimization via Adversarial In-Context Learning | Dec 5, 2023 | Arithmetic ReasoningData-to-Text Generation | CodeCode Available | 1 | 5 |
| Large Language Models are Better Reasoners with Self-Verification | Dec 19, 2022 | Arithmetic ReasoningCommon Sense Reasoning | CodeCode Available | 1 | 5 |
| Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation | Feb 21, 2024 | Arithmetic ReasoningGSM8K | CodeCode Available | 1 | 5 |
| MoT: Memory-of-Thought Enables ChatGPT to Self-Improve | May 9, 2023 | Arithmetic ReasoningNatural Language Inference | CodeCode Available | 1 | 5 |
| Language Imbalance Driven Rewarding for Multilingual Self-improving | Oct 11, 2024 | Arithmetic ReasoningInstruction Following | CodeCode Available | 1 | 5 |
| DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models | Oct 8, 2023 | Arithmetic Reasoning | CodeCode Available | 1 | 5 |
| Learning to Reason for Text Generation from Scientific Tables | Apr 16, 2021 | Arithmetic ReasoningArticles | CodeCode Available | 1 | 5 |
| Are Human-generated Demonstrations Necessary for In-context Learning? | Sep 26, 2023 | Arithmetic ReasoningCode Generation | CodeCode Available | 1 | 5 |