| Think before you speak: Training Language Models With Pause Tokens | Oct 3, 2023 | Decoder, GSM8K | Unverified | 0 |
| Adapting LLM Agents with Universal Feedback in Communication | Oct 1, 2023 | Decision Making, GSM8K | Unverified | 0 |
| UPAR: A Kantian-Inspired Prompting Framework for Enhancing Large Language Model Capabilities | Sep 30, 2023 | Causal Judgment, GSM8K | Unverified | 0 |
| MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models | Sep 21, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 2 |
| Design of Chain-of-Thought in Math Problem Solving | Sep 20, 2023 | Diversity, GSM8K | Code Available | 1 |
| Baichuan 2: Open Large-scale Language Models | Sep 19, 2023 | Feature Engineering, GSM8K | Code Available | 4 |
| Contrastive Decoding Improves Reasoning in Large Language Models | Sep 17, 2023 | GSM8K, HellaSwag | Unverified | 0 |
| EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning | Sep 16, 2023 | Date Understanding, GSM8K | Code Available | 0 |
| Exploring an LM to generate Prolog Predicates from Mathematics Questions | Sep 7, 2023 | GSM8K, Language Modeling | Unverified | 0 |
| Large Language Models as Optimizers | Sep 7, 2023 | GSM8K | Code Available | 1 |