| Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations | Oct 31, 2023 | GSM8KMath | CodeCode Available | 1 |
| Learning From Mistakes Makes LLM Better Reasoner | Oct 31, 2023 | GSM8KMath | CodeCode Available | 1 |
| An Early Evaluation of GPT-4V(ision) | Oct 25, 2023 | Math | CodeCode Available | 1 |
| Expression Syntax Information Bottleneck for Math Word Problems | Oct 24, 2023 | Math | CodeCode Available | 1 |
| Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts | Oct 23, 2023 | Logical ReasoningMath | CodeCode Available | 1 |
| Teaching Language Models to Self-Improve through Interactive Demonstrations | Oct 20, 2023 | Math | CodeCode Available | 1 |
| Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes | Oct 16, 2023 | Decision MakingMath | CodeCode Available | 1 |
| Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding | Oct 10, 2023 | Mathvalid | CodeCode Available | 1 |
| Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human Preference | Oct 4, 2023 | MathQuestion Answering | CodeCode Available | 1 |
| A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration | Oct 3, 2023 | Arithmetic ReasoningCode Generation | CodeCode Available | 1 |