| An Early Evaluation of GPT-4V(ision) | Oct 25, 2023 | Math | CodeCode Available | 1 |
| Expression Syntax Information Bottleneck for Math Word Problems | Oct 24, 2023 | Math | CodeCode Available | 1 |
| Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts | Oct 23, 2023 | Logical ReasoningMath | CodeCode Available | 1 |
| We are Who We Cite: Bridges of Influence Between Natural Language Processing and Other Academic Fields | Oct 23, 2023 | DiversityMath | CodeCode Available | 0 |
| Teaching Language Models to Self-Improve through Interactive Demonstrations | Oct 20, 2023 | Math | CodeCode Available | 1 |
| SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving | Oct 19, 2023 | GSM8KMath | CodeCode Available | 0 |
| Llemma: An Open Language Model For Mathematics | Oct 16, 2023 | Arithmetic ReasoningAutomated Theorem Proving | CodeCode Available | 3 |
| Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes | Oct 16, 2023 | Decision MakingMath | CodeCode Available | 1 |
| Let's reward step by step: Step-Level reward model as the Navigators for Reasoning | Oct 16, 2023 | Code GenerationGSM8K | —Unverified | 0 |
| Improving Large Language Model Fine-tuning for Solving Math Problems | Oct 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Solving Math Word Problems with Reexamination | Oct 14, 2023 | DescriptiveMath | CodeCode Available | 0 |
| An Expression Tree Decoding Strategy for Mathematical Equation Generation | Oct 14, 2023 | MathMathematical Reasoning | CodeCode Available | 2 |
| The Search-and-Mix Paradigm in Approximate Nash Equilibrium Algorithms | Oct 12, 2023 | Math | —Unverified | 0 |
| LLMs as Potential Brainstorming Partners for Math and Science Problems | Oct 10, 2023 | Math | —Unverified | 0 |
| Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding | Oct 10, 2023 | Mathvalid | CodeCode Available | 1 |
| Mistral 7B | Oct 10, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning | Oct 9, 2023 | Arithmetic ReasoningData Augmentation | CodeCode Available | 2 |
| How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition | Oct 9, 2023 | Code GenerationInstruction Following | CodeCode Available | 3 |
| Guiding Language Model Reasoning with Planning Tokens | Oct 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Resprompt: Residual Connection Prompting Advances Multi-Step Reasoning in Large Language Models | Oct 7, 2023 | Math | —Unverified | 0 |
| Critique Ability of Large Language Models | Oct 7, 2023 | Code CompletionDecision Making | —Unverified | 0 |
| Analysis of the Reasoning with Redundant Information Provided Ability of Large Language Models | Oct 6, 2023 | 8kMath | —Unverified | 0 |
| Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models | Oct 6, 2023 | Code GenerationDecision Making | CodeCode Available | 2 |
| DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines | Oct 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning | Oct 5, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 2 |