| FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains | Nov 16, 2023 | MathMath Word Problem Solving | CodeCode Available | 1 |
| StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving | Nov 15, 2023 | Math | CodeCode Available | 1 |
| Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration | Nov 14, 2023 | DiversityMath | CodeCode Available | 1 |
| First-Step Advantage: Importance of Starting Right in Multi-Step Math Reasoning | Nov 14, 2023 | GSM8KMath | —Unverified | 0 |
| SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training with Adversarial Remarks | Nov 14, 2023 | GSM8KMath | —Unverified | 0 |
| VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency | Nov 13, 2023 | MathMathematical Reasoning | CodeCode Available | 0 |
| Large Language Models' Understanding of Math: Source Criticism and Extrapolation | Nov 12, 2023 | Automated Theorem ProvingMath | —Unverified | 0 |
| Let's Reinforce Step by Step | Nov 10, 2023 | GSM8KLogical Reasoning | —Unverified | 0 |
| Conic10K: A Challenging Math Problem Understanding and Reasoning Dataset | Nov 9, 2023 | MathNatural Language Understanding | CodeCode Available | 1 |
| Agent Lumos: Unified and Modular Training for Open-Source Language Agents | Nov 9, 2023 | MathQuestion Answering | CodeCode Available | 2 |