A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving Jan 7, 2025 Diversity Knowledge Distillation
Code Code Available 0Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval Nov 25, 2024 Math Math Word Problem Solving
— Unverified 0SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation Oct 17, 2024 GSM8K Language Modeling
Code Code Available 0When Not to Answer: Evaluating Prompts on GPT Models for Effective Abstention in Unanswerable Math Word Problems Oct 16, 2024 Hallucination Math
— Unverified 0Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models Oct 10, 2024 Arithmetic Reasoning Math
Code Code Available 0OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data Oct 2, 2024 Arithmetic Reasoning Large Language Model
Code Code Available 4Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement Sep 18, 2024 GSM8K Math
— Unverified 0Qwen2 Technical Report Jul 15, 2024 Arithmetic Reasoning GSM8K
Code Code Available 13Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs Jun 26, 2024 Arithmetic Reasoning GSM8K
Code Code Available 3DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving Jun 18, 2024 Arithmetic Reasoning Math
Code Code Available 2Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment Jun 17, 2024 Logical Reasoning Math
— Unverified 0AlphaMath Almost Zero: Process Supervision without Process May 6, 2024 Mathematical Reasoning Math Word Problem Solving
Code Code Available 3Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems Apr 23, 2024 Arithmetic Reasoning GSM8K
Code Code Available 1Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing Apr 18, 2024 Arithmetic Reasoning GSM8K
Code Code Available 1MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems Apr 6, 2024 Logical Reasoning Math
Code Code Available 2Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving Apr 5, 2024 Data Augmentation In-Context Learning
— Unverified 0Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM Mar 12, 2024 Arithmetic Reasoning Code Generation
Code Code Available 0Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Mar 8, 2024 1 Image, 2*2 Stitching Code Generation
Code Code Available 3Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning Mar 4, 2024 GSM8K Math
— Unverified 0GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers Feb 29, 2024 GSM8K Math
Code Code Available 2An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning Feb 23, 2024 Arithmetic Reasoning Automated Theorem Proving
Code Code Available 2OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset Feb 15, 2024 Arithmetic Reasoning GSM8K
Code Code Available 4DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Feb 5, 2024 Arithmetic Reasoning Math
Code Code Available 9Augmenting Math Word Problems via Iterative Question Composing Jan 17, 2024 Math Mathematical Reasoning
Code Code Available 1Mixtral of Experts Jan 8, 2024 Code Generation Common Sense Reasoning
Code Code Available 4Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks Jan 5, 2024 Arithmetic Reasoning Code Generation
Code Code Available 2Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations Dec 14, 2023 Arithmetic Reasoning GSM8K
Code Code Available 1Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning Dec 9, 2023 Arithmetic Reasoning Mathematical Reasoning
Code Code Available 0FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains Nov 16, 2023 Math Math Word Problem Solving
Code Code Available 1VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency Nov 13, 2023 Math Mathematical Reasoning
Code Code Available 0ATHENA: Mathematical Reasoning with Thought Expansion Nov 2, 2023 Math Mathematical Reasoning
Code Code Available 0An Expression Tree Decoding Strategy for Mathematical Equation Generation Oct 14, 2023 Math Mathematical Reasoning
Code Code Available 2Mistral 7B Oct 10, 2023 answerability prediction Arithmetic Reasoning
Code Code Available 6MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning Oct 9, 2023 Arithmetic Reasoning Data Augmentation
Code Code Available 2MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning Oct 5, 2023 Arithmetic Reasoning GSM8K
Code Code Available 2ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving Sep 29, 2023 Arithmetic Reasoning Computational Efficiency
Code Code Available 3MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models Sep 21, 2023 Arithmetic Reasoning GSM8K
Code Code Available 2OpenChat: Advancing Open-source Language Models with Mixed-Quality Data Sep 20, 2023 Arithmetic Reasoning Code Generation
Code Code Available 0WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct Aug 18, 2023 Arithmetic Reasoning GSM8K
Code Code Available 5Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification Aug 15, 2023 Arithmetic Reasoning Math
Code Code Available 2Cumulative Reasoning with Large Language Models Aug 8, 2023 Decision Making Logical Reasoning
Code Code Available 2Do Multilingual Language Models Think Better in English? Aug 2, 2023 Common Sense Reasoning Cross-Lingual Natural Language Inference
Code Code Available 1Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models Aug 1, 2023 In-Context Learning Math
— Unverified 0Llama 2: Open Foundation and Fine-Tuned Chat Models Jul 18, 2023 Arithmetic Reasoning
Code Code Available 8CMATH: Can Your Language Model Pass Chinese Elementary School Math Test? Jun 29, 2023 Language Modeling Language Modelling
— Unverified 0Math Word Problem Solving by Generating Linguistic Variants of Problem Statements Jun 24, 2023 Decoder Ingenuity
Code Code Available 0Does ChatGPT Comprehend the Place Value in Numbers When Solving Math Word Problems? Jun 3, 2023 Math Math Word Problem Solving
Code Code Available 0MathChat: Converse to Tackle Challenging Math Problems with LLM Agents Jun 2, 2023 Elementary Mathematics Math
Code Code Available 1Learning Multi-Step Reasoning by Solving Arithmetic Tasks Jun 2, 2023 Math Mathematical Reasoning
Code Code Available 1Let's Verify Step by Step May 31, 2023 Active Learning Math
Code Code Available 4