Qwen2 Technical Report | Jul 15, 2024 | Arithmetic Reasoning; GSM8K | Code Available 135
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models | Feb 5, 2024 | Arithmetic Reasoning; Math | Code Available 95
Llama 2: Open Foundation and Fine-Tuned Chat Models | Jul 18, 2023 | Arithmetic Reasoning | Code Available 85
LLaMA: Open and Efficient Foundation Language Models | Feb 27, 2023 | Arithmetic Reasoning; Code Generation | Code Available 75
Mistral 7B | Oct 10, 2023 | answerability prediction; Arithmetic Reasoning | Code Available 65
Sparks of Artificial General Intelligence: Early experiments with GPT-4 | Mar 22, 2023 | Arithmetic Reasoning; Mathematical Reasoning | Code Available 65
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct | Aug 18, 2023 | Arithmetic Reasoning; GSM8K | Code Available 55
Let's Verify Step by Step | May 31, 2023 | Active Learning; Math | Code Available 45
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset | Feb 15, 2024 | Arithmetic Reasoning; GSM8K | Code Available 45
Galactica: A Large Language Model for Science | Nov 16, 2022 | Anachronisms; Bias Detection | Code Available 45
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data | Oct 2, 2024 | Arithmetic Reasoning; Large Language Model | Code Available 45
Mixtral of Experts | Jan 8, 2024 | Code Generation; Common Sense Reasoning | Code Available 45
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving | Sep 29, 2023 | Arithmetic Reasoning; Computational Efficiency | Code Available 35
PAL: Program-aided Language Models | Nov 18, 2022 | Arithmetic Reasoning; GSM8K | Code Available 35
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs | Jun 26, 2024 | Arithmetic Reasoning; GSM8K | Code Available 35
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context | Mar 8, 2024 | 1 Image, 2*2 Stitching; Code Generation | Code Available 35
AlphaMath Almost Zero: Process Supervision without Process | May 6, 2024 | Mathematical Reasoning; Math Word Problem Solving | Code Available 35
Measuring Mathematical Problem Solving With the MATH Dataset | Mar 5, 2021 | Math; Mathematical Problem-Solving | Code Available 25
Solving Quantitative Reasoning Problems with Language Models | Jun 29, 2022 | Arithmetic Reasoning; Language Modeling | Code Available 25
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models | Sep 21, 2023 | Arithmetic Reasoning; GSM8K | Code Available 25
MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning | Oct 9, 2023 | Arithmetic Reasoning; Data Augmentation | Code Available 25
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Jan 5, 2024 | Arithmetic Reasoning; Code Generation | Code Available 25
Cumulative Reasoning with Large Language Models | Aug 8, 2023 | Decision Making; Logical Reasoning | Code Available 25
An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning | Feb 23, 2024 | Arithmetic Reasoning; Automated Theorem Proving | Code Available 25
DeBERTa: Decoding-enhanced BERT with Disentangled Attention | Jun 5, 2020 | Common Sense Reasoning; Coreference Resolution | Code Available 25
An Expression Tree Decoding Strategy for Mathematical Equation Generation | Oct 14, 2023 | Math; Mathematical Reasoning | Code Available 25
DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving | Jun 18, 2024 | Arithmetic Reasoning; Math | Code Available 25
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers | Feb 29, 2024 | GSM8K; Math | Code Available 25
Large Language Models are Zero-Shot Reasoners | May 24, 2022 | Arithmetic Reasoning; Common Sense Reasoning | Code Available 25
Multi-View Reasoning: Consistent Contrastive Learning for Math Word Problem | Oct 21, 2022 | Contrastive Learning; Math | Code Available 25
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning | Oct 5, 2023 | Arithmetic Reasoning; GSM8K | Code Available 25
Progressive-Hint Prompting Improves Reasoning in Large Language Models | Apr 19, 2023 | Arithmetic Reasoning; GSM8K | Code Available 25
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification | Aug 15, 2023 | Arithmetic Reasoning; Math | Code Available 25
MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems | Apr 6, 2024 | Logical Reasoning; Math | Code Available 25
ELASTIC: Numerical Reasoning with Adaptive Symbolic Compiler | Oct 18, 2022 | Math Word Problem Solving; Question Answering | Code Available 15
Do Multilingual Language Models Think Better in English? | Aug 2, 2023 | Common Sense Reasoning; Cross-Lingual Natural Language Inference | Code Available 15
Ape210K: A Large-Scale and Template-Rich Dataset of Math Word Problems | Sep 24, 2020 | Diversity; Math | Code Available 15
Recall and Learn: A Memory-augmented Solver for Math Word Problems | Sep 27, 2021 | Math; Math Word Problem Solving | Code Available 15
MathChat: Converse to Tackle Challenging Math Problems with LLM Agents | Jun 2, 2023 | Elementary Mathematics; Math | Code Available 15
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations | Dec 14, 2023 | Arithmetic Reasoning; GSM8K | Code Available 15
Math Word Problem Solving with Explicit Numerical Values | Aug 1, 2021 | Math; Math Word Problem Solving | Code Available 15
RetICL: Sequential Retrieval of In-Context Examples with Reinforcement Learning | May 23, 2023 | In-Context Learning; Language Modelling | Code Available 15
FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains | Nov 16, 2023 | Math; Math Word Problem Solving | Code Available 15
IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning | Oct 25, 2021 | Arithmetic Reasoning; Mathematical Question Answering | Code Available 15
MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers | Sep 2, 2021 | Math; Math Word Problem Solving | Code Available 15
Automatic Model Selection with Large Language Models for Reasoning | May 23, 2023 | Arithmetic Reasoning; GSM8K | Code Available 15
Graph-to-Tree Neural Networks for Learning Structured Input-Output Translation with Applications to Semantic Parsing and Math Word Problem | Apr 7, 2020 | Decoder; Machine Translation | Code Available 15
Graph-to-Tree Learning for Solving Math Word Problems | Jul 1, 2020 | Decoder; Math | Code Available 15
Automatic Generation of Socratic Subquestions for Teaching Math Word Problems | Nov 23, 2022 | Math; Math Word Problem Solving | Code Available 15
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems | Apr 23, 2024 | Arithmetic Reasoning; GSM8K | Code Available 15