Qwen2 Technical Report Jul 15, 2024 Arithmetic Reasoning GSM8K
Code Code Available 13DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Feb 5, 2024 Arithmetic Reasoning Math
Code Code Available 9Llama 2: Open Foundation and Fine-Tuned Chat Models Jul 18, 2023 Arithmetic Reasoning
Code Code Available 8LLaMA: Open and Efficient Foundation Language Models Feb 27, 2023 Arithmetic Reasoning Code Generation
Code Code Available 7Sparks of Artificial General Intelligence: Early experiments with GPT-4 Mar 22, 2023 Arithmetic Reasoning Mathematical Reasoning
Code Code Available 6Mistral 7B Oct 10, 2023 answerability prediction Arithmetic Reasoning
Code Code Available 6WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct Aug 18, 2023 Arithmetic Reasoning GSM8K
Code Code Available 5OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data Oct 2, 2024 Arithmetic Reasoning Large Language Model
Code Code Available 4Mixtral of Experts Jan 8, 2024 Code Generation Common Sense Reasoning
Code Code Available 4Let's Verify Step by Step May 31, 2023 Active Learning Math
Code Code Available 4Galactica: A Large Language Model for Science Nov 16, 2022 Anachronisms Bias Detection
Code Code Available 4OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset Feb 15, 2024 Arithmetic Reasoning GSM8K
Code Code Available 4PAL: Program-aided Language Models Nov 18, 2022 Arithmetic Reasoning GSM8K
Code Code Available 3ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving Sep 29, 2023 Arithmetic Reasoning Computational Efficiency
Code Code Available 3AlphaMath Almost Zero: Process Supervision without Process May 6, 2024 Mathematical Reasoning Math Word Problem Solving
Code Code Available 3Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Mar 8, 2024 1 Image, 2*2 Stitching Code Generation
Code Code Available 3Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs Jun 26, 2024 Arithmetic Reasoning GSM8K
Code Code Available 3MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning Oct 5, 2023 Arithmetic Reasoning GSM8K
Code Code Available 2Progressive-Hint Prompting Improves Reasoning in Large Language Models Apr 19, 2023 Arithmetic Reasoning GSM8K
Code Code Available 2Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks Jan 5, 2024 Arithmetic Reasoning Code Generation
Code Code Available 2DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving Jun 18, 2024 Arithmetic Reasoning Math
Code Code Available 2MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning Oct 9, 2023 Arithmetic Reasoning Data Augmentation
Code Code Available 2Multi-View Reasoning: Consistent Contrastive Learning for Math Word Problem Oct 21, 2022 Contrastive Learning Math
Code Code Available 2An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning Feb 23, 2024 Arithmetic Reasoning Automated Theorem Proving
Code Code Available 2DeBERTa: Decoding-enhanced BERT with Disentangled Attention Jun 5, 2020 Common Sense Reasoning Coreference Resolution
Code Code Available 2An Expression Tree Decoding Strategy for Mathematical Equation Generation Oct 14, 2023 Math Mathematical Reasoning
Code Code Available 2MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models Sep 21, 2023 Arithmetic Reasoning GSM8K
Code Code Available 2Cumulative Reasoning with Large Language Models Aug 8, 2023 Decision Making Logical Reasoning
Code Code Available 2GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers Feb 29, 2024 GSM8K Math
Code Code Available 2Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification Aug 15, 2023 Arithmetic Reasoning Math
Code Code Available 2MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems Apr 6, 2024 Logical Reasoning Math
Code Code Available 2Measuring Mathematical Problem Solving With the MATH Dataset Mar 5, 2021 Math Mathematical Problem-Solving
Code Code Available 2Solving Quantitative Reasoning Problems with Language Models Jun 29, 2022 Arithmetic Reasoning Language Modeling
Code Code Available 2Large Language Models are Zero-Shot Reasoners May 24, 2022 Arithmetic Reasoning Common Sense Reasoning
Code Code Available 2ELASTIC: Numerical Reasoning with Adaptive Symbolic Compiler Oct 18, 2022 Math Word Problem Solving Question Answering
Code Code Available 1Do Multilingual Language Models Think Better in English? Aug 2, 2023 Common Sense Reasoning Cross-Lingual Natural Language Inference
Code Code Available 1Ape210K: A Large-Scale and Template-Rich Dataset of Math Word Problems Sep 24, 2020 Diversity Math
Code Code Available 1Recall and Learn: A Memory-augmented Solver for Math Word Problems Sep 27, 2021 Math Math Word Problem Solving
Code Code Available 1MathChat: Converse to Tackle Challenging Math Problems with LLM Agents Jun 2, 2023 Elementary Mathematics Math
Code Code Available 1RetICL: Sequential Retrieval of In-Context Examples with Reinforcement Learning May 23, 2023 In-Context Learning Language Modelling
Code Code Available 1FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains Nov 16, 2023 Math Math Word Problem Solving
Code Code Available 1IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning Oct 25, 2021 Arithmetic Reasoning Mathematical Question Answering
Code Code Available 1Automatic Model Selection with Large Language Models for Reasoning May 23, 2023 Arithmetic Reasoning GSM8K
Code Code Available 1MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving Jul 28, 2021 Common Sense Reasoning Language Modeling
Code Code Available 1Graph-to-Tree Neural Networks for Learning Structured Input-Output Translation with Applications to Semantic Parsing and Math Word Problem Apr 7, 2020 Decoder Machine Translation
Code Code Available 1Graph-to-Tree Learning for Solving Math Word Problems Jul 1, 2020 Decoder Math
Code Code Available 1Automatic Generation of Socratic Subquestions for Teaching Math Word Problems Nov 23, 2022 Math Math Word Problem Solving
Code Code Available 1Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems Apr 23, 2024 Arithmetic Reasoning GSM8K
Code Code Available 1Augmenting Math Word Problems via Iterative Question Composing Jan 17, 2024 Math Mathematical Reasoning
Code Code Available 1Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations Dec 14, 2023 Arithmetic Reasoning GSM8K
Code Code Available 1