SOTAVerified

Math Word Problem Solving

A math word problem is a mathematical exercise (such as in a textbook, worksheet, or exam) where significant background information on the problem is presented in ordinary language rather than in mathematical notation. As most word problems involve a narrative of some sort, they are sometimes referred to as story problems and may vary in the amount of technical language used.

Papers

Showing 51–100 of 107 papers

| Title | Status | Hype |
|---|---|---|
| Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations | Code | 1 |
| Math Word Problem Solving with Explicit Numerical Values | Code | 1 |
| MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving | Code | 1 |
| Automatic Model Selection with Large Language Models for Reasoning | Code | 1 |
| MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers | Code | 1 |
| Semantically-Aligned Universal Tree-Structured Solver for Math Word Problems | Code | 1 |
| Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems | Code | 1 |
| LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning | Code | 0 |
| Reverse Operation based Data Augmentation for Solving Math Word Problems | Code | 0 |
| Does ChatGPT Comprehend the Place Value in Numbers When Solving Math Word Problems? | Code | 0 |
| A Goal-Driven Tree-Structured Neural Model for Math Word Problems | Code | 0 |
| Adversarial Examples for Evaluating Math Word Problem Solvers | Code | 0 |
| Math Word Problem Solving by Generating Linguistic Variants of Problem Statements | Code | 0 |
| Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models | Code | 0 |
| Improving Compositional Generalization in Math Word Problem Solving | Code | 0 |
| An Edge-Enhanced Hierarchical Graph-to-Tree Network for Math Word Problem Solving | Code | 0 |
| MAWPS: A Math Word Problem Repository | Code | 0 |
| Analysing Mathematical Reasoning Abilities of Neural Models | Code | 0 |
| A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving | Code | 0 |
| VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency | Code | 0 |
| SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation | Code | 0 |
| Translating a Math Word Problem to an Expression Tree | Code | 0 |
| ATHENA: Mathematical Reasoning with Thought Expansion | Code | 0 |
| Point to the Expression: Solving Algebraic Word Problems using the Expression-Pointer Transformer Model | Code | 0 |
| Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions | Code | 0 |
| Semantically-Aligned Equation Generation for Solving and Reasoning Math Word Problems | Code | 0 |
| Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning | Code | 0 |
| EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbers | Code | 0 |
| Learn to Solve Algebra Word Problems Using Quadratic Programming | | 0 |
| A Chinese Math Word Problem Solving System Based on Linguistic Theory and Non-statistical Approach | | 0 |
| A Knowledge-Aware Sequence-to-Tree Network for Math Word Problem Solving | | 0 |
| An Improved Coarse-to-Fine Method for Solving Generation Tasks | | 0 |
| Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | | 0 |
| CMATH: Can Your Language Model Pass Chinese Elementary School Math Test? | | 0 |
| Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving | | 0 |
| Deep Neural Solver for Math Word Problems | | 0 |
| Generate & Rank: A Multi-task Framework for Math Word Problems | | 0 |
| Generating Equation by Utilizing Operators : GEO model | | 0 |
| How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation | | 0 |
| Illinois Math Solver: Math Reasoning on the Web | | 0 |
| Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning | | 0 |
| Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval | | 0 |
| Learning Fine-Grained Expressions to Solve Math Word Problems | | 0 |
| Learning from Explicit and Implicit Supervision Jointly For Algebra Word Problems | | 0 |
| Learning to Automatically Solve Algebra Word Problems | | 0 |
| Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation | | 0 |
| MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms | | 0 |
| MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving | | 0 |
| Neural Math Word Problem Solver with Reinforcement Learning | | 0 |
| OpenChat: Advancing Open-source Language Models with Mixed-Quality Data | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Gemini 2.0 Flash Experimental | Accuracy | 89.7 | | Unverified |
| 2 | Qwen2.5-Math-72B-Instruct(TIR,Greedy) | Accuracy | 88.1 | | Unverified |
| 3 | GPT-4 Turbo (MACM, w/code, voting) | Accuracy | 87.92 | | Unverified |
| 4 | Qwen2.5-Math-72B-Instruct(COT,Greedy) | Accuracy | 85.9 | | Unverified |
| 5 | Qwen2.5-Math-7B-Instruct(TIR,Greedy) | Accuracy | 85.2 | | Unverified |
| 6 | GPT-4-code model (CSV, w/ code, SC, k=16) | Accuracy | 84.3 | | Unverified |
| 7 | Qwen2-Math-72B-Instruct(greedy) | Accuracy | 84 | | Unverified |
| 8 | Qwen2.5-Math-7B-Instruct(COT,Greedy) | Accuracy | 83.6 | | Unverified |
| 9 | Qwen2.5-Math-1.5B-Instruct(TIR,Greedy) | Accuracy | 79.9 | | Unverified |
| 10 | OpenMath2-Llama3.1-70B (majority@256) | Accuracy | 79.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 DUP | Accuracy | 94.2 | | Unverified |
| 2 | GPT-4 (Teaching-Inspired) | Execution Accuracy | 93.9 | | Unverified |
| 3 | GPT-4 (Model Selection) | Execution Accuracy | 93.7 | | Unverified |
| 4 | Qwen2(CoT + Code Interpreter) | Execution Accuracy | 92.3 | | Unverified |
| 5 | GPT-4 (PHP) | Execution Accuracy | 91.9 | | Unverified |
| 6 | OpenMath-CodeLlama-70B (w/ code) | Execution Accuracy | 87.8 | | Unverified |
| 7 | MathCoder-L-70B | Execution Accuracy | 84.9 | | Unverified |
| 8 | PoT_Eng (self-consistency @ 5) | Execution Accuracy | 83.7 | | Unverified |
| 9 | CoT_Eng (self-consistency @ 5) | Execution Accuracy | 82.5 | | Unverified |
| 10 | MMOS-CODE-34B(0-shot) | Execution Accuracy | 80.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | OpenMath-CodeLlama-70B (w/ code) | Accuracy (%) | 95.7 | | Unverified |
| 2 | MsAT-DeductReasoner | Accuracy (%) | 94.3 | | Unverified |
| 3 | ATHENA (roberta-large) | Accuracy (%) | 93 | | Unverified |
| 4 | Exp-Tree | Accuracy (%) | 92.3 | | Unverified |
| 5 | Multi-view | Accuracy (%) | 92.3 | | Unverified |
| 6 | ATHENA (roberta-base) | Accuracy (%) | 92.2 | | Unverified |
| 7 | Roberta-DeductReasoner | Accuracy (%) | 92 | | Unverified |
| 8 | DeBERTa (PM + VM) | Accuracy (%) | 91 | | Unverified |
| 9 | EPT | Accuracy (%) | 88.7 | | Unverified |
| 10 | Graph2Tree with RoBERTa | Accuracy (%) | 88.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 (Teaching-Inspired) | Accuracy (5-fold) | 94.3 | | Unverified |
| 2 | ATHENA (roberta-large) | Accuracy (training-test) | 86.5 | | Unverified |
| 3 | Multi-view* (ours) | Accuracy (5-fold) | 85.2 | | Unverified |
| 4 | ATHENA (roberta-base) | Accuracy (training-test) | 84.4 | | Unverified |
| 5 | Generate and Rank | Accuracy (5-fold) | 84.3 | | Unverified |
| 6 | Exp-Tree | Accuracy (5-fold) | 84.1 | | Unverified |
| 7 | REAL2: Memory-augmented Solver | Accuracy (5-fold) | 83.18 | | Unverified |
| 8 | Roberta-DeductReasoner | Accuracy (5-fold) | 83 | | Unverified |
| 9 | MWP-BERT | Accuracy (5-fold) | 82.4 | | Unverified |
| 10 | Recall and Learn | Accuracy (5-fold) | 80.8 | | Unverified |