SOTAVerified

Math Word Problem Solving

A math word problem is a mathematical exercise (such as in a textbook, worksheet, or exam) where significant background information on the problem is presented in ordinary language rather than in mathematical notation. As most word problems involve a narrative of some sort, they are sometimes referred to as story problems and may vary in the amount of technical language used.

Papers

Showing 51100 of 107 papers

TitleStatusHype
Augmenting Math Word Problems via Iterative Question ComposingCode1
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human AnnotationsCode1
Math Word Problem Solving with Explicit Numerical ValuesCode1
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem SolvingCode1
Automatic Generation of Socratic Subquestions for Teaching Math Word ProblemsCode1
MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem SolversCode1
Automatic Model Selection with Large Language Models for ReasoningCode1
LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced LearningCode0
Point to the Expression: Solving Algebraic Word Problems using the Expression-Pointer Transformer ModelCode0
Does ChatGPT Comprehend the Place Value in Numbers When Solving Math Word Problems?Code0
Adversarial Examples for Evaluating Math Word Problem SolversCode0
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLMCode0
Math Word Problem Solving by Generating Linguistic Variants of Problem StatementsCode0
OpenChat: Advancing Open-source Language Models with Mixed-Quality DataCode0
MAWPS: A Math Word Problem RepositoryCode0
VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit ConsistencyCode0
Translating a Math Word Problem to an Expression TreeCode0
A Goal-Driven Tree-Structured Neural Model for Math Word ProblemsCode0
Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language ModelsCode0
Improving Compositional Generalization in Math Word Problem SolvingCode0
An Edge-Enhanced Hierarchical Graph-to-Tree Network for Math Word Problem SolvingCode0
A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem SolvingCode0
Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head AttentionsCode0
Semantically-Aligned Equation Generation for Solving and Reasoning Math Word ProblemsCode0
SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented GenerationCode0
Reverse Operation based Data Augmentation for Solving Math Word ProblemsCode0
ATHENA: Mathematical Reasoning with Thought ExpansionCode0
Analysing Mathematical Reasoning Abilities of Neural ModelsCode0
Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic ReasoningCode0
EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbersCode0
Learn to Solve Algebra Word Problems Using Quadratic Programming0
A Chinese Math Word Problem Solving System Based on Linguistic Theory and Non-statistical Approach0
A Knowledge-Aware Sequence-to-Tree Network for Math Word Problem Solving0
An Improved Coarse-to-Fine Method for Solving Generation Tasks0
CMATH: Can Your Language Model Pass Chinese Elementary School Math Test?0
Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving0
Deep Neural Solver for Math Word Problems0
Generate & Rank: A Multi-task Framework for Math Word Problems0
Generating Equation by Utilizing Operators : GEO model0
How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation0
Illinois Math Solver: Math Reasoning on the Web0
Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning0
Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval0
Learning Fine-Grained Expressions to Solve Math Word Problems0
Learning from Explicit and Implicit Supervision Jointly For Algebra Word Problems0
Learning to Automatically Solve Algebra Word Problems0
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation0
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms0
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving0
Neural Math Word Problem Solver with Reinforcement Learning0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Gemini 2.0 Flash ExperimentalAccuracy89.7Unverified
2Qwen2.5-Math-72B-Instruct(TIR,Greedy)Accuracy88.1Unverified
3GPT-4 Turbo (MACM, w/code, voting)Accuracy87.92Unverified
4Qwen2.5-Math-72B-Instruct(COT,Greedy)Accuracy85.9Unverified
5Qwen2.5-Math-7B-Instruct(TIR,Greedy)Accuracy85.2Unverified
6GPT-4-code model (CSV, w/ code, SC, k=16)Accuracy84.3Unverified
7Qwen2-Math-72B-Instruct(greedy)Accuracy84Unverified
8Qwen2.5-Math-7B-Instruct(COT,Greedy)Accuracy83.6Unverified
9Qwen2.5-Math-1.5B-Instruct(TIR,Greedy)Accuracy79.9Unverified
10OpenMath2-Llama3.1-70B (majority@256)Accuracy79.6Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4 DUPAccuracy94.2Unverified
2GPT-4 (Teaching-Inspired)Execution Accuracy93.9Unverified
3GPT-4 (Model Selection)Execution Accuracy93.7Unverified
4Qwen2(CoT + Code Interpreter)Execution Accuracy92.3Unverified
5GPT-4 (PHP)Execution Accuracy91.9Unverified
6OpenMath-CodeLlama-70B (w/ code)Execution Accuracy87.8Unverified
7MathCoder-L-70BExecution Accuracy84.9Unverified
8PoT_Eng (self-consistency @ 5)Execution Accuracy83.7Unverified
9CoT_Eng (self-consistency @ 5)Execution Accuracy82.5Unverified
10MMOS-CODE-34B(0-shot)Execution Accuracy80.6Unverified
#ModelMetricClaimedVerifiedStatus
1OpenMath-CodeLlama-70B (w/ code)Accuracy (%)95.7Unverified
2MsAT-DeductReasonerAccuracy (%)94.3Unverified
3ATHENA (roberta-large)Accuracy (%)93Unverified
4Exp-TreeAccuracy (%)92.3Unverified
5Multi-viewAccuracy (%)92.3Unverified
6ATHENA (roberta-base)Accuracy (%)92.2Unverified
7Roberta-DeductReasonerAccuracy (%)92Unverified
8DeBERTa (PM + VM)Accuracy (%)91Unverified
9EPTAccuracy (%)88.7Unverified
10Graph2Tree with RoBERTaAccuracy (%)88.7Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4 (Teaching-Inspired)Accuracy (5-fold)94.3Unverified
2ATHENA (roberta-large)Accuracy (training-test)86.5Unverified
3Multi-view* (ours)Accuracy (5-fold)85.2Unverified
4ATHENA (roberta-base)Accuracy (training-test)84.4Unverified
5Generate and RankAccuracy (5-fold)84.3Unverified
6Exp-TreeAccuracy (5-fold)84.1Unverified
7REAL2: Memory-augmented SolverAccuracy (5-fold)83.18Unverified
8Roberta-DeductReasonerAccuracy (5-fold)83Unverified
9MWP-BERTAccuracy (5-fold)82.4Unverified
10Recall and LearnAccuracy (5-fold)80.8Unverified