Math Word Problem Solving

A math word problem is a mathematical exercise (such as in a textbook, worksheet, or exam) where significant background information on the problem is presented in ordinary language rather than in mathematical notation. As most word problems involve a narrative of some sort, they are sometimes referred to as story problems and may vary in the amount of technical language used.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 107 papers

Title	Date	Tasks	Status	Hype	Score
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations	Dec 14, 2023	Arithmetic ReasoningGSM8K	CodeCode Available	1	5
Math Word Problem Solving with Explicit Numerical Values	Aug 1, 2021	MathMath Word Problem Solving	CodeCode Available	1	5
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving	Jul 28, 2021	Common Sense ReasoningLanguage Modeling	CodeCode Available	1	5
Automatic Model Selection with Large Language Models for Reasoning	May 23, 2023	Arithmetic ReasoningGSM8K	CodeCode Available	1	5
MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers	Sep 2, 2021	MathMath Word Problem Solving	CodeCode Available	1	5
Semantically-Aligned Universal Tree-Structured Solver for Math Word Problems	Oct 14, 2020	DecoderMath	CodeCode Available	1	5
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems	Apr 23, 2024	Arithmetic ReasoningGSM8K	CodeCode Available	1	5
LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning	May 17, 2022	MathMath Word Problem Solving	CodeCode Available	0	5
Reverse Operation based Data Augmentation for Solving Math Word Problems	Oct 4, 2020	Data AugmentationMath	CodeCode Available	0	5
Does ChatGPT Comprehend the Place Value in Numbers When Solving Math Word Problems?	Jun 3, 2023	MathMath Word Problem Solving	CodeCode Available	0	5
A Goal-Driven Tree-Structured Neural Model for Math Word Problems	Aug 10, 2019	MathMath Word Problem Solving	CodeCode Available	0	5
Adversarial Examples for Evaluating Math Word Problem Solvers	Sep 13, 2021	Adversarial RobustnessMath	CodeCode Available	0	5
Math Word Problem Solving by Generating Linguistic Variants of Problem Statements	Jun 24, 2023	DecoderIngenuity	CodeCode Available	0	5
Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models	Oct 10, 2024	Arithmetic ReasoningMath	CodeCode Available	0	5
Improving Compositional Generalization in Math Word Problem Solving	Sep 3, 2022	Data AugmentationMath	CodeCode Available	0	5
An Edge-Enhanced Hierarchical Graph-to-Tree Network for Math Word Problem Solving	Nov 1, 2021	DecoderMath	CodeCode Available	0	5
MAWPS: A Math Word Problem Repository	Jun 1, 2016	MathMath Word Problem Solving	CodeCode Available	0	5
Analysing Mathematical Reasoning Abilities of Neural Models	Apr 2, 2019	Mathematical Question AnsweringMathematical Reasoning	CodeCode Available	0	5
A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving	Jan 7, 2025	DiversityKnowledge Distillation	CodeCode Available	0	5
VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency	Nov 13, 2023	MathMathematical Reasoning	CodeCode Available	0	5
SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation	Oct 17, 2024	GSM8KLanguage Modeling	CodeCode Available	0	5
Translating a Math Word Problem to an Expression Tree	Nov 14, 2018	MathMath Word Problem Solving	CodeCode Available	0	5
ATHENA: Mathematical Reasoning with Thought Expansion	Nov 2, 2023	MathMathematical Reasoning	CodeCode Available	0	5
Point to the Expression: Solving Algebraic Word Problems using the Expression-Pointer Transformer Model	Nov 1, 2020	Math Word Problem Solving	CodeCode Available	0	5
Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions	Jul 1, 2019	Deep LearningMath	CodeCode Available	0	5
Semantically-Aligned Equation Generation for Solving and Reasoning Math Word Problems	Nov 2, 2018	DecoderMath	CodeCode Available	0	5
Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning	Dec 9, 2023	Arithmetic ReasoningMathematical Reasoning	CodeCode Available	0	5
EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbers	May 1, 2022	MathMath Word Problem Solving	CodeCode Available	0	5
Learn to Solve Algebra Word Problems Using Quadratic Programming	Sep 1, 2015	Math Word Problem Solving	—Unverified	0	0
A Chinese Math Word Problem Solving System Based on Linguistic Theory and Non-statistical Approach	Sep 1, 2020	MathMath Word Problem Solving	—Unverified	0	0
A Knowledge-Aware Sequence-to-Tree Network for Math Word Problem Solving	Nov 1, 2020	Common Sense ReasoningDecoder	—Unverified	0	0
An Improved Coarse-to-Fine Method for Solving Generation Tasks	Apr 1, 2019	MathMath Word Problem Solving	—Unverified	0	0
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM	Mar 12, 2024	Arithmetic ReasoningCode Generation	—Unverified	0	0
CMATH: Can Your Language Model Pass Chinese Elementary School Math Test?	Jun 29, 2023	Language ModelingLanguage Modelling	—Unverified	0	0
Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving	Apr 5, 2024	Data AugmentationIn-Context Learning	—Unverified	0	0
Deep Neural Solver for Math Word Problems	Sep 1, 2017	Feature EngineeringMachine Translation	—Unverified	0	0
Generate & Rank: A Multi-task Framework for Math Word Problems	Sep 7, 2021	Language ModelingLanguage Modelling	—Unverified	0	0
Generating Equation by Utilizing Operators : GEO model	Dec 1, 2020	DecoderMachine Translation	—Unverified	0	0
How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation	Aug 1, 2016	Community Question AnsweringMath	—Unverified	0	0
Illinois Math Solver: Math Reasoning on the Web	Jun 1, 2016	MathMath Word Problem Solving	—Unverified	0	0
Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning	Mar 4, 2024	GSM8KMath	—Unverified	0	0
Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval	Nov 25, 2024	MathMath Word Problem Solving	—Unverified	0	0
Learning Fine-Grained Expressions to Solve Math Word Problems	Sep 1, 2017	MathMath Word Problem Solving	—Unverified	0	0
Learning from Explicit and Implicit Supervision Jointly For Algebra Word Problems	Nov 1, 2016	Math Word Problem Solving	—Unverified	0	0
Learning to Automatically Solve Algebra Word Problems	Jun 1, 2014	Math Word Problem Solving	—Unverified	0	0
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation	May 22, 2023	Knowledge TracingMath	—Unverified	0	0
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms	May 30, 2019	MathMath Word Problem Solving	—Unverified	0	0
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving	Jan 16, 2022	Language ModelingLanguage Modelling	—Unverified	0	0
Neural Math Word Problem Solver with Reinforcement Learning	Aug 1, 2018	Feature EngineeringMath	—Unverified	0	0
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data	Sep 20, 2023	Arithmetic ReasoningCode Generation	—Unverified	0	0

Show:10 25 50

← PrevPage 2 of 3Next →

All datasets MATH SVAMP MAWPS Math23K ALG514 ASDiv-A ParaMAWPS DRAW-1K MathQA SVAMP (1:N)GSM-Plus MATH minival

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Gemini 2.0 Flash Experimental	Accuracy	89.7	—	Unverified
2	Qwen2.5-Math-72B-Instruct(TIR,Greedy)	Accuracy	88.1	—	Unverified
3	GPT-4 Turbo (MACM, w/code, voting)	Accuracy	87.92	—	Unverified
4	Qwen2.5-Math-72B-Instruct(COT,Greedy)	Accuracy	85.9	—	Unverified
5	Qwen2.5-Math-7B-Instruct(TIR,Greedy)	Accuracy	85.2	—	Unverified
6	GPT-4-code model (CSV, w/ code, SC, k=16)	Accuracy	84.3	—	Unverified
7	Qwen2-Math-72B-Instruct(greedy)	Accuracy	84	—	Unverified
8	Qwen2.5-Math-7B-Instruct(COT,Greedy)	Accuracy	83.6	—	Unverified
9	Qwen2.5-Math-1.5B-Instruct(TIR,Greedy)	Accuracy	79.9	—	Unverified
10	OpenMath2-Llama3.1-70B (majority@256)	Accuracy	79.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 DUP	Accuracy	94.2	—	Unverified
2	GPT-4 (Teaching-Inspired)	Execution Accuracy	93.9	—	Unverified
3	GPT-4 (Model Selection)	Execution Accuracy	93.7	—	Unverified
4	Qwen2(CoT + Code Interpreter)	Execution Accuracy	92.3	—	Unverified
5	GPT-4 (PHP)	Execution Accuracy	91.9	—	Unverified
6	OpenMath-CodeLlama-70B (w/ code)	Execution Accuracy	87.8	—	Unverified
7	MathCoder-L-70B	Execution Accuracy	84.9	—	Unverified
8	PoT_Eng (self-consistency @ 5)	Execution Accuracy	83.7	—	Unverified
9	CoT_Eng (self-consistency @ 5)	Execution Accuracy	82.5	—	Unverified
10	MMOS-CODE-34B(0-shot)	Execution Accuracy	80.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OpenMath-CodeLlama-70B (w/ code)	Accuracy (%)	95.7	—	Unverified
2	MsAT-DeductReasoner	Accuracy (%)	94.3	—	Unverified
3	ATHENA (roberta-large)	Accuracy (%)	93	—	Unverified
4	Exp-Tree	Accuracy (%)	92.3	—	Unverified
5	Multi-view	Accuracy (%)	92.3	—	Unverified
6	ATHENA (roberta-base)	Accuracy (%)	92.2	—	Unverified
7	Roberta-DeductReasoner	Accuracy (%)	92	—	Unverified
8	DeBERTa (PM + VM)	Accuracy (%)	91	—	Unverified
9	EPT	Accuracy (%)	88.7	—	Unverified
10	Graph2Tree with RoBERTa	Accuracy (%)	88.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 (Teaching-Inspired)	Accuracy (5-fold)	94.3	—	Unverified
2	ATHENA (roberta-large)	Accuracy (training-test)	86.5	—	Unverified
3	Multi-view* (ours)	Accuracy (5-fold)	85.2	—	Unverified
4	ATHENA (roberta-base)	Accuracy (training-test)	84.4	—	Unverified
5	Generate and Rank	Accuracy (5-fold)	84.3	—	Unverified
6	Exp-Tree	Accuracy (5-fold)	84.1	—	Unverified
7	REAL2: Memory-augmented Solver	Accuracy (5-fold)	83.18	—	Unverified
8	Roberta-DeductReasoner	Accuracy (5-fold)	83	—	Unverified
9	MWP-BERT	Accuracy (5-fold)	82.4	—	Unverified
10	Recall and Learn	Accuracy (5-fold)	80.8	—	Unverified