Math Word Problem Solving

A math word problem is a mathematical exercise (such as in a textbook, worksheet, or exam) in which significant background information is presented in ordinary language rather than in mathematical notation. Because most word problems involve a narrative of some sort, they are sometimes called story problems. They vary in the amount of technical language used.
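
To make the definition concrete, here is a minimal sketch of a story problem next to the arithmetic it encodes. The problem text, the `solve` helper, and its parameter names are hypothetical illustrations and are not taken from any dataset or paper listed on this page.

```python
# Hypothetical story problem (not drawn from any benchmark on this page).
problem = (
    "Maya has 3 boxes with 12 pencils in each box. "
    "She gives 8 pencils to a friend. How many pencils does she have left?"
)

def solve(boxes: int, per_box: int, given_away: int) -> int:
    """Translate the narrative into notation: boxes * per_box - given_away."""
    return boxes * per_box - given_away

# The quantities are read off the narrative by hand in this sketch.
answer = solve(boxes=3, per_box=12, given_away=8)
print(answer)  # 28
```

A solver has to recover both the quantities (3, 12, 8) and the operations linking them from the ordinary-language text. Here that step is done by hand.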

Papers


Showing 51–100 of 107 papers

Title	Date	Tasks	Status	Hype
RetICL: Sequential Retrieval of In-Context Examples with Reinforcement Learning	May 23, 2023	In-Context Learning, Language Modelling	Code Available	1
Automatic Model Selection with Large Language Models for Reasoning	May 23, 2023	Arithmetic Reasoning, GSM8K	Code Available	1
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation	May 22, 2023	Knowledge Tracing, Math	Unverified	0
Progressive-Hint Prompting Improves Reasoning in Large Language Models	Apr 19, 2023	Arithmetic Reasoning, GSM8K	Code Available	2
Sparks of Artificial General Intelligence: Early experiments with GPT-4	Mar 22, 2023	Arithmetic Reasoning, Mathematical Reasoning	Code Available	6
LLaMA: Open and Efficient Foundation Language Models	Feb 27, 2023	Arithmetic Reasoning, Code Generation	Code Available	7
Automatic Generation of Socratic Subquestions for Teaching Math Word Problems	Nov 23, 2022	Math, Math Word Problem Solving	Code Available	1
PAL: Program-aided Language Models	Nov 18, 2022	Arithmetic Reasoning, GSM8K	Code Available	3
Galactica: A Large Language Model for Science	Nov 16, 2022	Anachronisms, Bias Detection	Code Available	4
Multi-View Reasoning: Consistent Contrastive Learning for Math Word Problem	Oct 21, 2022	Contrastive Learning, Math	Code Available	2
ELASTIC: Numerical Reasoning with Adaptive Symbolic Compiler	Oct 18, 2022	Math Word Problem Solving, Question Answering	Code Available	1
Improving Compositional Generalization in Math Word Problem Solving	Sep 3, 2022	Data Augmentation, Math	Code Available	0
Solving Quantitative Reasoning Problems with Language Models	Jun 29, 2022	Arithmetic Reasoning, Language Modeling	Code Available	2
Large Language Models are Zero-Shot Reasoners	May 24, 2022	Arithmetic Reasoning, Common Sense Reasoning	Code Available	2
LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning	May 17, 2022	Math, Math Word Problem Solving	Code Available	0
EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbers	May 1, 2022	Math, Math Word Problem Solving	Code Available	0
Learning to Reason Deductively: Math Word Problem Solving as Complex Relation Extraction	Mar 19, 2022	Math, Math Word Problem Solving	Code Available	1
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving	Jan 16, 2022	Language Modeling, Language Modelling	Unverified	0
Towards Interpretable Math Word Problem Solving with Grounded Linguistic Logic Reasoning	Nov 16, 2021	Math, Math Word Problem Solving	Unverified	0
An Edge-Enhanced Hierarchical Graph-to-Tree Network for Math Word Problem Solving	Nov 1, 2021	Decoder, Math	Code Available	0
IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning	Oct 25, 2021	Arithmetic Reasoning, Mathematical Question Answering	Code Available	1
Recall and Learn: A Memory-augmented Solver for Math Word Problems	Sep 27, 2021	Math, Math Word Problem Solving	Code Available	1
Adversarial Examples for Evaluating Math Word Problem Solvers	Sep 13, 2021	Adversarial Robustness, Math	Code Available	0
Generate & Rank: A Multi-task Framework for Math Word Problems	Sep 7, 2021	Language Modeling, Language Modelling	Unverified	0
MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers	Sep 2, 2021	Math, Math Word Problem Solving	Code Available	1
Math Word Problem Solving with Explicit Numerical Values	Aug 1, 2021	Math, Math Word Problem Solving	Code Available	1
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving	Jul 28, 2021	Common Sense Reasoning, Language Modeling	Code Available	1
Are NLP Models really able to Solve Simple Math Word Problems?	Mar 12, 2021	Math, Math Word Problem Solving	Code Available	1
Measuring Mathematical Problem Solving With the MATH Dataset	Mar 5, 2021	Math, Mathematical Problem-Solving	Code Available	2
Generating Equation by Utilizing Operators : GEO model	Dec 1, 2020	Decoder, Machine Translation	Unverified	0
A Knowledge-Aware Sequence-to-Tree Network for Math Word Problem Solving	Nov 1, 2020	Common Sense Reasoning, Decoder	Unverified	0
Point to the Expression: Solving Algebraic Word Problems using the Expression-Pointer Transformer Model	Nov 1, 2020	Math Word Problem Solving	Code Available	0
Semantically-Aligned Universal Tree-Structured Solver for Math Word Problems	Oct 14, 2020	Decoder, Math	Code Available	1
Reverse Operation based Data Augmentation for Solving Math Word Problems	Oct 4, 2020	Data Augmentation, Math	Code Available	0
Ape210K: A Large-Scale and Template-Rich Dataset of Math Word Problems	Sep 24, 2020	Diversity, Math	Code Available	1
A Chinese Math Word Problem Solving System Based on Linguistic Theory and Non-statistical Approach	Sep 1, 2020	Math, Math Word Problem Solving	Unverified	0
Graph-to-Tree Learning for Solving Math Word Problems	Jul 1, 2020	Decoder, Math	Code Available	1
DeBERTa: Decoding-enhanced BERT with Disentangled Attention	Jun 5, 2020	Common Sense Reasoning, Coreference Resolution	Code Available	2
Graph-to-Tree Neural Networks for Learning Structured Input-Output Translation with Applications to Semantic Parsing and Math Word Problem	Apr 7, 2020	Decoder, Machine Translation	Code Available	1
A Goal-Driven Tree-Structured Neural Model for Math Word Problems	Aug 10, 2019	Math, Math Word Problem Solving	Code Available	0
Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions	Jul 1, 2019	Deep Learning, Math	Code Available	0
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms	May 30, 2019	Math, Math Word Problem Solving	Unverified	0
Analysing Mathematical Reasoning Abilities of Neural Models	Apr 2, 2019	Mathematical Question Answering, Mathematical Reasoning	Code Available	0
An Improved Coarse-to-Fine Method for Solving Generation Tasks	Apr 1, 2019	Math, Math Word Problem Solving	Unverified	0
Translating a Math Word Problem to an Expression Tree	Nov 14, 2018	Math, Math Word Problem Solving	Code Available	0
Semantically-Aligned Equation Generation for Solving and Reasoning Math Word Problems	Nov 2, 2018	Decoder, Math	Code Available	0
Translating a Math Word Problem to a Expression Tree	Oct 1, 2018	Machine Translation, Math	Unverified	0
Neural Math Word Problem Solver with Reinforcement Learning	Aug 1, 2018	Feature Engineering, Math	Unverified	0
Using Intermediate Representations to Solve Math Word Problems	Jul 1, 2018	Math, Math Word Problem Solving	Unverified	0
Deep Neural Solver for Math Word Problems	Sep 1, 2017	Feature Engineering, Machine Translation	Unverified	0


Datasets: MATH, SVAMP, MAWPS, Math23K, ALG514, ASDiv-A, ParaMAWPS, DRAW-1K, MathQA, SVAMP (1:N), GSM-Plus, MATH minival

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Gemini 2.0 Flash Experimental	Accuracy	89.7	—	Unverified
2	Qwen2.5-Math-72B-Instruct(TIR,Greedy)	Accuracy	88.1	—	Unverified
3	GPT-4 Turbo (MACM, w/code, voting)	Accuracy	87.92	—	Unverified
4	Qwen2.5-Math-72B-Instruct(COT,Greedy)	Accuracy	85.9	—	Unverified
5	Qwen2.5-Math-7B-Instruct(TIR,Greedy)	Accuracy	85.2	—	Unverified
6	GPT-4-code model (CSV, w/ code, SC, k=16)	Accuracy	84.3	—	Unverified
7	Qwen2-Math-72B-Instruct(greedy)	Accuracy	84	—	Unverified
8	Qwen2.5-Math-7B-Instruct(COT,Greedy)	Accuracy	83.6	—	Unverified
9	Qwen2.5-Math-1.5B-Instruct(TIR,Greedy)	Accuracy	79.9	—	Unverified
10	OpenMath2-Llama3.1-70B (majority@256)	Accuracy	79.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 DUP	Accuracy	94.2	—	Unverified
2	GPT-4 (Teaching-Inspired)	Execution Accuracy	93.9	—	Unverified
3	GPT-4 (Model Selection)	Execution Accuracy	93.7	—	Unverified
4	Qwen2(CoT + Code Interpreter)	Execution Accuracy	92.3	—	Unverified
5	GPT-4 (PHP)	Execution Accuracy	91.9	—	Unverified
6	OpenMath-CodeLlama-70B (w/ code)	Execution Accuracy	87.8	—	Unverified
7	MathCoder-L-70B	Execution Accuracy	84.9	—	Unverified
8	PoT_Eng (self-consistency @ 5)	Execution Accuracy	83.7	—	Unverified
9	CoT_Eng (self-consistency @ 5)	Execution Accuracy	82.5	—	Unverified
10	MMOS-CODE-34B(0-shot)	Execution Accuracy	80.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OpenMath-CodeLlama-70B (w/ code)	Accuracy (%)	95.7	—	Unverified
2	MsAT-DeductReasoner	Accuracy (%)	94.3	—	Unverified
3	ATHENA (roberta-large)	Accuracy (%)	93	—	Unverified
4	Exp-Tree	Accuracy (%)	92.3	—	Unverified
5	Multi-view	Accuracy (%)	92.3	—	Unverified
6	ATHENA (roberta-base)	Accuracy (%)	92.2	—	Unverified
7	Roberta-DeductReasoner	Accuracy (%)	92	—	Unverified
8	DeBERTa (PM + VM)	Accuracy (%)	91	—	Unverified
9	EPT	Accuracy (%)	88.7	—	Unverified
10	Graph2Tree with RoBERTa	Accuracy (%)	88.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 (Teaching-Inspired)	Accuracy (5-fold)	94.3	—	Unverified
2	ATHENA (roberta-large)	Accuracy (training-test)	86.5	—	Unverified
3	Multi-view* (ours)	Accuracy (5-fold)	85.2	—	Unverified
4	ATHENA (roberta-base)	Accuracy (training-test)	84.4	—	Unverified
5	Generate and Rank	Accuracy (5-fold)	84.3	—	Unverified
6	Exp-Tree	Accuracy (5-fold)	84.1	—	Unverified
7	REAL2: Memory-augmented Solver	Accuracy (5-fold)	83.18	—	Unverified
8	Roberta-DeductReasoner	Accuracy (5-fold)	83	—	Unverified
9	MWP-BERT	Accuracy (5-fold)	82.4	—	Unverified
10	Recall and Learn	Accuracy (5-fold)	80.8	—	Unverified
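
The tables above report accuracy-style metrics. As a rough sketch of how such a score can be computed, the snippet below assumes answers are compared numerically within a small tolerance. The tolerance, the `answer_accuracy` name, and the sample values are illustrative assumptions, not details of any listed benchmark, each of which defines its own evaluation protocol.

```python
import math

def answer_accuracy(predicted, gold, rel_tol=1e-4):
    """Percentage of predictions matching the gold answer within a tolerance."""
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        raise ValueError("need at least one problem")
    correct = sum(
        math.isclose(p, g, rel_tol=rel_tol, abs_tol=1e-9)
        for p, g in zip(predicted, gold)
    )
    return 100.0 * correct / len(gold)

# Made-up predictions and reference answers for four problems.
predicted = [28.0, 7.5, 100.0, 3.0]
gold = [28.0, 7.5, 99.0, 3.0]
print(answer_accuracy(predicted, gold))  # 75.0
```

For a system that emits a program or expression rather than a number, the same comparison would be applied to the value obtained by executing that output, which is presumably what the "Execution Accuracy" rows measure.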